Thomas Fang Zheng

Orcid: 0000-0002-0249-4767

According to our database1, Thomas Fang Zheng authored at least 127 papers between 2000 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Random Cycle Loss and Its Application to Voice Conversion.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Enhancing Quantised End-to-End ASR Models via Personalisation.
CoRR, 2023

DialoguePCN: Perception and Cognition Network for Emotion Recognition in Conversations.
IEEE Access, 2023

CN-CVS: A Mandarin Audio-Visual Dataset for Large Vocabulary Continuous Visual to Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
CN-Celeb: Multi-genre speaker recognition.
Speech Commun., 2022

A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

2021
When Automatic Voice Disguise Meets Automatic Speaker Verification.
IEEE Trans. Inf. Forensics Secur., 2021

Cross-Database Replay Detection in Terminal-Dependent Speaker Verification.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Attack on Practical Speaker Verification System Using Universal Adversarial Perturbations.
Proceedings of the IEEE International Conference on Acoustics, 2021

Squeezing Value of Cross-Domain Labels: A Decoupled Scoring Approach for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Deep generative factorization for speech signal.
CoRR, 2020

Neural Discriminant Analysis for Deep Speaker Embedding.
Proceedings of the Interspeech 2020, 2020

Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning.
Proceedings of the Interspeech 2020, 2020

ASR-Free Pronunciation Assessment.
Proceedings of the Interspeech 2020, 2020

2019
Noise Robust Speaker Recognition Based on Adaptive Frame Weighting in GMM for i-Vector Extraction.
IEEE Access, 2019

Replay detection using CQT-based modified group delay feature and ResNeWt network in ASVspoof 2019.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Imbalance Learning-based Framework for Fear Recognition in the MediaEval Emotional Impact of Movies Task.
Proceedings of the Interspeech 2018, 2018

Deep Factorization for Speech Signal.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Full-Info Training for Deep Speaker Feature Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

RACORN-K: Risk-Aversion Pattern Matching-based Portfolio Selection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

MMANN: Multimodal Multilevel Attention Neural Network for Horror Clip Detection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Multilingual Stemming and Term extraction for Uyghur, Kazak and Kirghiz.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Distributed representation learning for knowledge graphs with entity descriptions.
Pattern Recognit. Lett., 2017

Enhanced Neural Machine Translation by Learning from Draft.
CoRR, 2017

M2ASR: Ambitions and first year progress.
Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017

A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification.
Proceedings of the Interspeech 2017, 2017

Speaker segmentation using deep speaker vectors for fast speaker change scenarios.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Pseudo-pitch-synchronized phase information extraction and its application for robust speaker recognition.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

Language resource construction for Mongolian.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

A free Kazakh speech database and a speech recognition baseline.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Enhanced neural machine translation by learning from draft.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Free linguistic and speech resources for Tibetan.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Cross-lingual speaker verification with deep feature learning.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

A multilingual language processing tool for Uyghur, Kazak and Kirghiz.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Deep speaker verification: Do we need end to end?
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Improving speaker verification performance against long-term speaker variability.
Speech Commun., 2016

System Combination for Short Utterance Speaker Recognition.
CoRR, 2016

Probabilistic Belief Embedding for Large-Scale Knowledge Population.
Cogn. Comput., 2016

Learning Embedding Representations for Knowledge Inference on Imperfect and Incomplete Repositories.
Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016

Binary speaker embedding.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Max-margin metric learning for speaker recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Feature transformation for speaker verification under speaking rate mismatch condition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Relative entropy normalized Gaussian supervector for speech emotion recognition using kernel extreme learning machine.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

System combination for short utterance speaker recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Decision making based on cohort scores for speaker verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Detection and reconstruction of clipped speech for speaker recognition.
Speech Commun., 2015

Statistical word sense aware topic models.
Soft Comput., 2015

Document Representation with Statistical Word Senses in Cross-Lingual Document Clustering.
Int. J. Pattern Recognit. Artif. Intell., 2015

Noisy training for deep neural networks in speech recognition.
EURASIP J. Audio Speech Music. Process., 2015

Deep Speaker Vectors for Semi Text-independent Speaker Verification.
CoRR, 2015

Parallel Knowledge Embedding with MapReduce on a Multi-core Processor.
CoRR, 2015

Probabilistic Belief Embedding for Knowledge Base Completion.
CoRR, 2015

Large Margin Nearest Neighbor Embedding for Knowledge Representation.
Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2015

Distant Supervision for Entity Linking.
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015

An open/free database and Benchmark for Uyghur speaker recognition.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Gender-dependent feature extraction for speaker recognition.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Cross-lingual speaker verification based on linear transform.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Transfer learning for speech and language processing.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Errata: Distant Supervision for Relation Extraction with Matrix Completion.
CoRR, 2014

Transition-based Knowledge Graph Embedding with Relational Mapping Properties.
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014

Research on generalization property of time-varying Fbank-weighted MFCC for i-vector based speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Research on truncated speech in speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Using Word Sense as a Latent Variable in LDA Can Improve Topic Modeling.
Proceedings of the ICAART 2014, 2014

Topic Models Incorporating Statistical Word Senses.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

Mining the Personal Interests of Microbloggers via Exploiting Wikipedia Knowledge.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

Block-wise training for i-vector.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

An overview of robustness related issues in speaker recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Discriminative scoring for speaker recognition based on I-vectors.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Distant Supervision for Relation Extraction with Matrix Completion.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Online Non-Negative Convolutive Pattern Learning for Speech Signals.
IEEE Trans. Signal Process., 2013

Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Ranking Search Intents Underlying a Query.
Proceedings of the Natural Language Processing and Information Systems, 2013

Sequential model adaptation for speaker verification.
Proceedings of the INTERSPEECH 2013, 2013

A Fishervoice based feature fusion method for short utterance speaker recognition.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Sequential UBM adaptation for speaker verification.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Emotional speaker verification with linear adaptation.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Emotional adaptive training for speaker verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Speech unit category based short utterance speaker recognition.
Comput. Sci. Inf. Syst., 2012

Content-Based Semantic Tag Ranking for Recommendation.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012

Text-Dependent Speaker Recognition with long-term features based on functional data analysis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A K-phoneme-class based multi-model method for short utterance speaker recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An investigation into better frequency warping for time-varying speaker recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Reliable accent specific unit generation with dynamic Gaussian mixture selection for multi-accent speech recognition.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

2010
Using MMSE to improve session variability estimation.
Int. J. Biom., 2010

Using cepstral and prosodic features for Chinese accent identification.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Using phoneme recognition and text-dependent speaker verification to improve speaker segmentation for Chinese speech.
Proceedings of the INTERSPEECH 2010, 2010

2009
Effectiveness of n-gram fast match for query-by-humming systems.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A phrase-level piecewise linear scaling algorithm for melody match in Query-by-Humming systems.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

2008
State-dependent phoneme-based model merging for dialectal Chinese speech recognition.
Speech Commun., 2008

Local Mismatch Phone for Confidence Measure in Standard and Accented Chinese Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Job Information Retrieval Based on Document Similarity.
Proceedings of the Information Retrieval Technology, 2008

2007
A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification.
IEEE Trans. Speech Audio Process., 2007

Using a small development set to build a robust dialectal Chinese speech recognizer.
Proceedings of the INTERSPEECH 2007, 2007

Emotion attribute projection for speaker recognition on emotional speech.
Proceedings of the INTERSPEECH 2007, 2007

Session Variability Subspace Projection Based Model Compensation for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
A tree-based kernel selection approach to efficient Gaussian mixture model-universal background model based speaker identification.
Speech Commun., 2006

CCC Speaker Recognition Evaluation 2006: Overview, Methods, Data, Results and Perspective.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Pitch Mean Based Frequency Warping.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Study on speaker verification on emotional speech.
Proceedings of the INTERSPEECH 2006, 2006

Automatic initial/final generation for dialectal Chinese speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

Cohort-Based Speaker Model Synthesis for Channel Robust Speaker Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
The dynamically-adjustable histogram pruning method for embedded voice dialing.
Proceedings of the Signal and Image Processing (SIP 2005), 2005

Rapidly developing spoken Chinese dialogue systems with the d-ear SDS SDK.
Proceedings of the INTERSPEECH 2005, 2005

Real-time pitch tracking based on combined SMDSF.
Proceedings of the INTERSPEECH 2005, 2005

Modeling high-level information by using Gaussian mixture correlation for GMM-UBM based speaker recognition.
Proceedings of the INTERSPEECH 2005, 2005

The predictive differential amplitude spectrum for robust speaker recognition in stationary noises.
Proceedings of the INTERSPEECH 2005, 2005

Combining Selection Tree with Observation Reordering Pruning for Efficient Speaker Identification Using GMM-UBM.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Making Full Use of Chinese Speech Corpora.
J. Chin. Lang. Comput., 2004

A two-step keyword spotting method based on context-dependent a posteriori probability.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Weighting observation vectors for robust speech recognition in noisy environments.
Proceedings of the INTERSPEECH 2004, 2004

2002
Mandarin Pronunciation Modeling Based on CASS Corpus.
J. Comput. Sci. Technol., 2002

Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Comparison of Different Implementations of MFCC.
J. Comput. Sci. Technol., 2001

Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modeling.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A two-layer lexical tree based beam search in continuous Chinese speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Improved context-dependent acoustic modeling for continuous Chinese speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Robust parsing in spoken dialogue systems.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

An online incremental language model adaptation method.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

An MCE based classification tree using hierarchical feature-weighting in speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Design of a semantic parser with support to ellipsis resolution in a Chinese spoken language dialogue system.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A theme structure method for the ellipsis resolution.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Automatic generation of pronunciation lexicons for Mandarin spontaneous speech.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Improving the Syllable-Synchronous Network Search Algorithm for Word Decoding in Continuous Chinese Speech Recognition.
J. Comput. Sci. Technol., 2000


  Loading...