Thomas Fang Zheng

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Emotional Atmosphere Soft Label for Emotion Recognition in Conversations.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 31st International Conference, 2024

Advancing Respiratory Sound Classification: Integration of Audio Spectrogram Transformer with ConnectMix and NEFTune Augmentation.

[BibT_eX]

[DOI]

Runze Huang

Proceedings of the Neural Information Processing - 31st International Conference, 2024

Enhancing Quantised End-to-End ASR Models Via Personalisation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Random Cycle Loss and Its Application to Voice Conversion.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

DialoguePCN: Perception and Cognition Network for Emotion Recognition in Conversations.

[BibT_eX]

[DOI]

IEEE Access, 2023

CN-CVS: A Mandarin Audio-Visual Dataset for Large Vocabulary Continuous Visual to Speech Synthesis.

[BibT_eX]

[DOI]

Chen Chen

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

CN-Celeb: Multi-genre speaker recognition.

[BibT_eX]

[DOI]

Speech Commun., 2022

A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

2021

When Automatic Voice Disguise Meets Automatic Speaker Verification.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2021

Cross-Database Replay Detection in Terminal-Dependent Speaker Verification.

[BibT_eX]

[DOI]

Xingliang Cheng

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Attack on Practical Speaker Verification System Using Universal Adversarial Perturbations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Squeezing Value of Cross-Domain Labels: A Decoupled Scoring Approach for Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Deep generative factorization for speech signal.

[BibT_eX]

[DOI]

CoRR, 2020

Neural Discriminant Analysis for Deep Speaker Embedding.

[BibT_eX]

[DOI]

Lantian Li

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

ASR-Free Pronunciation Assessment.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Noise Robust Speaker Recognition Based on Adaptive Frame Weighting in GMM for i-Vector Extraction.

[BibT_eX]

[DOI]

IEEE Access, 2019

Replay detection using CQT-based modified group delay feature and ResNeWt network in ASVspoof 2019.

[BibT_eX]

[DOI]

Xingliang Cheng

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Imbalance Learning-based Framework for Fear Recognition in the MediaEval Emotional Impact of Movies Task.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Deep Factorization for Speech Signal.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Full-Info Training for Deep Speaker Feature Learning.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

RACORN-K: Risk-Aversion Pattern Matching-based Portfolio Selection.

[BibT_eX]

[DOI]

Yang Wang

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

MMANN: Multimodal Multilevel Attention Neural Network for Horror Clip Detection.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Multilingual Stemming and Term extraction for Uyghur, Kazak and Kirghiz.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Distributed representation learning for knowledge graphs with entity descriptions.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2017

M2ASR: Ambitions and first year progress.

[BibT_eX]

[DOI]

Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017

A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Speaker segmentation using deep speaker vectors for fast speaker change scenarios.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Pseudo-pitch-synchronized phase information extraction and its application for robust speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017

Language resource construction for Mongolian.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

A free Kazakh speech database and a speech recognition baseline.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Enhanced neural machine translation by learning from draft.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Free linguistic and speech resources for Tibetan.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Cross-lingual speaker verification with deep feature learning.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

A multilingual language processing tool for Uyghur, Kazak and Kirghiz.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Deep speaker verification: Do we need end to end?

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Improving speaker verification performance against long-term speaker variability.

[BibT_eX]

[DOI]

Speech Commun., 2016

System Combination for Short Utterance Speaker Recognition.

[BibT_eX]

[DOI]

Lantian Li

CoRR, 2016

Probabilistic Belief Embedding for Large-Scale Knowledge Population.

[BibT_eX]

[DOI]

Cogn. Comput., 2016

Learning Embedding Representations for Knowledge Inference on Imperfect and Incomplete Repositories.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016

Binary speaker embedding.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Max-margin metric learning for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Feature transformation for speaker verification under speaking rate mismatch condition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Relative entropy normalized Gaussian supervector for speech emotion recognition using kernel extreme learning machine.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

System combination for short utterance speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Decision making based on cohort scores for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

Detection and reconstruction of clipped speech for speaker recognition.

[BibT_eX]

[DOI]

Speech Commun., 2015

Statistical word sense aware topic models.

[BibT_eX]

[DOI]

Soft Comput., 2015

Document Representation with Statistical Word Senses in Cross-Lingual Document Clustering.

[BibT_eX]

[DOI]

Int. J. Pattern Recognit. Artif. Intell., 2015

Noisy training for deep neural networks in speech recognition.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2015

Deep Speaker Vectors for Semi Text-independent Speaker Verification.

[BibT_eX]

[DOI]

CoRR, 2015

Parallel Knowledge Embedding with MapReduce on a Multi-core Processor.

[BibT_eX]

[DOI]

CoRR, 2015

Probabilistic Belief Embedding for Knowledge Base Completion.

[BibT_eX]

[DOI]

CoRR, 2015

Large Margin Nearest Neighbor Embedding for Knowledge Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2015

Distant Supervision for Entity Linking.

[BibT_eX]

[DOI]

Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation, 2015

An open/free database and Benchmark for Uyghur speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015

Gender-dependent feature extraction for speaker recognition.

[BibT_eX]

[DOI]

Lantian Li

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Cross-lingual speaker verification based on linear transform.

[BibT_eX]

[DOI]

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Transfer learning for speech and language processing.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Errata: Distant Supervision for Relation Extraction with Matrix Completion.

[BibT_eX]

[DOI]

CoRR, 2014

Transition-based Knowledge Graph Embedding with Relational Mapping Properties.

[BibT_eX]

[DOI]

Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014

Clustering tweets usingWikipedia concepts.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Research on generalization property of time-varying Fbank-weighted MFCC for i-vector based speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Research on truncated speech in speaker verification.

[BibT_eX]

[DOI]

Fanhu Bie

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Using Word Sense as a Latent Variable in LDA Can Improve Topic Modeling.

[BibT_eX]

[DOI]

Proceedings of the ICAART 2014, 2014

Topic Models Incorporating Statistical Word Senses.

[BibT_eX]

[DOI]

Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

Mining the Personal Interests of Microbloggers via Exploiting Wikipedia Knowledge.

[BibT_eX]

[DOI]

Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

Block-wise training for i-vector.

[BibT_eX]

[DOI]

Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014

An overview of robustness related issues in speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Discriminative scoring for speaker recognition based on I-vectors.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Distant Supervision for Relation Extraction with Matrix Completion.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013

Online Non-Negative Convolutive Pattern Learning for Speech Signals.

[BibT_eX]

[DOI]

IEEE Trans. Signal Process., 2013

Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task.

[BibT_eX]

[DOI]

Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Ranking Search Intents Underlying a Query.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Processing and Information Systems, 2013

Sequential model adaptation for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A Fishervoice based feature fusion method for short utterance speaker recognition.

[BibT_eX]

[DOI]

Chenhao Zhang

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Sequential UBM adaptation for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Emotional speaker verification with linear adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Emotional adaptive training for speaker verification.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Speech unit category based short utterance speaker recognition.

[BibT_eX]

[DOI]

Nakhat Fatima

Xiaojun Wu

Comput. Sci. Inf. Syst., 2012

Content-Based Semantic Tag Ranking for Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012

Text-Dependent Speaker Recognition with long-term features based on functional data analysis.

[BibT_eX]

[DOI]

Chenhao Zhang

Ruxin Chen

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A K-phoneme-class based multi-model method for short utterance speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An investigation into better frequency warping for time-varying speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

CLGVSM: Adapting Generalized Vector Space Model to Cross-lingual Document Clustering.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Reliable accent specific unit generation with dynamic Gaussian mixture selection for multi-accent speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

2010

Using MMSE to improve session variability estimation.

[BibT_eX]

[DOI]

Gang Wang

Int. J. Biom., 2010

Using cepstral and prosodic features for Chinese accent identification.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Using phoneme recognition and text-dependent speaker verification to improve speaker segmentation for Chinese speech.

[BibT_eX]

[DOI]

Gang Wang

Xiaojun Wu

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Effectiveness of n-gram fast match for query-by-humming systems.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A phrase-level piecewise linear scaling algorithm for melody match in Query-by-Humming systems.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

2008

Local Mismatch Phone for Confidence Measure in Standard and Accented Chinese Speech Recognition.

[BibT_eX]

[DOI]

Wenxiao Cao

Yi Liu

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Job Information Retrieval Based on Document Similarity.

[BibT_eX]

[DOI]

Proceedings of the Information Retrieval Technology, 2008

2007

A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2007

Using a small development set to build a robust dialectal Chinese speech recognizer.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Emotion attribute projection for speaker recognition on emotional speech.

[BibT_eX]

[DOI]

Huanjun Bao

Ming-Xing Xu

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Session Variability Subspace Projection Based Model Compensation for Speaker Verification.

[BibT_eX]

[DOI]

Jing Deng

Proceedings of the IEEE International Conference on Acoustics, 2007

State-dependent mixture tying with variable codebook size for accented speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

A tree-based kernel selection approach to efficient Gaussian mixture model-universal background model based speaker identification.

[BibT_eX]

[DOI]

Speech Commun., 2006

CCC Speaker Recognition Evaluation 2006: Overview, Methods, Data, Results and Perspective.

[BibT_eX]

[DOI]

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

English Alphabet Recognition Based on Chinese Acoustic Modeling.

[BibT_eX]

[DOI]

Linquan Liu

Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition.

[BibT_eX]

[DOI]

Linquan Liu

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Pitch Mean Based Frequency Warping.

[BibT_eX]

[DOI]

Jian Liu

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection.

[BibT_eX]

[DOI]

Jing Deng

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Study on speaker verification on emotional speech.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Automatic initial/final generation for dialectal Chinese speech recognition.

[BibT_eX]

[DOI]

Linquan Liu

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Cohort-Based Speaker Model Synthesis for Channel Robust Speaker Recognition.

[BibT_eX]

[DOI]

Wei Wu

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

The dynamically-adjustable histogram pruning method for embedded voice dialing.

[BibT_eX]

Proceedings of the Signal and Image Processing (SIP 2005), 2005

Rapidly developing spoken Chinese dialogue systems with the d-ear SDS SDK.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Real-time pitch tracking based on combined SMDSF.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Modeling high-level information by using Gaussian mixture correlation for GMM-UBM based speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

The predictive differential amplitude spectrum for robust speaker recognition in stationary noises.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Combining Selection Tree with Observation Reordering Pruning for Efficient Speaker Identification Using GMM-UBM.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Making Full Use of Chinese Speech Corpora.

[BibT_eX]

J. Chin. Lang. Comput., 2004

A two-step keyword spotting method based on context-dependent a posteriori probability.

[BibT_eX]

[DOI]

Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Weighting observation vectors for robust speech recognition in noisy environments.

[BibT_eX]

[DOI]

Zhenyu Xiong

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

A Method to Build a Super Small but Practically Accurate Language Model for Handheld Devices.

[BibT_eX]

[DOI]

Genqing Wu

J. Comput. Sci. Technol., 2003

Using word confidence measure for OOV words detection in a spontaneous spoken dialog system.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Mandarin Pronunciation Modeling Based on CASS Corpus.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2002

Speech Detection in Non-Stationary Noise Based on the 1/f Process.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2002

A compression method used in language modeling for handheld devices.

[BibT_eX]

[DOI]

Genqing Wu

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Improved katz smoothing for language modeling in speech recogniton.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Comparison of Different Implementations of MFCC.

[BibT_eX]

[DOI]

Zhanjiang Song

J. Comput. Sci. Technol., 2001

Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modeling.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A two-layer lexical tree based beam search in continuous Chinese speech recognition.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Improved context-dependent acoustic modeling for continuous Chinese speech recognition.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Robust parsing in spoken dialogue systems.

[BibT_eX]

[DOI]

Pengju Yan

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

An online incremental language model adaptation method.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

An MCE based classification tree using hierarchical feature-weighting in speech recognition.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Design of a semantic parser with support to ellipsis resolution in a Chinese spoken language dialogue system.

[BibT_eX]

[DOI]

Yi Su

Yinfei Huang

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A theme structure method for the ellipsis resolution.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Topic Forest: a plan-based dialog management structure.

[BibT_eX]

[DOI]

Xiaojun Wu

Proceedings of the IEEE International Conference on Acoustics, 2001

Automatic generation of pronunciation lexicons for Mandarin spontaneous speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Improving the Syllable-Synchronous Network Search Algorithm for Word Decoding in Continuous Chinese Speech Recognition.

[BibT_eX]

[DOI]

Zhanjiang Song

J. Comput. Sci. Technol., 2000

Intra-syllable Dependent Phonetic Modeling For Chinese Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Tone Recognition of Chinese Continuous Speech.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

A Noise Cancellation Method Based on Wavelet Transform.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Word-class Stochastic Model in A Spoken Language Dialogue System.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

A Self adapting Endpoint Detection Algorithm for Speech Recognition in Noisy Environment Based on 1/f Process.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Acoustic Level Error Analysis in Continuous Speech Recognition.

[BibT_eX]

[DOI]

Chunhua Luo

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Context-Independent Chinese Initial-Final Acoustic Modeling.

[BibT_eX]

[DOI]

Jing Li

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Improved Strategies For Intelligent Sentence Input Method Engine System.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

EasyCmd: Navigation by Voice Commands.

[BibT_eX]

[DOI]

Yinfei Huang

Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Integrating the energy information into MFCC.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Input Chinese sentences using digits.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Semi-continuous segmental probability modeling for continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Reducing time-synchronous beam search effort using stage based look-ahead and language model rank based pruning.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

On enhancing katz-smoothing based back-off language model.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A c/v segmentation method for Mandarin speech based on multiscale fractal dimension.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

An equivalent-class based MMI learning method for MGCPM.

[BibT_eX]

[DOI]

Chunhua Luo

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

CASS: a phonetically transcribed corpus of mandarin spontaneous speech.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

The phonetic labeling on read and spontaneous discourse corpora.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Language understanding component for Chinese dialogue system.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Statistical knowledge based frame synchronous search strategies in continuous speech recognition.

[BibT_eX]

[DOI]

Zhanjiang Song

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

HarkMan - A vocabulary-independent keyword spotter for spontaneous Chinese speech.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 1999

Easytalk: a large-vocabulary speaker-independent Chinese dictation machine.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A fast and effective state decoding algorithm.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

An effective scoring method for speaking skill evaluation system.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A syllable-synchronous network search algorithm for word decoding in Chinese speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

An new method used in HMM for modeling frame correlation.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Center-distance continuous probability models and the distance measure.

[BibT_eX]

[DOI]

Ditang Fang

J. Comput. Sci. Technol., 1998

The Similarity Measure among Acoustic Models and Its Two Applications.

[BibT_eX]

[DOI]

Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998

A Vocabulary-Independent Keyword Spotter for Spontaneous Chinese Speech.

[BibT_eX]

[DOI]

Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998

On The Embedded Multiple-Model Scoring Scheme For Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998

The distance measure for line spectrum pairs applied to speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Non-linear probability estimation method used in HMM for modeling frame correlation.

[BibT_eX]

[DOI]

1997

A log-index weighted cepstral distance measure for speech recognition.

[BibT_eX]

[DOI]