Man-Hung Siu

According to our database1, Man-Hung Siu authored at least 93 papers between 1991 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization.
CoRR, 2023

Acoustic Model Fusion For End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2020
Learning from Noisy Labels with Noise Modeling Network.
CoRR, 2020

Towards a New Understanding of the Training of Neural Networks with Mislabeled Training Data.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2018
Optimizing Multilingual Knowledge Transfer for Time-Delay Neural Networks with Low-Rank Factorization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Improving Deliverable Speech-to-Text Systems with Multilingual Knowledge Transfer.
Proceedings of the Interspeech 2017, 2017

Improved Single System Conversational Telephone Speech Recognition with VGG Bottleneck Features.
Proceedings of the Interspeech 2017, 2017

Unsupervised adaptation for deep neural networks using Alternating Direction Method of Multipliers.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Sage: The New BBN Speech Processing Platform.
Proceedings of the Interspeech 2016, 2016

Importance sampling of delta-AUC: A basis for active learning for improved keyword search.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Large-scale speaker search using PLDA on mismatched conditions.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Unsupervised training of an HMM-based self-organizing unit recognizer with applications to topic classification and keyword discovery.
Comput. Speech Lang., 2014

2012
MLLR transforms of self-organized units as features in speaker recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Detection of unseen words in conversational Mandarin.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Unsupervised Audio Patterns Discovery Using HMM-Based Self-Organized Units.
Proceedings of the INTERSPEECH 2011, 2011

Topic modeling for spoken documents using only phonetic information.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Improved named entity extraction from conversational speech with language model adaptation.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Improved topic classification and keyword discovery using an HMM-based speech recognizer trained without supervision.
Proceedings of the INTERSPEECH 2010, 2010

2009
Discriminatively Trained GMMs for Language Classification Using Boosting Methods.
IEEE Trans. Speech Audio Process., 2009

Clustering-Based Approach for Predicting Motif Pairs from protein Interaction Data.
J. Bioinform. Comput. Biol., 2009

Unsupervised training of an HMM-based speech recognizer for topic classification.
Proceedings of the INTERSPEECH 2009, 2009

2008
Evaluation of the robustness of the polynomial segment models to noisy environments with unsupervised adaptation.
Speech Commun., 2008

Optimal Algorithm for Finding DNA Motifs with Nucleotide Adjacent Dependency.
Proceedings of the 6th Asia-Pacific Bioinformatics Conference, 2008

2007
Web resources for language modeling in conversational speech recognition.
ACM Trans. Speech Lang. Process., 2007

Boosting with anti-models for automatic language identification.
Proceedings of the INTERSPEECH 2007, 2007

A model-based estimation of phonotactic language verification performance.
Proceedings of the INTERSPEECH 2007, 2007

N-Best Tokenization in a GMM-SVM Language Identification System.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
A Robust Viterbi Algorithm Against Impulsive Noise With Application to Speech Recognition.
IEEE Trans. Speech Audio Process., 2006

Recursive likelihood evaluation and fast search algorithm for polynomial segment model with application to speech recognition.
IEEE Trans. Speech Audio Process., 2006

Minimization of Utterance Verification Error Rate as a Constrained Optimization Problem.
IEEE Signal Process. Lett., 2006

Joint Optimization of the Frequency-Domain and Time-Domain Transformations in Deriving Generalized Static and Dynamic MFCCs.
IEEE Signal Process. Lett., 2006

Maximum Likelihood Linear Regression Adaptation for the Polynomial Segment Models.
IEEE Signal Process. Lett., 2006

Adaptive articulatory feature-based conditional pronunciation modeling for speaker verification.
Speech Commun., 2006

Articulatory-feature-based confidence measures.
Comput. Speech Lang., 2006

Discriminatively trained Language Models using Support Vector Machines for Language Identification.
Proceedings of the Odyssey 2006, 2006

Consistent Modeling of the Static and Time-Derivative Cepstrums for Speech Recognition Using HSPTM.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Improved language identification using support vector machines for language modeling.
Proceedings of the INTERSPEECH 2006, 2006

Improved tone modeling for Mandarin broadcast news speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

Robust Large Vocabulary Continuous Speech Recognition using Polynomial Segment Model with Unsupervised Adaptation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Reference Speaker Weighting Adaptation for Sub-Phonetic Polynomial Segment Models.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
A quantitative assessment of the importance of tone in mandarin speech recognition.
IEEE Signal Process. Lett., 2005

Efficient computation of the frame-based extended union model and its application in speech recognition against partial temporal corruptions.
Comput. Speech Lang., 2005

High-density discrete HMM with the use of scalar quantization indexing.
Proceedings of the INTERSPEECH 2005, 2005

Speaker verification via articulatory feature-based conditional pronunciation modeling with vowel and consonant mixture models.
Proceedings of the INTERSPEECH 2005, 2005

Web-Data Augmented Language Models for Mandarin Conversational Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Speaker Verification Using Adapted Articulatory Feature-based Conditional Pronunciation Modeling.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Sub-phonetic Polynomial Segment Model for Large Vocabulary Continuous Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Integration of acoustic and articulatory information with application to speech recognition.
Inf. Fusion, 2004

Adaptive conditional pronunciation modeling using articulatory features for speaker verification.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Progress on Mandarin conversational telephone speech recognition.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Automatic language identification using discrete hidden Markov model.
Proceedings of the INTERSPEECH 2004, 2004

A robust training algorithm based on neighborhood information.
Proceedings of the INTERSPEECH 2004, 2004

Improved performance of Aurora 4 using HTK and unsupervised MLLR adaptation.
Proceedings of the INTERSPEECH 2004, 2004

Speech recognition enhancement by psychoacoustic modeled noise suppression.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Decision tree based tone modeling for Chinese speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Training for polynomial segment model using the expectation maximization algorithm.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Hidden spectral peak trajectory model for phone classification.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Discrimination power weighted subword-based speaker verification.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Maximum likelihood normalization for robust speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A new approach to minimize utterance verification error rate for a specific operating point.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

An efficient incremental likelihood evaluation for polynomial trajectory model using with application to model training and recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Phone level confidence measure using articulatory features.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Robust speech recognition against short-time noise.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Speech recognition using combined acoustic and articulatory information with retraining of acoustic model parameters.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Integration of Tone Related Feature for Chinese Speech Recognition.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Speech recognition using multi-state acoustic and articulatory features models with asynchronous states transition.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Robust speech recognition against packet loss.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Variable n-grams and extensions for conversational speech language modeling.
IEEE Trans. Speech Audio Process., 2000

Computer-aided Mandarin pronunciation learning system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Pruning of state-tying tree using bayesian information criterion with multiple mixtures.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Integrating a context-dependent phrase grammar in the variable n-gram framework.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Evaluation of word confidence for speech recognition systems.
Comput. Speech Lang., 1999

Using a large vocabulary continuous speech recognizer for a constrained domain with limited training.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Automatic topic identification for two-level call routing.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Recent experiments in large vocabulary conversational speech recognition.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Using untranscribed training data to improve performance.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Parametric trajectory mixtures for LVCSR.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Hidden Markov models for trajectory modeling.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

The BBN Byblos 1997 large vocabulary conversational speech recognition system.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Variable n-gram language modeling and extensions for conversational speech.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Improved estimation, evaluation and applications of confidence measures for speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Modeling disfluencies in conversational speech.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
Large vocabulary word scoring as a basis for transcription generation.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Reducing word error rate on conversational speech from the Switchboard corpus.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Predicting word spotting performance.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Spotting events in continuous speech.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

An invariance property of neural networks.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Phonetic-based word spotter: various configurations and application to event spotting.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Phonetic training and language modeling for word spotting.
Proceedings of the IEEE International Conference on Acoustics, 1993

Gisting conversational speech in real time.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
An unsupervised, sequential learning algorithm for the segmentation of speech waveforms with multiple speakers.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Gisting conversational speech.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Segregation of speakers for speech recognition and speaker identification.
Proceedings of the 1991 International Conference on Acoustics, 1991


  Loading...