Alex Acero

Affiliations:
  • Microsoft Research


According to our database1, Alex Acero authored at least 217 papers between 1989 and 2021.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2004, "For contributions to noise robust speech recognition and speech technology education.".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2021
DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Robust Multichannel Linear Prediction for Online Speech Dereverberation Using Weighted Householder Least Squares Lattice Adaptive Filter.
IEEE Trans. Signal Process., 2020

2016
We Need Your Help to Take the Society to New Heights [President's Message].
IEEE Signal Process. Mag., 2016

Siri's voice gets deep learning.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

2015
Signal Processing: The Science Behind Our Digital Life [President's Message].
IEEE Signal Process. Mag., 2015

Should We Experiment with New Peer-Review Models? [President's Message].
IEEE Signal Process. Mag., 2015

SigPort: A Paper Repository for Signal Processing [President's Message].
IEEE Signal Process. Mag., 2015

The IEEE Gives Our Society the "Thumbs Up" [President's Message].
IEEE Signal Process. Mag., 2015

The IEEE Signal Processing Cup: A Competition for Undergraduate Students [President's Message].
IEEE Signal Process. Mag., 2015

SigView: Video Tutorials in Emerging Signal Processing Topics [President's Message].
IEEE Signal Process. Mag., 2015

2014
Chapters? Role in Networking and Continuing Education [President's Message].
IEEE Signal Process. Mag., 2014

Where Does Your Conference Registration Fee Go? [President's Message].
IEEE Signal Process. Mag., 2014

At the Forefront in Technical Publications [President's Message].
IEEE Signal Process. Mag., 2014

2013
Recent advances in deep learning for speech research at Microsoft.
Proceedings of the IEEE International Conference on Acoustics, 2013

Learning deep structured semantic models for web search using clickthrough data.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition.
IEEE Trans. Speech Audio Process., 2012

Factored adaptation using a combination of feature-space and model-space transforms.
Proceedings of the INTERSPEECH 2012, 2012

New methods and evaluation experiments on translating TED talks in the IWSLT benchmark.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Media Search in Mobile Devices [From the Guest Editors].
IEEE Signal Process. Mag., 2011

The MSR SYSTEM for IWSLT 2011 evaluation.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Separating Speaker and Environmental Variability Using Factored Transforms.
Proceedings of the INTERSPEECH 2011, 2011

A novel decision function and the associated decision-feedback learning for speech translation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Lexicon modeling for query understanding.
Proceedings of the IEEE International Conference on Acoustics, 2011

Why word error rate is not a good metric for speech recognizer training for the speech translation task?
Proceedings of the IEEE International Conference on Acoustics, 2011

Joint encoding of the waveform and speech recognition features using a transform codec.
Proceedings of the IEEE International Conference on Acoustics, 2011

A new speaker identification algorithm for gaming scenarios.
Proceedings of the IEEE International Conference on Acoustics, 2011

Large vocabulary continuous speech recognition with context-dependent DBN-HMMS.
Proceedings of the IEEE International Conference on Acoustics, 2011

Factored adaptation for separable compensation of speaker and environmental variability.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Speaker adaptation with an Exponential Transform.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Learning with click graph for query intent classification.
ACM Trans. Inf. Syst., 2010

Noise Adaptive Training for Robust Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2010

Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion.
Comput. Speech Lang., 2010

Continuous speech recognition with a TF-IDF acoustic model.
Proceedings of the INTERSPEECH 2010, 2010

HMM adaptation using linear spline interpolation with integrated spline parameter training for robust speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Binary coding of speech spectrograms using a deep auto-encoder.
Proceedings of the INTERSPEECH 2010, 2010

Information retrieval methods for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Acoustic model adaptation via Linear Spline Interpolation for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Reverberated speech signal separation based on regularized subband feedforward ICA and instantaneous direction of arrival.
Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative training methods for language models using conditional entropy criteria.
Proceedings of the IEEE International Conference on Acoustics, 2010

Context dependent phonetic string edit distance for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models.
IEEE Trans. Speech Audio Process., 2009

Using continuous features in the maximum entropy model.
Pattern Recognit. Lett., 2009

A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions.
Comput. Speech Lang., 2009

Extracting structured information from user queries with semi-supervised conditional random fields.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Hidden conditional random field with distribution constraints for phone classification.
Proceedings of the INTERSPEECH 2009, 2009

Cross-lingual speech recognition under runtime resource constraints.
Proceedings of the IEEE International Conference on Acoustics, 2009

Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion.
Proceedings of the IEEE International Conference on Acoustics, 2009

Maximizing global entropy reduction for active learning in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Using collective information in semi-supervised learning for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Voice search of structured media data.
Proceedings of the IEEE International Conference on Acoustics, 2009

A study on multilingual acoustic modeling for large vocabulary ASR.
Proceedings of the IEEE International Conference on Acoustics, 2009

Noise adaptive training using a vector taylor series approach for noise robust automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Experimenting with a global decision tree for state clustering in automatic speech recognition systems.
Proceedings of the IEEE International Conference on Acoustics, 2009

Noise robust model adaptation using linear spline interpolation.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
IEEE Trans. Speech Audio Process., 2008

An Integrative and Discriminative Technique for Spoken Utterance Classification.
IEEE Trans. Speech Audio Process., 2008

An introduction to voice search.
IEEE Signal Process. Mag., 2008

Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling.
Speech Commun., 2008

Large-margin minimum classification error training: A theoretical risk minimization perspective.
Comput. Speech Lang., 2008

Learning query intent from regularized click graphs.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

Discriminative training of variable-parameter HMMs for noise robust speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

Inductive and example-based learning for text classification.
Proceedings of the INTERSPEECH 2008, 2008

Sound capture system and spatial filter for small devices.
Proceedings of the INTERSPEECH 2008, 2008

Automatic children's reading tutor on hand-held devices.
Proceedings of the INTERSPEECH 2008, 2008

Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculation.
Proceedings of the INTERSPEECH 2008, 2008

A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Maximum a posteriori ICA: Applying prior knowledge to the separation of acoustic sources.
Proceedings of the IEEE International Conference on Acoustics, 2008

Robust design of wideband loudspeaker arrays.
Proceedings of the IEEE International Conference on Acoustics, 2008

AN EM-based probabilistic approach for Acoustic Echo Suppression.
Proceedings of the IEEE International Conference on Acoustics, 2008

Language modeling for voice search: A machine translation approach.
Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptation of compressed HMM parameters for resource-constrained speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Speech enhancement using a pitch predictive model.
Proceedings of the IEEE International Conference on Acoustics, 2008

Live search for mobile: Web services by voice on the cellphone.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition.
IEEE Trans. Speech Audio Process., 2007

Adaptive Kalman Filtering and Smoothing for Tracking Vocal Tract Resonances Using a Continuous-Valued Hidden Dynamic Model.
IEEE Trans. Speech Audio Process., 2007

Automatic Removal of Typed Keystrokes From Speech Signals.
IEEE Signal Process. Lett., 2007

Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation.
Comput. Speech Lang., 2007

Soft indexing of speech content for search in spoken documents.
Comput. Speech Lang., 2007

Commute UX: Telephone Dialog System for Location-based Services.
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007


The voice-rate dialog system for consumer ratings.
Proceedings of the INTERSPEECH 2007, 2007

Automated directory assistance system - from theory to practice.
Proceedings of the INTERSPEECH 2007, 2007

Handling phonetic context and speaker variation in a structure-based speech recognizer.
Proceedings of the INTERSPEECH 2007, 2007

Confidence measures for voice search applications.
Proceedings of the INTERSPEECH 2007, 2007

Voicepedia: towards speech-based access to unstructured information.
Proceedings of the INTERSPEECH 2007, 2007

Robust location understanding in spoken dialog systems using intersections.
Proceedings of the INTERSPEECH 2007, 2007

A fine pitch model for speech.
Proceedings of the INTERSPEECH 2007, 2007

Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2007

Robust Adaptive Beamforming Algorithm using Instantaneous Direction of Arrival with Enhanced Noise Suppression Capability.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Discriminative Training Framework using N-Best Speech Recognition Transcriptions and Scores for Spoken Utterance Classification.
Proceedings of the IEEE International Conference on Acoustics, 2007

Maximum Entropy Confidence Estimation for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Generative-Discriminative Framework using Ensemble Methods for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2007

Microphone Array Post-Filter using Incremental Bayes Learning to Track the Spatial Distributions of Speech and Noise.
Proceedings of the IEEE International Conference on Acoustics, 2007

Efficient and Robust Language Modeling in an Automatic Children's Reading Tutor System.
Proceedings of the IEEE International Conference on Acoustics, 2007

Maximum entropy model parameterization with TF∗IDF weighted vector space model.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Adapting grapheme-to-phoneme conversion for name recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Structured speech modeling.
IEEE Trans. Speech Audio Process., 2006

A bidirectional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition.
IEEE Trans. Speech Audio Process., 2006

Tracking vocal tract resonances using a quantized nonlinear function embeddedin a temporal constraint.
IEEE Trans. Speech Audio Process., 2006

A lattice search technique for a long-contextual-span hidden trajectory model of speech.
Speech Commun., 2006

Rapid development of spoken language understanding grammars.
Speech Commun., 2006

Adaptation of maximum entropy capitalizer: Little data can help a lot.
Comput. Speech Lang., 2006

Integration of Metadata in spoken Document Search Using Position Specific Posterior latices.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

An effective and efficient utterance verification technology using word n-gram filler models.
Proceedings of the INTERSPEECH 2006, 2006

Use of incrementally regulated discriminative margins in MCE training for speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

Discriminative models for spoken language understanding.
Proceedings of the INTERSPEECH 2006, 2006

A time-synchronous phonetic decoder for a long-contextual-Span hidden trajectory model.
Proceedings of the INTERSPEECH 2006, 2006

Call analysis with classification using speech and non-speech features.
Proceedings of the INTERSPEECH 2006, 2006

Speech Modelingwith Magnitude-Normalized Complex Spectra and Its Application to Multisensory Speech Enhancement.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

N-Gram Based Filler Model for Robust Grammar Authoring.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Speech Utterance Classification Model Training without Manual Transcriptions.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Pruning Analysis for the Position Specific Posterior Lattices for Spoken Document Search.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Training Algorithms for Hidden Conditional Random Fields.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Joint Discriminative Front End and Back End Training for Improved Speech Recognition Accuracy.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Combining Statistical and Knowledge-Based Spoken Language Understanding in Conditional Models.
Proceedings of the ACL 2006, 2006

2005
Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering.
IEEE Trans. Speech Audio Process., 2005

Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion.
IEEE Trans. Speech Audio Process., 2005

Spoken language understanding.
IEEE Signal Process. Mag., 2005

Analysis and comparison of two speech feature extraction/compensation algorithms.
IEEE Signal Process. Lett., 2005

Evaluation of a long-contextual-Span hidden trajectory model and phonetic recognizer using a* lattice search.
Proceedings of the INTERSPEECH 2005, 2005

SGStudio: rapid semantic grammar development for spoken language understanding.
Proceedings of the INTERSPEECH 2005, 2005

A graphical model for multi-sensory speech processing in air-and-bone conductive microphones.
Proceedings of the INTERSPEECH 2005, 2005

Robust bandwidth extension of noise-corrupted narrowband speech.
Proceedings of the INTERSPEECH 2005, 2005

Hidden conditional random fields for phone classification.
Proceedings of the INTERSPEECH 2005, 2005

Maximum mutual information SPLICE transform for seen and unseen conditions.
Proceedings of the INTERSPEECH 2005, 2005

Learning statistically characterized resonance targets in a hidden trajectory model of speech coarticulation and reduction.
Proceedings of the INTERSPEECH 2005, 2005

Indexing uncertainty for spoken document search.
Proceedings of the INTERSPEECH 2005, 2005

Automatic Head-size Equalization in Panorama Images for Video Conferencing.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Maximum Entropy Based Generic Filter for Language Model Adaptation.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Training Wideband Acoustic Models using Mixed-Bandwidth Training Data via Feature Bandwidth Extension.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Leakage Model and Teeth Clack Removal for Air- and Bone-Conductive Integrated Microphones.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Unsupervised Semantic Intent Discovery from Call Log Acoustics.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

A Hidden Trajectory Model with Bi-directional Target-Filtering: Cascaded vs. Integrated Implementation for Phonetic Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

SPEECH OGLE: Indexing Uncertainty for Spoken Document Search.
Proceedings of the ACL 2005, 2005

Position Specific Posterior Lattices for Indexing Speech.
Proceedings of the ACL 2005, 2005

2004
Speech and Language Processing for Multimodal Human-Computer Interaction.
J. VLSI Signal Process., 2004

Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features.
IEEE Trans. Speech Audio Process., 2004

Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise.
IEEE Trans. Speech Audio Process., 2004

Use and Acquisition of Semantic Language Model.
Proceedings of the Demonstration Papers at HLT-NAACL 2004, 2004

Direct filtering for air- and bone-conductive microphones.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Nonlinear information fusion in multi-sensor processing - extracting and exploiting hidden dynamics of speech captured by a bone-conductive microphone.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Unsupervised learning from users' error correction in speech dictation.
Proceedings of the INTERSPEECH 2004, 2004

A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech.
Proceedings of the INTERSPEECH 2004, 2004

Multi-sensory microphones for robust speech detection, enhancement and recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Noise robust speech recognition with a switching linear dynamic model.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lo.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing , 2004

2003
Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition.
IEEE Trans. Speech Audio Process., 2003

Speech Recognition and Understanding.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Improved name recognition with user modeling.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Combination of CFG and n-gram modeling in semantic grammar learning.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A harmonic-model-based front end for robust speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Adapting acoustic models to new domains and conditions using untranscribed data.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A comparison of three non-linear observation models for noisy speech features.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Estimating speech recognition error rate without acoustic test data.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Discriminative training of n-gram classifiers for speech and text routing.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Concept acquisition in example-based grammar authoring.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Incremental Bayes learning with prior evolution for tracking nonstationary noise statistics from noisy speech data.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Speech utterance classification.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

An expectation maximization approach for formant tracking using a parameter-free non-linear predictor.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Distributed speech processing in miPad's multimodal user interface.
IEEE Trans. Speech Audio Process., 2002

Combination of statistical and rule-based approaches for spoken language understanding.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Evaluation of SPLICE on the Aurora 2 and 3 tasks.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Noise from corrupted speech log mel-spectral energies.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Exploiting variances in robust feature extraction based on a parametric model of speech distortion.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Sequential MAP noise estimation and a phase-sensitive model of the acoustic environment.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Separating colorred signals distorted by convolutive channels using diagonal constrained decorrelation.
Proceedings of the IEEE International Conference on Acoustics, 2002

Evaluation of spoken language grammar learning in the ATIS domain.
Proceedings of the IEEE International Conference on Acoustics, 2002

Uncertainty decoding with SPLICE for noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

A Bayesian approach to speech feature enhancement using the dynamic cepstral prior.
Proceedings of the IEEE International Conference on Acoustics, 2002

A speech-centric perspective for human-computer interface.
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

2001
ALGONQUIN - Learning Dynamic Noise Models From Noisy Speech for Robust Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

ALGONQUIN: iterating laplace's method to remove multiple types of acoustic distortion for robust speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Evaluation of the SPLICE algorithm on the Aurora2 database.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Experimental investigation of delayed instantaneous demixer for speech enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2001

Towards non-stationary model-based noise adaptation for large vocabulary speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2001


Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2001

High-performance robust speech recognition using stereo training data.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Speech Denoising and Dereverberation Using Probabilistic Models.
Proceedings of the Advances in Neural Information Processing Systems 13, 2000

Automatically extracting highlights for TV Baseball programs.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Mipad: a next generation PDA prototype.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Large-vocabulary speech recognition under adverse acoustic environments.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

HMM adaptation using vector taylor series for noisy speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Speech/noise separation using two microphones and a VQ model of speech signals.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Improvements on speech recognition for fast talkers.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Formant analysis and synthesis using hidden Markov models.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Speech Research: Near and Not-so-near Results and What They Might Mean for IUI (Panel).
Proceedings of the 3rd International Conference on Intelligent User Interfaces, 1998

HMM-based smoothing for concatenative speech synthesis.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Maximum a posteriori pitch tracking.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A mixed-excitation frequency domain model for time-scale pitch-scale modification of speech.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Automatic generation of synthesis units for trainable text-to-speech systems.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Source-filter models for time-scale pitch-scale modification of speech.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Recent improvements on Microsoft's trainable text-to-speech system-Whistler.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Whistler: a trainable text-to-speech system.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Speaker and gender normalization for continuous-density hidden Markov models.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Microsoft Windows highly intelligent speech recognizer: Whisper.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Discriminative training of garbage model for non-vocabulary utterance rejection.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

The VESTEL telephone speech database.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Signal processing for robust speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Environment normalization for robust speech recognition using direct cepstral comparison.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Efficient Cepstral Normalization For Robust Speech Recognition.
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

Robust HMM-based endpoint detector.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Rejection techniques for digit recognition in telecommunication applications.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Multiple approaches to robust speech recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Efficient joint compensation of speech for the effects of additive noise and linear filtering.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Robust speech recognition by normalization of the acoustic space.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Towards Environment-Independent Spoken Language Systems.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

Acoustical pre-processing for robust spoken language systems.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Environmental robustness in automatic speech recognition.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
ACOUSTICAL PRE-PROCESSING FOR ROBUST SPEECH RECOGNITION.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989


  Loading...