Xuedong Huang

Ruochen Xu

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019

Make Lead Bias in Your Favor: A Simple and Effective Method for News Summarization.

[BibT_eX]

[DOI]

CoRR, 2019

Meeting Transcription Using Virtual Microphone Arrays.

[BibT_eX]

[DOI]

Takuya Yoshioka

Zhuo Chen

Dimitrios Dimitriadis

CoRR, 2019

SIM: A Slot-Independent Neural Model for Dialogue State Tracking.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019

Meeting Transcription Using Asynchronous Distant Microphones.

[BibT_eX]

[DOI]

Takuya Yoshioka

Dimitrios Dimitriadis

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-task Learning for Natural Language Generation in Task-Oriented Dialogue.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Advances in Online Audio-Visual Meeting Transcription.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering.

[BibT_eX]

[DOI]

CoRR, 2018

Achieving Human Parity on Automatic Chinese to English News Translation.

[BibT_eX]

[DOI]

Marcin Junczys-Dowmunt

CoRR, 2018

The Microsoft 2017 Conversational Speech Recognition System.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Big Data for Speech and Language Processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017

Toward Human Parity in Conversational Speech Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

The microsoft 2016 conversational speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Achieving Human Parity in Conversational Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2016

2015

Large-Scale Question Answering with Joint Embedding and Proof Tree Decoding.

[BibT_eX]

[DOI]

Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014

Web Information at Your Fingertips: Paper as an Interaction Metaphor.

[BibT_eX]

[DOI]

Zheng Chen

Jian-Tao Sun

Computer, 2014

A historical perspective of speech recognition.

[BibT_eX]

[DOI]

James Baker

Raj Reddy

Commun. ACM, 2014

2010

An Overview of Modern Speech Recognition.

[BibT_eX]

[DOI]

Li Deng

Proceedings of the Handbook of Natural Language Processing, Second Edition., 2010

2008

International workshop on question answering on the web (QAWeb2008).

[BibT_eX]

[DOI]

Liu Wenyin

Qing Li

Proceedings of the 17th International Conference on World Wide Web, 2008

2004

Speech and Language Processing for Multimodal Human-Computer Interaction.

[BibT_eX]

[DOI]

J. VLSI Signal Process., 2004

Challenges in adopting speech recognition.

[BibT_eX]

[DOI]

Li Deng

Commun. ACM, 2004

Direct filtering for air- and bone-conductive microphones.

[BibT_eX]

[DOI]

Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Enabling natural computing.

[BibT_eX]

[DOI]

Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Multi-sensory microphones for robust speech detection, enhancement and recognition.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2002

Distributed speech processing in miPad's multimodal user interface.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2002

A speech-centric perspective for human-computer interface.

[BibT_eX]

[DOI]

Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

2001

MiPad: a multimodal interaction prototype.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

High-performance robust speech recognition using stereo training data.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Subword-dependent speaker clustering for improved speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Mipad: a next generation PDA prototype.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Large-vocabulary speech recognition under adverse acoustic environments.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A unified context-free grammar and n-gram model for spoken language processing.

[BibT_eX]

[DOI]

Ye-Yi Wang

Milind Mahajan

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Improvements on speech recognition for fast talkers.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Unified decoding and feature representation for improved speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Improved topic-dependent language modeling using information retrieval techniques.

[BibT_eX]

[DOI]

Milind Mahajan

Doug Beeferman

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Can continuous speech recognizers handle isolated speech?

[BibT_eX]

[DOI]

Speech Commun., 1998

HMM-based smoothing for concatenative speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Vocabulary-independent word confidence measure using subword features.

[BibT_eX]

[DOI]

How effective is unsupervised data collection for children's speech recognition?

[BibT_eX]

[DOI]

Dynamically configurable acoustic models for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Automatic generation of synthesis units for trainable text-to-speech systems.

[BibT_eX]

[DOI]

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

Improvements on a trainable letter-to-sound converter.

[BibT_eX]

[DOI]

Hsiao-Wuen Hon

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Recent improvements on Microsoft's trainable text-to-speech system-Whistler.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

Predicting unseen triphones with senones.

[BibT_eX]

[DOI]

Fileno A. Alleva

IEEE Trans. Speech Audio Process., 1996

Whistler: a trainable text-to-speech system.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Deleted interpolation and density sharing for continuous hidden Markov models.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Improvements on the pronunciation prefix tree search organization.

[BibT_eX]

[DOI]

Fil Alleva

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Speaker and gender normalization for continuous-density hidden Markov models.

[BibT_eX]

[DOI]

Alex Acero

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

Microsoft Windows highly intelligent speech recognizer: Whisper.

[BibT_eX]

[DOI]

Proceedings of the 1995 International Conference on Acoustics, 1995

1994

Session 2: Language Modeling.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technology, 1994

Improving speech recognition performance via phone-dependent VQ codebooks and adaptive language models in SPHINX-II.

[BibT_eX]

[DOI]

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993

Shared-distribution hidden Markov models for speech recognition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1993

On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition.

[BibT_eX]

[DOI]

Kai-Fu Lee

IEEE Trans. Speech Audio Process., 1993

A comparative study of discrete, semicontinuous, and continuous hidden Markov models.

[BibT_eX]

[DOI]

Comput. Speech Lang., 1993

The SPHINX-II speech recognition system: an overview.

[BibT_eX]

[DOI]

Comput. Speech Lang., 1993

Efficient Cepstral Normalization For Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

An Overview of the SPHINX-II Speech Recognition System.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

Senones, multi-pass search, and unified stochastic modeling in sphinx-II.

[BibT_eX]

[DOI]

Fil Alleva

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Unified stochastic engine (USE) for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1993

An improved search algorithm using incremental knowledge for continuous speech recognition.

[BibT_eX]

[DOI]

Fil Alleva

Proceedings of the IEEE International Conference on Acoustics, 1993

1992

Speech Understanding in Open Tasks.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Improvements in Stochastic Language Modeling.

[BibT_eX]

[DOI]

Ronald Rosenfeld

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Subphonetic Modeling for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Minimizing Speaker Variation Effects for Speaker-Independent Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Applying SPHINX-II to the DARPA Wall Street Journal CSR Task.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Exploiting correlations among competing models with application to large vocabulary speech recognition.

[BibT_eX]

[DOI]

Ronald Rosenfeld

Merrick L. Furst

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Subphonetic modeling with Markov states-Senone.

[BibT_eX]

[DOI]

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Speaker normalization for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991

A Study on Speaker-Adaptive Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language, 1991

Acoustic distribution clustering in phonetic hidden Markov models.

[BibT_eX]

[DOI]

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Improved acoustic modeling with the SPHINX speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the 1991 International Conference on Acoustics, 1991

1990

Speech recognition using hidden Markov models: A CMU perspective.

[BibT_eX]

[DOI]

Speech Commun., 1990

Improved Hidden Markov Modeling for Speaker-Independent Continuous Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

On semi-continuous hidden Markov modeling.

[BibT_eX]

[DOI]

Kai-Fu Lee

Hsiao-Wuen Hon

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

Large-vocabulary speaker-independent continuous speech recognition with semi-continuous hidden Markov models.

[BibT_eX]

[DOI]