Geoffrey Zweig

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

An Attentional Neural Conversation Model with Improved Specificity.

CoRR, 2016

Achieving Human Parity in Conversational Speech Recognition.

CoRR, 2016

End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning.

Jason D. Williams

CoRR, 2016

Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Parallelizing WFST speech decoders.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploring multidimensional LSTMs for large vocabulary ASR.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding.

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Attention with Intention for a Neural Network Conversation Model.

Kaisheng Yao

Baolin Peng

CoRR, 2015

Fast and easy language understanding for dialog systems with Microsoft Language Understanding Intelligent Service (LUIS).

Proceedings of the SIGDIAL 2015 Conference, 2015

Sequence-to-sequence neural net models for grapheme-to-phoneme conversion.

Kaisheng Yao

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Clustering novel intents in a conversational interaction system with semantic parsing.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Feedback-based handwriting recognition from inertial sensor data for wearable devices.

Yujia Li

Kaisheng Yao

Carlos Garcia Jurado Suarez

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

From captions to visual concepts and back.

Hao Fang

Saurabh Gupta

Forrest N. Iandola

Rupesh Kumar Srivastava

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Deep bi-directional recurrent networks over spectral windows.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

LSTM time and frequency recurrence for automatic speech recognition.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Language Models for Image Captioning: The Quirks and What Works.

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Rapidly Scaling Dialog Systems with Interactive Learning.

Mouni Reddy

Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

2014

Spoken language understanding using long short-term memory neural networks.

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Joint semantic utterance classification and slot filling with recursive neural networks.

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

An introduction to computational networks and the computational network toolkit (invited talk).

Christopher J. Rossbach

Jon Currey

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Probabilistic enrichment of knowledge graph entities for relation detection in conversational understanding.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Recurrent conditional random field for language understanding.

Proceedings of the IEEE International Conference on Acoustics, 2014

Cache based recurrent neural network language model inference for first pass speech recognition.

Zhiheng Huang

Benoît Dumoulin

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Combining Heterogeneous Models for Measuring Relational Similarity.

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Linguistic Regularities in Continuous Space Word Representations.

Tomás Mikolov

Wen-tau Yih

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Recurrent neural networks for language understanding.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Speed regularization and optimality in word classing.

Konstantin Makarychev

Proceedings of the IEEE International Conference on Acoustics, 2013

Combining forward and backward search in decoding.

Mirko Hannemann

Daniel Povey

Proceedings of the IEEE International Conference on Acoustics, 2013

Recent advances in deep learning for speech research at Microsoft.

Proceedings of the IEEE International Conference on Acoustics, 2013

Joint Language and Translation Modeling with Recurrent Neural Networks.

Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Accelerating recurrent neural network training via two stage classes and parallelization.

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Context dependent recurrent neural network language model.

Tomás Mikolov

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

A Challenge Set for Advancing Language Modeling.

Christopher J. C. Burges

Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012

Classification and recognition with direct segment models.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Polarity Inducing Latent Semantic Analysis.

Wen-tau Yih

John C. Platt

Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Computational Approaches to Sentence Completion.

John C. Platt

Christopher Meek

Christopher J. C. Burges

Ainur Yessenalina

Qiang Liu

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011

Personalizing Model M for Voice-Search.

Shuangyu Chang

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop.

Proceedings of the IEEE International Conference on Acoustics, 2011

MLP based phoneme detectors for Automatic Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, 2011

Discriminative duration modeling for speech recognition with segmental conditional random fields.

Justine T. Kao

Proceedings of the IEEE International Conference on Acoustics, 2011

Integrating meta-information into exemplar-based speech recognition with segmental conditional random fields.

Proceedings of the IEEE International Conference on Acoustics, 2011

Speaker adaptation with an Exponential Transform.

Daniel Povey

Alex Acero

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Speech Recognition With Flat Direct Models.

Georg Heigold

IEEE J. Sel. Top. Signal Process., 2010

Continuous speech recognition with a TF-IDF acoustic model.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

SCARF: a segmental conditional random field toolkit for speech recognition.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

From flat direct models to segmental CRF models.

Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative template extraction for direct modeling.

Shankar Shivappa

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Multi-scale Personalization for Voice Search Applications.

Daniel Bolaños

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Maximum mutual information multi-phone units in direct modeling.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

New methods for the analysis of repeated utterances.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Semantic context effects in the recognition of acoustically unreduced and reduced words.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Leveraging multiple query logs to improve language models for spoken query recognition.

Proceedings of the IEEE International Conference on Acoustics, 2009

A flat direct model for speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2009

A segmental CRF approach to large vocabulary continuous speech recognition.

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

Joint n-best rescoring for repeated utterances in spoken dialog systems.

Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Optimal Dialog in Consumer-Rating Systems using POMDP Framework.

Zhifei Li

Proceedings of the SIGDIAL 2008 Workshop, 2008

Learning N-Best Correction Models from Implicit User Feedback in a Multi-Modal Local Search Application.

Proceedings of the SIGDIAL 2008 Workshop, 2008

Structured models for joint decoding of repeated utterances.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Empirical properties of multilingual phone-to-word transduction.

Jon Nedel

Proceedings of the IEEE International Conference on Acoustics, 2008

Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments.

Proceedings of the IEEE International Conference on Acoustics, 2008

Language modeling for voice search: A machine translation approach.

Proceedings of the IEEE International Conference on Acoustics, 2008

An empirical study of automatic accent classification.

Ghinwa F. Choueiter

Proceedings of the IEEE International Conference on Acoustics, 2008

Live search for mobile: Web services by voice on the cellphone.

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Voice-Rate: A Dialog System for Consumer Ratings.

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Automated directory assistance system - from theory to practice.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Confidence measures for voice search applications.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

The IBM 2006 Gale Arabic ASR System.

Proceedings of the IEEE International Conference on Acoustics, 2007

Discriminative Training of Decoding Graphs for Large Vocabulary Continuous Speech Recognition.

Hong-Kwang Jeff Kuo

Brian Kingsbury

Proceedings of the IEEE International Conference on Acoustics, 2007

The IBM Mandarin Broadcast Speech Transcription System.

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Advances in speech transcription at IBM under the DARPA EARS program.

IEEE Trans. Speech Audio Process., 2006

On the Effect of Word Error Rate on Automated Quality Monitoring.

Bhuvana Ramabhadran

Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Automated Quality Monitoring for Call Centers using Speech and NLP Technologies.

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Advances in Mandarin Broadcast Speech Transcription at IBM Under the DARPA GALE Program.

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

The IBM 2006 speech transcription system for European parliamentary speeches.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Automated Quality Monitoring in the Call Center with ASR and Maximum Entropy.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Morpheme-Based Language Modeling for Arabic LVCSR.

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Introduction to the Special Issue on Data Mining of Speech, Audio, and Dialog.

Mazin Gilbert

Roger K. Moore

IEEE Trans. Speech Audio Process., 2005

Anatomy of an extremely fast LVCSR decoder.

Daniel Povey

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

The IBM 2004 Conversational Telephony System for Rich Transcription.

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

fMPE: Discriminatively Trained Features for Speech Recognition.

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Advances in Large Vocabulary Continuous Speech Recognition.

Michael Picheny

Adv. Comput., 2004

Speech recognition error analysis on the English MALACH corpus.

Olivier Siohan

Bhuvana Ramabhadran

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Use of metadata to improve recognition of spontaneous speech and named entities.

Bhuvana Ramabhadran

Olivier Siohan

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Bayesian network structures and inference techniques for automatic speech recognition.

Comput. Speech Lang., 2003

An architecture for rapid decoding of large vocabulary conversational speech.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Toward domain-independent conversational speech recognition.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic construction of unique signatures and confusable sets for natural language directory assistance applications.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Arc minimization in finite state decoding graphs with cross-word acoustic context.

François Yvon

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Maximum entropy model for punctuation annotation from speech.

Jing Huang

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Structurally discriminative graphical models for automatic speech recognition - results from the 2001 Johns Hopkins Summer Workshop.

Proceedings of the IEEE International Conference on Acoustics, 2002

The graphical models toolkit: An open source software system for speech and time-series processing.

Jeff A. Bilmes

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Extracting Caller Information from Voicemail.

Jing Huang

Proceedings of the Information Retrieval Techniques for Speech Applications [this book is based on the workshop "Information Retrieval Techniques for Speech Applications"], 2001

Linear feature space projections for speaker adaptation.

Proceedings of the IEEE International Conference on Acoustics, 2001

Information Extraction from Voicemail.

Jing Huang

Proceedings of the Association for Computational Linguistics, 2001

2000

Exact alpha-beta computation in logarithmic space with application to MAP word graph construction.

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Recent improvements in speech recognition performance on large vocabulary conversational speech (voicemail and switchboard).

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Boosting Gaussian mixtures in an LVCSR system.

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Dependency modeling with Bayesian networks in a voicemail transcription system.

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Recent improvements in voicemail transcription.

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998

Probabilistic modeling with Bayesian networks for automatic speech recognition.

Stuart Russell

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Speech Recognition with Dynamic Bayesian Networks.
