Geoffrey Zweig

Affiliations:
  • Facebook AI, USA (since 2018)
  • Microsoft Research, Redmond, WA, USA
  • IBM, T.J. Watson Research Center, Yorktown Heights, NY, USA
  • University of California, Berkeley, CA, USA


According to our database1, Geoffrey Zweig authored at least 127 papers between 1995 and 2021.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2013, "For contribuitons to advance speech recognition".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2021
Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

On Compositions of Transformations in Contrastive Self-Supervised Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
CoRR, 2020

Multi-modal Self-Supervision from Generalized Data Transformations.
CoRR, 2020

Multilingual Graphemic Hybrid ASR with Massive Data Augmentation.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces.
Proceedings of the Interspeech 2020, 2020

Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR.
Proceedings of the Interspeech 2020, 2020

Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model.
Proceedings of the Interspeech 2020, 2020

Contextual RNN-T for Open Domain ASR.
Proceedings of the Interspeech 2020, 2020

Transformer-Based Acoustic Modeling for Hybrid Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DEJA-VU: Double Feature Presentation and Iterated Loss in Deep Transformer Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Training ASR Models By Generation of Contextual Information.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Deja-vu: Double Feature Presentation in Deep Transformer Networks.
CoRR, 2019

Multilingual ASR with Massive Data Augmentation.
CoRR, 2019

From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2017
Toward Human Parity in Conversational Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Advances in all-neural speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

The microsoft 2016 conversational speech recognition system.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

May I take your order? A Neural Model for Extracting Structured Information from Conversations.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
An Attentional Neural Conversation Model with Improved Specificity.
CoRR, 2016

Achieving Human Parity in Conversational Speech Recognition.
CoRR, 2016

End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning.
CoRR, 2016

Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention.
Proceedings of the Interspeech 2016, 2016

Parallelizing WFST speech decoders.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploring multidimensional lstms for large vocabulary ASR.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Attention with Intention for a Neural Network Conversation Model.
CoRR, 2015

Fast and easy language understanding for dialog systems with Microsoft Language Understanding Intelligent Service (LUIS).
Proceedings of the SIGDIAL 2015 Conference, 2015

Sequence-to-sequence neural net models for grapheme-to-phoneme conversion.
Proceedings of the INTERSPEECH 2015, 2015

Clustering novel intents in a conversational interaction system with semantic parsing.
Proceedings of the INTERSPEECH 2015, 2015

Feedback-based handwriting recognition from inertial sensor data for wearable devices.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

From captions to visual concepts and back.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Deep bi-directional recurrent networks over spectral windows.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

LSTM time and frequency recurrence for automatic speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Language Models for Image Captioning: The Quirks and What Works.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Rapidly Scaling Dialog Systems with Interactive Learning.
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

2014
Spoken language understanding using long short-term memory neural networks.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Joint semantic utterance classification and slot filling with recursive neural networks.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

An introduction to computational networks and the computational network toolkit (invited talk).
Proceedings of the INTERSPEECH 2014, 2014

Probabilistic enrichment of knowledge graph entities for relation detection in conversational understanding.
Proceedings of the INTERSPEECH 2014, 2014

Recurrent conditional random field for language understanding.
Proceedings of the IEEE International Conference on Acoustics, 2014

Cache based recurrent neural network language model inference for first pass speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Combining Heterogeneous Models for Measuring Relational Similarity.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Linguistic Regularities in Continuous Space Word Representations.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Recurrent neural networks for language understanding.
Proceedings of the INTERSPEECH 2013, 2013

Speed regularization and optimality in word classing.
Proceedings of the IEEE International Conference on Acoustics, 2013

Combining forward and backward search in decoding.
Proceedings of the IEEE International Conference on Acoustics, 2013

Recent advances in deep learning for speech research at Microsoft.
Proceedings of the IEEE International Conference on Acoustics, 2013

Joint Language and Translation Modeling with Recurrent Neural Networks.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Accelerating recurrent neural network training via two stage classes and parallelization.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Context dependent recurrent neural network language model.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

A Challenge Set for Advancing Language Modeling.
Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012

Classification and recognition with direct segment models.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Polarity Inducing Latent Semantic Analysis.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Computational Approaches to Sentence Completion.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Personalizing Model M for Voice-Search.
Proceedings of the INTERSPEECH 2011, 2011

Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop.
Proceedings of the IEEE International Conference on Acoustics, 2011

MLP based phoneme detectors for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Discriminative duration modeling for speech recognition with segmental conditional random fields.
Proceedings of the IEEE International Conference on Acoustics, 2011

Integrating meta-information into exemplar-based speech recognition with segmental conditional random fields.
Proceedings of the IEEE International Conference on Acoustics, 2011

Speaker adaptation with an Exponential Transform.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Speech Recognition With Flat Direct Models.
IEEE J. Sel. Top. Signal Process., 2010

Continuous speech recognition with a TF-IDF acoustic model.
Proceedings of the INTERSPEECH 2010, 2010

SCARF: a segmental conditional random field toolkit for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

From flat direct models to segmental CRF models.
Proceedings of the IEEE International Conference on Acoustics, 2010

Discriminative template extraction for direct modeling.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Multi-scale Personalization for Voice Search Applications.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Maximum mutual information multi-phone units in direct modeling.
Proceedings of the INTERSPEECH 2009, 2009

New methods for the analysis of repeated utterances.
Proceedings of the INTERSPEECH 2009, 2009

Semantic context effects in the recognition of acoustically unreduced and reduced words.
Proceedings of the INTERSPEECH 2009, 2009

Leveraging multiple query logs to improve language models for spoken query recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

A flat direct model for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

A segmental CRF approach to large vocabulary continuous speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Joint n-best rescoring for repeated utterances in spoken dialog systems.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Optimal Dialog in Consumer-Rating Systems using POMDP Framework.
Proceedings of the SIGDIAL 2008 Workshop, 2008

Learning N-Best Correction Models from Implicit User Feedback in a Multi-Modal Local Search Application.
Proceedings of the SIGDIAL 2008 Workshop, 2008

Structured models for joint decoding of repeated utterances.
Proceedings of the INTERSPEECH 2008, 2008

Empirical properties of multilingual phone-to-word transduction.
Proceedings of the IEEE International Conference on Acoustics, 2008

Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments.
Proceedings of the IEEE International Conference on Acoustics, 2008

Language modeling for voice search: A machine translation approach.
Proceedings of the IEEE International Conference on Acoustics, 2008

An empirical study of automatic accent classification.
Proceedings of the IEEE International Conference on Acoustics, 2008

Live search for mobile: Web services by voice on the cellphone.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
The voice-rate dialog system for consumer ratings.
Proceedings of the INTERSPEECH 2007, 2007

Automated directory assistance system - from theory to practice.
Proceedings of the INTERSPEECH 2007, 2007

Confidence measures for voice search applications.
Proceedings of the INTERSPEECH 2007, 2007

The IBM 2006 Gale Arabic ASR System.
Proceedings of the IEEE International Conference on Acoustics, 2007

Discriminative Training of Decoding Graphs for Large Vocabulary Continuous Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

The IBM Mandarin Broadcast Speech Transcription System.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Advances in speech transcription at IBM under the DARPA EARS program.
IEEE Trans. Speech Audio Process., 2006

On the Effect Ofword Error Rate on Automated Quality Monitoring.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Automated Quality Monitoring for Call Centers using Speech and NLP Technologies.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Advances in Mandarin Broadcast Speech Transcription at IBM Under the DARPA GALE Program.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

The IBM 2006 speech transcription system for european parliamentary speeches.
Proceedings of the INTERSPEECH 2006, 2006

Automated Quality Monitoring in the Call Center with ASR and Maximum Entropy.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Morpheme-Based Language Modeling for Arabic Lvcsr.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Introduction to the Special Issue on Data Mining of Speech, Audio, and Dialog.
IEEE Trans. Speech Audio Process., 2005

Anatomy of an extremely fast LVCSR decoder.
Proceedings of the INTERSPEECH 2005, 2005

The IBM 2004 Conversational Telephony System for Rich Transcription.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

fMPE: Discriminatively Trained Features for Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Arc minimization in finite-state decoding graphs with cross-word acoustic context.
Comput. Speech Lang., 2004

Advances in Large Vocabulary Continuous Speech Recognition.
Adv. Comput., 2004

Speech recognition error analysis on the English MALACH corpus.
Proceedings of the INTERSPEECH 2004, 2004

Use of metadata to improve recognition of spontaneous speech and named entities.
Proceedings of the INTERSPEECH 2004, 2004

2003
Bayesian network structures and inference techniques for automatic speech recognition.
Comput. Speech Lang., 2003

An architecture for rapid decoding of large vocabulary conversational speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Toward domain-independent conversational speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic construction of unique signatures and confusable sets for natural language directory assistance applications.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Maximum entropy model for punctuation annotation from speech.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Structurally discriminative graphical models for automatic speech recognition - results from the 2001 Johns Hopkins Summer Workshop.
Proceedings of the IEEE International Conference on Acoustics, 2002

The graphical models toolkit: An open source software system for speech and time-series processing.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Extracting caller information from voicemail.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Linear feature space projections for speaker adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2001

Information Extraction from Voicemail.
Proceedings of the Association for Computational Linguistic, 2001

2000
Exact alpha-beta computation in logarithmic space with application to MAP word graph construction.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Recent improvements in speech recognition performance on large vocabulary conversational speech (voicemail and switchboard).
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Boosting Gaussian mixtures in an LVCSR system.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Dependency modeling with bayesian networks in a voicemail transcription system.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Recent improvements in voicemail transcription.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Probabilistic modeling with Bayesian networks for automatic speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Speech Recognition with Dynamic Bayesian Networks.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

1997
Syntactic Clustering of the Web.
Comput. Networks, 1997

1995
Physical Mapping of Chromosomes Using Unique Probes.
J. Comput. Biol., 1995

An Effective Tour Construction and Improvement Procedure for the Traveling Salesman Problem.
Oper. Res., 1995

The Bit Vector Intersection Problem (Preliminary Version).
Proceedings of the 36th Annual Symposium on Foundations of Computer Science, 1995


  Loading...