Rohit Prabhavalkar

According to our database1, Rohit Prabhavalkar authored at least 50 papers between 2009 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Two-Pass End-to-End Speech Recognition.
CoRR, 2019

Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

Model Unit Exploration for Sequence-to-Sequence Speech Recognition.
CoRR, 2019

Streaming End-to-end Speech Recognition for Mobile Devices.
Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Endpointing and Decoding with End-to-end Models.
Proceedings of the IEEE International Conference on Acoustics, 2019

Phoebe: Pronunciation-aware Contextualization for End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Streaming End-to-end Speech Recognition For Mobile Devices.
CoRR, 2018

From Audio to Semantics: Approaches to end-to-end spoken language understanding.
CoRR, 2018

Deep context: end-to-end contextual speech recognition.
CoRR, 2018

Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer.
CoRR, 2018

Deep Context: End-to-end Contextual Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

From Audio to Semantics: Approaches to End-to-End Spoken Language Understanding.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Compression of End-to-End Models.
Proceedings of the Interspeech 2018, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving the Performance of Online Neural Transducer Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
An analysis of incorporating an external language model into a sequence-to-sequence model.
CoRR, 2017

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
CoRR, 2017

Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models.
CoRR, 2017

Improving the Performance of Online Neural Transducer Models.
CoRR, 2017

State-of-the-art Speech Recognition With Sequence-to-Sequence Models.
CoRR, 2017

Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition.
CoRR, 2017

Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models.
CoRR, 2017

An Analysis of "Attention" in Sequence-to-Sequence Models.
Proceedings of the Interspeech 2017, 2017

A Comparison of Sequence-to-Sequence Models for Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Exploring architectures, data and units for streaming end-to-end speech recognition with RNN-transducer.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Streaming small-footprint keyword spotting using sequence-to-sequence models.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition.
CoRR, 2016

Personalized Speech recognition on mobile devices.
CoRR, 2016

On the efficient representation and execution of deep acoustic models.
CoRR, 2016

On the Efficient Representation and Execution of Deep Acoustic Models.
Proceedings of the Interspeech 2016, 2016

On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Personalized speech recognition on mobile devices.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Compressing deep neural networks using a rank-constrained topology.
Proceedings of the INTERSPEECH 2015, 2015

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2013
Conditional Random Fields in Speech, Audio, and Language Processing.
Proceedings of the IEEE, 2013

An evaluation of posterior modeling techniques for phonetic recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Discriminative articulatory models for spoken term detection in low-resource conversational settings.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Discriminative spoken term detection with limited data.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

A chunk-based phonetic score for mobile voice search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Articulatory Feature Classification Using Nearest Neighbors.
Proceedings of the INTERSPEECH 2011, 2011

A factored conditional random field model for articulatory feature forced transcription.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Investigations into the Crandem Approach to Word Recognition.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Combining monaural and binaural evidence for reverberant speech segregation.
Proceedings of the INTERSPEECH 2010, 2010

Backpropagation training for multilayer conditional random field based phone recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Monaural segregation of voiced speech using discriminative random fields.
Proceedings of the INTERSPEECH 2009, 2009


  Loading...