Sri Harish Reddy Mallidi

Takaaki Hori

Shinji Watanabe

CoRR, 2018

Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling.

Jaejin Cho

Murali Karthick Baskar

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Device-directed Utterance Detection.

Angel Mario Castro Martinez

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

On the relevance of auditory-based Gabor features for deep learning in robust speech recognition.

Angel Mario Castro Martinez

Comput. Speech Lang., 2017

On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition.

Pedro A. Torres-Carrasquillo

CoRR, 2017

The MIT-LL, JHU and LRDE NIST 2016 Speaker Recognition Evaluation System.

Phani Sankar Nidadavolu

Ruizhi Li

Réda Dehak

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Predicting error rates for unknown data in automatic speech recognition.

Hendrik Kayser

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Performance monitoring for automatic speech recognition in noisy multi-channel environments.

Angel Mario Castro Martinez

Guillermo Payá-Vayá

Hendrik Kayser

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

A Framework for Practical Multistream ASR.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition.

Ruizhi Li

Lukás Burget

Oldrich Plchot

Najim Dehak

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation.

Tetsuji Ogawa

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Novel neural network based fusion for multistream ASR.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Autoencoder based multi-stream combination for noise robust speech recognition.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Uncertainty estimation of DNN classifiers.

Tetsuji Ogawa

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Robust speech recognition in unknown reverberant and noisy conditions.

Stavros Tsakalidis

Richard M. Schwartz

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Robust Feature Extraction Using Modulation Filtering of Autoregressive Models.

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Neural Network Bottleneck Features for Language Identification.

Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Progress in the BBN keyword search system for the DARPA RATS program.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

Robust speaker recognition using spectro-temporal autoregressive models.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Improvements in language identification on the RATS noisy speech corpus.

Jeff Z. Ma

Bing Zhang

Spyros Matsoukas

Feipeng Li

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Developing a speaker identification system for the DARPA RATS project.

Proceedings of the IEEE International Conference on Acoustics, 2013

Frequency offset correction in speech without detecting pitch.

Pascal Clark

Aren Jansen

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Regularized Auto-Associative Neural Networks for Speaker Verification.

Sri Garimella

IEEE Signal Process. Lett., 2012

Adaptation transforms of auto-associative neural networks as features for speaker verification.

Samuel Thomas

Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Acoustic and Data-driven Features for Robust Speech Activity Detection.

Samuel Thomas

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Phone recognition in critical bands using sub-band temporal modulations.

Feipeng Li

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

The UMD-JHU 2011 speaker recognition system.

Daniel Garcia-Romero

Xinhui Zhou

Dmitry N. Zotkin

Balaji Vasan Srinivasan

Yuancheng Luo

Garimella S. V. S. Sivaram

Samuel Thomas

Sridhar Krishna Nemala

Majid Mirbagheri

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Modulation Spectrum Analysis for Recognition of Reverberant Speech.

Anand Joseph Xavier Medabalimi

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

Speaker-dependent mapping of source and system features for enhancement of throat microphone speech.

B. Yegnanarayana

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Significance of pitch synchronous analysis for speaker recognition using AANN models.

Suryakanth V. Gangashetty

Kishore Prahallad

B. Yegnanarayana

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Analysis of laugh signals for detecting in continuous speech.

K. Sudheer Kumar