Sri Harish Reddy Mallidi

According to our database1, Sri Harish Reddy Mallidi authored at least 42 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Max-Margin Transducer Loss: Improving Sequence-Discriminative Training Using a Large-Margin Learning Strategy.
Proceedings of the IEEE International Conference on Acoustics, 2024

2022
Do You Listen with one or two Microphones? A Unified ASR Model for Single and Multi-Channel Audio.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

2021
Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition.
CoRR, 2021

Streaming ResLSTM with Causal Mean Aggregation for Device-Directed Utterance Detection.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

wav2vec-C: A Self-Supervised Model for Speech Representation Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Multi-Stream End-to-End Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition.
CoRR, 2020

2019
Improving ASR Confidence Scores for Alexa Using Acoustic and Hypothesis Embeddings.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Study for Improving Device-Directed Speech Detection Toward Frictionless Human-Machine Interaction.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Stream Attention-based Multi-array End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Multi-encoder multi-resolution framework for end-to-end speech recognition.
CoRR, 2018

Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Device-directed Utterance Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
On the relevance of auditory-based Gabor features for deep learning in robust speech recognition.
Comput. Speech Lang., 2017

On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition.
CoRR, 2017

The MIT-LL, JHU and LRDE NIST 2016 Speaker Recognition Evaluation System.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Predicting error rates for unknown data in automatic speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Performance monitoring for automatic speech recognition in noisy multi-channel environments.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

A Framework for Practical Multistream ASR.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Novel neural network based fusion for multistream ASR.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Autoencoder based multi-stream combination for noise robust speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Uncertainty estimation of DNN classifiers.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Robust speech recognition in unknown reverberant and noisy conditions.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Robust Feature Extraction Using Modulation Filtering of Autoregressive Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Neural Network Bottleneck Features for Language Identification.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Progress in the BBN keyword search system for the DARPA RATS program.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Robust speaker recognition using spectro-temporal autoregressive models.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Improvements in language identification on the RATS noisy speech corpus.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Developing a speaker identification system for the DARPA RATS project.
Proceedings of the IEEE International Conference on Acoustics, 2013

Frequency offset correction in speech without detecting pitch.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Regularized Auto-Associative Neural Networks for Speaker Verification.
IEEE Signal Process. Lett., 2012

Adaptation transforms of auto-associative neural networks as features for speaker verification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Acoustic and Data-driven Features for Robust Speech Activity Detection.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Phone recognition in critical bands using sub-band temporal modulations.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012


2011
Modulation Spectrum Analysis for Recognition of Reverberant Speech.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Speaker-dependent mapping of source and system features for enhancement of throat microphone speech.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Significance of pitch synchronous analysis for speaker recognition using AANN models.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Analysis of laugh signals for detecting in continuous speech.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009


  Loading...