Sri Harish Reddy Mallidi

According to our database, Sri Harish Reddy Mallidi authored at least 41 papers between 2009 and 2022.

Bibliography

2022
Do You Listen with one or two Microphones? A Unified ASR Model for Single and Multi-Channel Audio.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

2021
Attention-based Neural Beamforming Layers for Multi-channel Speech Recognition.
CoRR, 2021

Streaming ResLSTM with Causal Mean Aggregation for Device-Directed Utterance Detection.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

wav2vec-C: A Self-Supervised Model for Speech Representation Learning.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021

2020
Multi-Stream End-to-End Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Multi-view Frequency LSTM: An Efficient Frontend for Automatic Speech Recognition.
CoRR, 2020

2019
Improving ASR Confidence Scores for Alexa Using Acoustic and Hypothesis Embeddings.
Proceedings of the Interspeech 2019, 2019

A Study for Improving Device-Directed Speech Detection Toward Frictionless Human-Machine Interaction.
Proceedings of the Interspeech 2019, 2019

Stream Attention-based Multi-array End-to-end Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

2018
Multi-encoder multi-resolution framework for end-to-end speech recognition.
CoRR, 2018

Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Device-directed Utterance Detection.
Proceedings of the Interspeech 2018, 2018

2017
On the relevance of auditory-based Gabor features for deep learning in robust speech recognition.
Comput. Speech Lang., 2017

On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition.
CoRR, 2017


Predicting error rates for unknown data in automatic speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

2016
Performance monitoring for automatic speech recognition in noisy multi-channel environments.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

A Framework for Practical Multistream ASR.
Proceedings of the Interspeech 2016, 2016

Exploiting Hidden-Layer Responses of Deep Neural Networks for Language Recognition.
Proceedings of the Interspeech 2016, 2016

A new efficient measure for accuracy prediction and its application to multistream-based unsupervised adaptation.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Novel neural network based fusion for multistream ASR.
Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016

2015
Autoencoder based multi-stream combination for noise robust speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.
Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, 2015

Uncertainty estimation of DNN classifiers.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Robust speech recognition in unknown reverberant and noisy conditions.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Robust Feature Extraction Using Modulation Filtering of Autoregressive Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Neural Network Bottleneck Features for Language Identification.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Progress in the BBN keyword search system for the DARPA RATS program.
Proceedings of the INTERSPEECH 2014, 2014

2013
Robust speaker recognition using spectro-temporal autoregressive models.
Proceedings of the INTERSPEECH 2013, 2013

Improvements in language identification on the RATS noisy speech corpus.
Proceedings of the INTERSPEECH 2013, 2013

Developing a speaker identification system for the DARPA RATS project.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

Frequency offset correction in speech without detecting pitch.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

2012
Regularized Auto-Associative Neural Networks for Speaker Verification.
IEEE Signal Process. Lett., 2012

Adaptation transforms of auto-associative neural networks as features for speaker verification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Acoustic and Data-driven Features for Robust Speech Activity Detection.
Proceedings of the INTERSPEECH 2012, 2012

Phone recognition in critical bands using sub-band temporal modulations.
Proceedings of the INTERSPEECH 2012, 2012


2011
Modulation Spectrum Analysis for Recognition of Reverberant Speech.
Proceedings of the INTERSPEECH 2011, 2011

2010
Speaker-dependent mapping of source and system features for enhancement of throat microphone speech.
Proceedings of the INTERSPEECH 2010, 2010

Significance of pitch synchronous analysis for speaker recognition using AANN models.
Proceedings of the INTERSPEECH 2010, 2010

2009
Analysis of laugh signals for detecting in continuous speech.
Proceedings of the INTERSPEECH 2009, 2009

