Kartik Audhkhasi

According to our database1, Kartik Audhkhasi authored at least 42 papers between 2007 and 2018.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepage:

On csauthors.net:

Bibliography

2018
Modeling Multiple Time Series Annotations as Noisy Distortions of the Ground Truth: An Expectation-Maximization Approach.
IEEE Trans. Affective Computing, 2018

Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Whole Sentence Neural Language Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Building Competitive Direct Acoustics-to-Word Models for English Conversational Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Recent progress in deep end-to-end models for spoken language processing.
IBM Journal of Research and Development, 2017

English Conversational Telephone Speech Recognition by Humans and Machines.
Proceedings of the Interspeech 2017, 2017

Direct Acoustics-to-Word Models for English Conversational Speech Recognition.
Proceedings of the Interspeech 2017, 2017

End-to-end speech recognition and keyword search on low-resource languages.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Knowledge distillation across ensembles of multilingual models for low-resource languages.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

End-to-end ASR-free keyword search from speech.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Noise-enhanced convolutional neural networks.
Neural Networks, 2016

Detecting paralinguistic events in audio stream using context in features and probabilistic decisions.
Computer Speech & Language, 2016

Multilingual Data Selection for Low Resource Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Efficient one-vs-one kernel ridge regression for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Semantic word embedding neural network language models for automatic speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
A mixture of experts approach towards intelligibility classification of pathological speech.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Multilingual representations for low resource speech recognition and keyword search.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

Training ensemble of diverse classifiers on feature subsets.
Proceedings of the IEEE International Conference on Acoustics, 2014

Semi-supervised term-weighted value rescoring for keyword search.
Proceedings of the IEEE International Conference on Acoustics, 2014

Fusion of diverse denoising systems for robust automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
A Globally-Variant Locally-Constant Model for Fusion of Labels from Multiple Diverse Experts without Using Reference Labels.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Which ASR should I choose for my dialogue system?
Proceedings of the SIGDIAL 2013 Conference, 2013

Paralinguistic event detection from speech using probabilistic time-series smoothing and masking.
Proceedings of the INTERSPEECH 2013, 2013

Classifying language-related developmental disorders from speech cues: the promise and the potential confounds.
Proceedings of the INTERSPEECH 2013, 2013

Empirical link between hypothesis diversity and fusion performance in an ensemble of automatic speech recognition systems.
Proceedings of the INTERSPEECH 2013, 2013

Noisy hidden Markov models for speech recognition.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Noise benefits in backpropagation and deep bidirectional pre-training.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Joint training of interpolated exponential n-gram models.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
A reranking approach for recognition and classification of speech input in conversational dialogue systems.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Speaker Personality Classification Using Systems Based on Acoustic-Lexical Cues and an Optimal Tree-Structured Bayesian Network.
Proceedings of the INTERSPEECH 2012, 2012

Creating ensemble of diverse maximum entropy models.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Analyzing quality of crowd-sourced speech transcriptions of noisy audio for acoustic model adaptation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Reliability-Weighted Acoustic Model Adaptation Using Crowd Sourced Transcriptions.
Proceedings of the INTERSPEECH 2011, 2011

Emotion classification from speech using evaluator reliability-weighted combination of ranked lists.
Proceedings of the IEEE International Conference on Acoustics, 2011

Accurate transcription of broadcast news speech using multiple noisy transcribers and unsupervised reliability metrics.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Automatic speech recognition system channel modeling.
Proceedings of the INTERSPEECH 2010, 2010

Data-dependent evaluator modeling and its application to emotional valence classification from speech.
Proceedings of the INTERSPEECH 2010, 2010

2009
Automatic evaluation of spoken english fluency.
Proceedings of the IEEE International Conference on Acoustics, 2009

Formant-based technique for automatic filled-pause detection in spontaneous spoken english.
Proceedings of the IEEE International Conference on Acoustics, 2009

Lattice-based lexical cues for word fragment detection in conversational speech.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2007
Keyword Search using Modified Minimum Edit Distance Measure.
Proceedings of the IEEE International Conference on Acoustics, 2007


  Loading...