Tara N. Sainath

According to our database, Tara N. Sainath authored at least 109 papers between 2006 and 2018.

Bibliography

2018
Contextual Speech Recognition with Difficult Negative Training Examples.
CoRR, 2018

Deep context: end-to-end contextual speech recognition.
CoRR, 2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.
CoRR, 2018

Contextual Speech Recognition in End-to-end Neural Network Systems Using Beam Search.
Proceedings of the Interspeech 2018, 2018

Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Compression of End-to-End Models.
Proceedings of the Interspeech 2018, 2018

Multilingual Speech Recognition with a Single End-to-End Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving the Performance of Online Neural Transducer Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Spectral Distortion Model for Training Phase-Sensitive Deep-Neural Networks for Far-Field Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Performance of Mask Based Statistical Beamforming in a Smart Home Scenario.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Temporal Modeling Using Dilated Convolution and Gating for Voice-Activity-Detection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Parallel Deep Neural Network Training for Big Data on Blue Gene/Q.
IEEE Trans. Parallel Distrib. Syst., 2017

Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

An analysis of incorporating an external language model into a sequence-to-sequence model.
CoRR, 2017

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
CoRR, 2017

Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models.
CoRR, 2017

Improving the Performance of Online Neural Transducer Models.
CoRR, 2017

State-of-the-art Speech Recognition With Sequence-to-Sequence Models.
CoRR, 2017

Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model.
CoRR, 2017

Multilingual Speech Recognition With A Single End-To-End Model.
CoRR, 2017

Annealed f-Smoothing as a Mechanism to Speed up Neural Network Training.
Proceedings of the Interspeech 2017, 2017

Highway-LSTM and Recurrent Highway Networks for Speech Recognition.
Proceedings of the Interspeech 2017, 2017

An Analysis of "Attention" in Sequence-to-Sequence Models.
Proceedings of the Interspeech 2017, 2017

A Comparison of Sequence-to-Sequence Models for Speech Recognition.
Proceedings of the Interspeech 2017, 2017


Reducing the Computational Complexity of Two-Dimensional LSTMs.
Proceedings of the Interspeech 2017, 2017

Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home.
Proceedings of the Interspeech 2017, 2017

Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition.
Proceedings of the Interspeech 2017, 2017

Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Raw Multichannel Processing Using Deep Neural Networks.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Learning Compact Recurrent Neural Networks.
CoRR, 2016

Feature Learning with Raw-Waveform CLDNNs for Voice Activity Detection.
Proceedings of the Interspeech 2016, 2016

Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling.
Proceedings of the Interspeech 2016, 2016

Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction.
Proceedings of the Interspeech 2016, 2016

Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks.
Proceedings of the Interspeech 2016, 2016

Lower Frame Rate Neural Network Acoustic Models.
Proceedings of the Interspeech 2016, 2016

Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition.
Proceedings of the Interspeech 2016, 2016

Factored spatial and spectral multichannel raw waveform CLDNNs.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Learning compact recurrent neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Deep Convolutional Neural Networks for Large-scale Speech Tasks.
Neural Networks, 2015

Structured Transforms for Small-Footprint Deep Learning.
CoRR, 2015

Structured Transforms for Small-Footprint Deep Learning.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Learning the speech front-end with raw waveform CLDNNs.
Proceedings of the INTERSPEECH 2015, 2015

Convolutional neural networks for small-footprint keyword spotting.
Proceedings of the INTERSPEECH 2015, 2015

Large vocabulary automatic speech recognition for children.
Proceedings of the INTERSPEECH 2015, 2015

Locally-connected and convolutional neural networks for small footprint speaker recognition.
Proceedings of the INTERSPEECH 2015, 2015

Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Query-by-example keyword spotting using long short-term memory networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Acoustic modelling with CD-CTC-sMBR LSTM RNNs.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Parallel Deep Neural Network Training for Big Data on Blue Gene/Q.
Proceedings of the International Conference for High Performance Computing, 2014

Deep scattering spectra with deep neural networks for LVCSR tasks.
Proceedings of the INTERSPEECH 2014, 2014

Parallel deep neural network training for LVCSR tasks using Blue Gene/Q.
Proceedings of the INTERSPEECH 2014, 2014

Joint training of convolutional and non-convolutional neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improvements to filterbank and delta learning within a deep neural network framework.
Proceedings of the IEEE International Conference on Acoustics, 2014

Deep Scattering Spectrum with deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Kernel methods match Deep Neural Networks on TIMIT.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks.
IEEE Trans. Audio, Speech & Language Processing, 2013

Improvements to deep convolutional neural networks for LVCSR.
CoRR, 2013

Improving training time of Hessian-free optimization for deep neural networks using preconditioning and sampling.
CoRR, 2013

Deep convolutional neural networks for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2013

Low-rank matrix factorization for Deep Neural Network training with high-dimensional output targets.
Proceedings of the IEEE International Conference on Acoustics, 2013

An evaluation of posterior modeling techniques for phonetic recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Improving deep neural networks for LVCSR using rectified linear units and dropout.
Proceedings of the IEEE International Conference on Acoustics, 2013

Developing speech recognition systems for corpus indexing under the IARPA Babel program.
Proceedings of the IEEE International Conference on Acoustics, 2013

Learning filter banks within a deep neural network framework.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Improvements to Deep Convolutional Neural Networks for LVCSR.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Accelerating Hessian-free optimization for Deep Neural Networks by implicit preconditioning and sampling.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Exemplar-Based Processing for Speech Recognition: An Overview.
IEEE Signal Process. Mag., 2012

Deep Neural Network Language Models.
Proceedings of the Workshop: Will We Ever Really Replace the N-gram Model? On the Future of Language Modeling for HLT, 2012

Enhancing Exemplar-Based Posteriors for Speech Recognition Tasks.
Proceedings of the INTERSPEECH 2012, 2012

Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization.
Proceedings of the INTERSPEECH 2012, 2012

Auto-encoder bottleneck features using deep belief networks.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Improved pre-training of Deep Belief Networks using Sparse Encoding Symmetric Machines.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

N-best entropy based data selection for acoustic modeling.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Reducing Computational Complexities of Exemplar-Based Sparse Representations with Applications to Large Vocabulary Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Convergence of Line Search A-Function Methods.
Proceedings of the INTERSPEECH 2011, 2011

Application specific loss minimization using gradient boosting.
Proceedings of the IEEE International Conference on Acoustics, 2011

Exemplar-based Sparse Representation phone identification features.
Proceedings of the IEEE International Conference on Acoustics, 2011

Deep Belief Networks using discriminative features for phone recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

A-Functions: A generalization of Extended Baum-Welch transformations to convex optimization.
Proceedings of the IEEE International Conference on Acoustics, 2011

A convex hull approach to sparse representations for exemplar-based speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Making Deep Belief Networks effective for large vocabulary continuous speech recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Data selection for language modeling using sparse representations.
Proceedings of the INTERSPEECH 2010, 2010

Sparse representation features for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Sparse representations for text categorization.
Proceedings of the INTERSPEECH 2010, 2010

An analysis of sparseness and regularization in exemplar-based methods for speech classification.
Proceedings of the INTERSPEECH 2010, 2010

Incorporating sparse representation phone identification features in automatic speech recognition using exponential families.
Proceedings of the INTERSPEECH 2010, 2010

A voice-commandable robotic forklift working alongside humans in minimally-prepared outdoor environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Bayesian compressive sensing for phonetic classification.
Proceedings of the IEEE International Conference on Acoustics, 2010

The Use of isometric transformations and Bayesian estimation in compressive sensing for fMRI classification.
Proceedings of the IEEE International Conference on Acoustics, 2010

Kalman filtering for compressed sensing.
Proceedings of the 13th Conference on Information Fusion, 2010

2009
Applications of broad class knowledge for noise robust speech recognition.
PhD thesis, 2009

A generalized family of parameter estimation techniques.
Proceedings of the IEEE International Conference on Acoustics, 2009

An exploration of large vocabulary tools for small vocabulary phonetic recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Island-driven search using broad phonetic classes.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
A comparison of broad phonetic and acoustic units for noise robust segment-based phonetic recognition.
Proceedings of the INTERSPEECH 2008, 2008

Generalization of extended Baum-Welch parameter estimation for discriminative training and decoding.
Proceedings of the INTERSPEECH 2008, 2008

Gradient steepness metrics using extended Baum-Welch transformations for universal pattern recognition tasks.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Audio classification using extended Baum-Welch transformations.
Proceedings of the INTERSPEECH 2007, 2007

Unsupervised Audio Segmentation using Extended Baum-Welch Transformations.
Proceedings of the IEEE International Conference on Acoustics, 2007

Broad phonetic class recognition in a Hidden Markov model framework using extended Baum-Welch transformations.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
A Sinusoidal Model Approach to Acoustic Landmark Detection and Segmentation for Robust Segment-Based Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

