Dong Yu

According to our database1, Dong Yu authored at least 142 papers between 2003 and 2018.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2018
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem.
Frontiers of IT & EE, 2018

Past review, current progress, and challenges ahead on the cocktail party problem.
Frontiers of IT & EE, 2018

Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures.
CoRR, 2018

Recent Progresses in Deep Learning based Acoustic Models (Updated).
CoRR, 2018

Monaural Multi-Talker Speech Recognition with Attention Mechanism and Gated Convolutional Networks.
Proceedings of the Interspeech 2018, 2018

Knowledge Transfer in Permutation Invariant Training for Single-Channel Multi-Talker Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Toward Human Parity in Conversational Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

Recognizing Multi-talker Speech with Permutation Invariant Training.
CoRR, 2017

Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training.
CoRR, 2017

Recognizing Multi-Talker Speech with Permutation Invariant Training.
Proceedings of the Interspeech 2017, 2017

The microsoft 2016 conversational speech recognition system.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Advanced Recurrent Neural Networks for Automatic Speech Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Discriminative Beamforming with Phase-Aware Neural Networks for Speech Enhancement and Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Sequence-Discriminative Training of Neural Networks.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
Neural Network Based Multi-Factor Aware Joint Training for Robust Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2016

Achieving Human Parity in Conversational Speech Recognition.
CoRR, 2016

The Microsoft 2016 Conversational Speech Recognition System.
CoRR, 2016

Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding.
Proceedings of the NAACL HLT 2016, 2016

Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention.
Proceedings of the Interspeech 2016, 2016

Highway long short-term memory RNNS for distant speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Prediction-adaptation-correction recurrent neural networks for low-resource language speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep beamforming networks for multi-channel speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Speaker-aware training of LSTM-RNNS for acoustic modelling.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Integrated adaptation with multi-factor joint-learning for far-field speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

An investigation into using parallel data for far-field speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Deep Neural Networks for Single-Channel Multi-Talker Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2015

Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2015

The Computational Network Toolkit [Best of the Web].
IEEE Signal Process. Mag., 2015

Highway Long Short-Term Memory RNNs for Distant Speech Recognition.
CoRR, 2015

Prediction-Adaptation-Correction Recurrent Neural Networks for Low-Resource Language Speech Recognition.
CoRR, 2015

Speech recognition with prediction-adaptation-correction recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improving speech recognition in reverberation using a room-aware deep neural network and multi-task learning.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Deep bi-directional recurrent networks over spectral windows.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Convolutional Neural Networks for Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation.
Neurocomputing, 2014

Deep Learning: Methods and Applications.
Foundations and Trends in Signal Processing, 2014

Spoken language understanding using long short-term memory neural networks.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

An introduction to computational networks and the computational network toolkit (invited talk).
Proceedings of the INTERSPEECH 2014, 2014

1-bit stochastic gradient descent and its application to data-parallel distributed training of speech DNNs.
Proceedings of the INTERSPEECH 2014, 2014

Multi-accent deep neural network acoustic model with accent-specific top layer using the KLD-regularized model adaptation.
Proceedings of the INTERSPEECH 2014, 2014

A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov models.
Proceedings of the INTERSPEECH 2014, 2014

Speech emotion recognition using deep neural network and extreme learning machine.
Proceedings of the INTERSPEECH 2014, 2014

Recurrent conditional random field for language understanding.
Proceedings of the IEEE International Conference on Acoustics, 2014

Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network.
Proceedings of the IEEE International Conference on Acoustics, 2014

Recurrent deep neural networks for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Single-channel mixed speech recognition using deep neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

On parallelizability of stochastic gradient descent for speech DNNS.
Proceedings of the IEEE International Conference on Acoustics, 2014

Phone sequence modeling with recurrent neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
The Deep Tensor Neural Network With Applications to Large Vocabulary Speech Recognition.
IEEE Trans. Audio, Speech & Language Processing, 2013

Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis.
IEEE Trans. Audio, Speech & Language Processing, 2013

Speech Recognition Using Long-Span Temporal Patterns in a Deep Network Model.
IEEE Signal Process. Lett., 2013

Tensor Deep Stacking Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Exploiting deep neural networks for detection-based speech recognition.
Neurocomputing, 2013

Feature Learning in Deep Neural Networks - A Study on Speech Recognition Tasks
CoRR, 2013

Recurrent neural networks for language understanding.
Proceedings of the INTERSPEECH 2013, 2013

Semi-supervised GMM and DNN acoustic model training with multi-system combination and confidence re-calibration.
Proceedings of the INTERSPEECH 2013, 2013

Deep segmental neural networks for speech recognition.
Proceedings of the INTERSPEECH 2013, 2013

Exploring convolutional neural network structures and optimization techniques for speech recognition.
Proceedings of the INTERSPEECH 2013, 2013

KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Error back propagation for sequence training of Context-Dependent Deep NetworkS for conversational speech transcription.
Proceedings of the IEEE International Conference on Acoustics, 2013

An investigation of deep neural networks for noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2013

Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers.
Proceedings of the IEEE International Conference on Acoustics, 2013

Recent advances in deep learning for speech research at Microsoft.
Proceedings of the IEEE International Conference on Acoustics, 2013

A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion.
Proceedings of the IEEE International Conference on Acoustics, 2013

Large-scale malware classification using random projections and neural networks.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Introduction to the Special Section on Deep Learning for Speech and Language Processing.
IEEE Trans. Audio, Speech & Language Processing, 2012

Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition.
IEEE Trans. Audio, Speech & Language Processing, 2012

Efficient and effective algorithms for training single-hidden-layer neural networks.
Pattern Recognition Letters, 2012

Adaptation of context-dependent deep neural networks for automatic speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Context-dependent Deep Neural Networks for audio indexing of real-life data.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Large Vocabulary Speech Recognition Using Deep Tensor Neural Networks.
Proceedings of the INTERSPEECH 2012, 2012

Parallel Training for Deep Stacking Networks.
Proceedings of the INTERSPEECH 2012, 2012

Pipelined Back-Propagation for Context-Dependent Deep Neural Networks.
Proceedings of the INTERSPEECH 2012, 2012

Conversational Speech Transcription Using Context-Dependent Deep Neural Networks.
Proceedings of the 29th International Conference on Machine Learning, 2012

Exploiting sparseness in deep neural networks for large vocabulary speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A deep architecture with bilinear modeling of hidden representations: Applications to phonetic recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Scalable stacking and learning for building deep architectures.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Deep Learning and Its Applications to Signal and Information Processing [Exploratory DSP].
IEEE Signal Process. Mag., 2011

In-Car Media Search.
IEEE Signal Process. Mag., 2011

Improved Bottleneck Features Using Pretrained Deep Neural Networks.
Proceedings of the INTERSPEECH 2011, 2011

Deep Convex Net: A Scalable Architecture for Speech Pattern Classification.
Proceedings of the INTERSPEECH 2011, 2011

Accelerated Parallelizable Neural Network Learning Algorithm for Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Conversational Speech Transcription Using Context-Dependent Deep Neural Networks.
Proceedings of the INTERSPEECH 2011, 2011

Large vocabulary continuous speech recognition with context-dependent DBN-HMMS.
Proceedings of the IEEE International Conference on Acoustics, 2011

Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Sequential Labeling Using Deep-Structured Conditional Random Fields.
J. Sel. Topics Signal Processing, 2010

Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion.
Computer Speech & Language, 2010

Deep-structured hidden conditional random fields for phonetic recognition.
Proceedings of the INTERSPEECH 2010, 2010

Investigation of full-sequence training of deep belief networks for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Unscented transform with online distortion estimation for HMM adaptation.
Proceedings of the INTERSPEECH 2010, 2010

Binary coding of speech spectrograms using a deep auto-encoder.
Proceedings of the INTERSPEECH 2010, 2010

Word confidence calibration using a maximum entropy model with constraints on confidence and word distributions.
Proceedings of the IEEE International Conference on Acoustics, 2010

Language recognition using deep-structured conditional random fields.
Proceedings of the IEEE International Conference on Acoustics, 2010

Semantic confidence calibration for spoken dialog applications.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models.
IEEE Trans. Audio, Speech & Language Processing, 2009

Using continuous features in the maximum entropy model.
Pattern Recognition Letters, 2009

A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions.
Computer Speech & Language, 2009

Hidden conditional random field with distribution constraints for phone classification.
Proceedings of the INTERSPEECH 2009, 2009

Cross-lingual speech recognition under runtime resource constraints.
Proceedings of the IEEE International Conference on Acoustics, 2009

Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion.
Proceedings of the IEEE International Conference on Acoustics, 2009

Maximizing global entropy reduction for active learning in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Using collective information in semi-supervised learning for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

A study on multilingual acoustic modeling for large vocabulary ASR.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
IEEE Trans. Audio, Speech & Language Processing, 2008

An Integrative and Discriminative Technique for Spoken Utterance Classification.
IEEE Trans. Audio, Speech & Language Processing, 2008

Large-margin minimum classification error training: A theoretical risk minimization perspective.
Computer Speech & Language, 2008

Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

Discriminative training of variable-parameter HMMs for noise robust speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptation of compressed HMM parameters for resource-constrained speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation.
Computer Speech & Language, 2007

Large-Margin Discriminative Training of Hidden Markov Models for Speech Recognition.
Proceedings of the First IEEE International Conference on Semantic Computing (ICSC 2007), 2007

Voice-Rate: A Dialog System for Consumer Ratings.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

The voice-rate dialog system for consumer ratings.
Proceedings of the INTERSPEECH 2007, 2007

Automated directory assistance system - from theory to practice.
Proceedings of the INTERSPEECH 2007, 2007

Handling phonetic context and speaker variation in a structure-based speech recognizer.
Proceedings of the INTERSPEECH 2007, 2007

Confidence measures for voice search applications.
Proceedings of the INTERSPEECH 2007, 2007

Voicepedia: towards speech-based access to unstructured information.
Proceedings of the INTERSPEECH 2007, 2007

Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Discriminative Training Framework using N-Best Speech Recognition Transcriptions and Scores for Spoken Utterance Classification.
Proceedings of the IEEE International Conference on Acoustics, 2007

Use of Differential Cepstra as Acoustic Features in Hidden Trajectory Modeling for Phonetic Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Structured speech modeling.
IEEE Trans. Audio, Speech & Language Processing, 2006

A bidirectional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition.
IEEE Trans. Audio, Speech & Language Processing, 2006

A lattice search technique for a long-contextual-span hidden trajectory model of speech.
Speech Communication, 2006

An effective and efficient utterance verification technology using word n-gram filler models.
Proceedings of the INTERSPEECH 2006, 2006

Use of incrementally regulated discriminative margins in MCE training for speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

A time-synchronous phonetic decoder for a long-contextual-Span hidden trajectory model.
Proceedings of the INTERSPEECH 2006, 2006

N-Gram Based Filler Model for Robust Grammar Authoring.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
A Speech-Centric Perspective for Human-Computer Interface: A Case Study.
VLSI Signal Processing, 2005

Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering.
IEEE Trans. Speech and Audio Processing, 2005

Evaluation of a long-contextual-Span hidden trajectory model and phonetic recognizer using a* lattice search.
Proceedings of the INTERSPEECH 2005, 2005

Learning statistically characterized resonance targets in a hidden trajectory model of speech coarticulation and reduction.
Proceedings of the INTERSPEECH 2005, 2005

Maximum Entropy Based Generic Filter for Language Model Adaptation.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

A Hidden Trajectory Model with Bi-directional Target-Filtering: Cascaded vs. Integrated Implementation for Phonetic Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Unsupervised learning from users' error correction in speech dictation.
Proceedings of the INTERSPEECH 2004, 2004

2003
Improved name recognition with user modeling.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003


  Loading...