Yifan Gong

According to our database1, Yifan Gong authored at least 162 papers between 1987 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Adversarial Speaker Adaptation.
CoRR, 2019

Adversarial Speaker Verification.
CoRR, 2019

Attentive Adversarial Learning for Domain-Invariant Training.
CoRR, 2019

Conditional Teacher-Student Learning.
CoRR, 2019

Speaker Adaptation for End-to-End CTC Models.
CoRR, 2019

2018
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units.
CoRR, 2018

Cycle-Consistent Speech Enhancement.
CoRR, 2018

Adversarial Feature-Mapping for Speech Enhancement.
CoRR, 2018

Layer Trajectory LSTM.
CoRR, 2018

Developing Far-Field Speaker System Via Teacher-Student Learning.
CoRR, 2018

Speaker-Invariant Training via Adversarial Learning.
CoRR, 2018

Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation.
CoRR, 2018

Cracking the cocktail party problem by multi-beam deep attractor network.
CoRR, 2018

Advancing Acoustic-to-Word CTC Model.
CoRR, 2018

Advancing Connectionist Temporal Classification With Attention Modeling.
CoRR, 2018

Speaker Adaptation for End-to-End CTC Models.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Exploring Layer Trajectory LSTM with Depth Processing Units and Attention.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

BitFlow: Exploiting Vector Parallelism for Binary Neural Networks on CPU.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Adversarial Feature-Mapping for Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Cycle-Consistent Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Layer Trajectory LSTM.
Proceedings of the Interspeech 2018, 2018

Domain and Speaker Adaptation for Cortana Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speaker-Invariant Training Via Adversarial Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Developing Far-Field Speaker System Via Teacher-Student Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Advancing Acoustic-to-Word CTC Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Advancing Connectionist Temporal Classification with Attention Modeling.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Efficient Integration of Fixed Beamformers and Speech Separation Networks for Multi-Channel Far-Field Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Network Performance Aware Optimizations on IaaS Clouds.
IEEE Trans. Computers, 2017

Acoustic-To-Word Model Without OOV.
CoRR, 2017

Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition.
CoRR, 2017

Large-Scale Domain Adaptation via Teacher-Student Learning.
CoRR, 2017

End-to-End Attention based Text-Dependent Speaker Verification.
CoRR, 2017

Efficient process mapping in geo-distributed cloud data centers.
Proceedings of the International Conference for High Performance Computing, 2017

Large-Scale Domain Adaptation via Teacher-Student Learning.
Proceedings of the Interspeech 2017, 2017

Don't Count on ASR to Transcribe for You: Breaking Bias with Two Crowds.
Proceedings of the Interspeech 2017, 2017

Improving Mask Learning Based Speech Enhancement System with Restoration Layers and Residual Connection.
Proceedings of the Interspeech 2017, 2017

Extended low-rank plus diagonal adaptation for deep and recurrent neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Improved cepstra minimum-mean-square-error noise reduction algorithm for robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Unsupervised adaptation with domain separation networks for robust speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Acoustic-to-word model without OOV.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Cracking the cocktail party problem by multi-beam deep attractor network.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016
End-to-End attention based text-dependent speaker verification.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Semi-Supervised Training in Deep Learning Acoustic Model.
Proceedings of the Interspeech 2016, 2016

Low-rank plus diagonal adaptation for deep neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Recurrent support vector machines for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Geo-location dependent deep neural network acoustic model for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Simplifying long short-term memory acoustic models for fast training and decoding.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Investigations on speaker adaptation of LSTM RNN models for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploring multidimensional lstms for large vocabulary ASR.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Non-negative intermediate-layer DNN adaptation for a 10-KB speaker adaptation profile.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Network Performance Aware MPI Collective Communication Operations in the Cloud.
IEEE Trans. Parallel Distrib. Syst., 2015

Monetary cost optimizations for MPI-based HPC applications on Amazon clouds: checkpoints and replicated execution.
Proceedings of the International Conference for High Performance Computing, 2015

SVD-based universal DNN modeling for multiple scenarios.
Proceedings of the INTERSPEECH 2015, 2015

Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation.
Proceedings of the INTERSPEECH 2015, 2015

Delta-melspectra features for noise robustness to DNN-based ASR systems.
Proceedings of the INTERSPEECH 2015, 2015

Confidence-features and confidence-scores for ASR applications in arbitration and DNN speaker adaptation.
Proceedings of the INTERSPEECH 2015, 2015

Regularized sequence-level deep neural network model adaptation.
Proceedings of the INTERSPEECH 2015, 2015

Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Deep neural support vector machines for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Small-footprint high-performance deep neural network-based speech recognition using split-VQ.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Estimating confidence scores on ASR results using recurrent neural networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

An analysis of convolutional neural networks for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

LSTM time and frequency recurrence for automatic speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
An Overview of Noise-Robust Automatic Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

A fast maximum likelihood nonlinear feature transformation method for GMM-HMM speaker adaptation.
Neurocomputing, 2014

Variable-activation and variable-input deep neural network for robust speech recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Finding Constant from Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds.
Proceedings of the International Conference for High Performance Computing, 2014

Variable-component deep neural network for robust speech recognition.
Proceedings of the INTERSPEECH 2014, 2014

Learning small-size DNN with output-distribution-based criteria.
Proceedings of the INTERSPEECH 2014, 2014

Normalization of ASR confidence classifier scores via confidence mapping.
Proceedings of the INTERSPEECH 2014, 2014

Multi-accent deep neural network acoustic model with accent-specific top layer using the KLD-regularized model adaptation.
Proceedings of the INTERSPEECH 2014, 2014

A comparative analytic study on the Gaussian mixture and context dependent deep neural network hidden Markov models.
Proceedings of the INTERSPEECH 2014, 2014

Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks.
Proceedings of the INTERSPEECH 2014, 2014

Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network.
Proceedings of the IEEE International Conference on Acoustics, 2014

Factorized adaptation for deep neural network.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Restructuring of deep neural network acoustic models with singular value decomposition.
Proceedings of the INTERSPEECH 2013, 2013

Semi-supervised GMM and DNN acoustic model training with multi-system combination and confidence re-calibration.
Proceedings of the INTERSPEECH 2013, 2013

Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers.
Proceedings of the IEEE International Conference on Acoustics, 2013

Predicting speech recognition confidence using deep learning with word identity and score features.
Proceedings of the IEEE International Conference on Acoustics, 2013

Recent advances in deep learning for speech research at Microsoft.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Adaptation of context-dependent deep neural networks for automatic speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

An overview of CMPI: network performance aware MPI in the cloud.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

A Feature Space Transformation Method for Personalization using Generalized I-Vector Clustering.
Proceedings of the INTERSPEECH 2012, 2012

Efficient VTS Adaptation Using Jacobian Approximation.
Proceedings of the INTERSPEECH 2012, 2012

Improvements to VTS feature enhancement.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2010
Unscented transform with online distortion estimation for HMM adaptation.
Proceedings of the INTERSPEECH 2010, 2010

2009
A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models.
IEEE Trans. Audio, Speech & Language Processing, 2009

A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions.
Computer Speech & Language, 2009

Cross-lingual speech recognition under runtime resource constraints.
Proceedings of the IEEE International Conference on Acoustics, 2009

A study on multilingual acoustic modeling for large vocabulary ASR.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor.
IEEE Trans. Audio, Speech & Language Processing, 2008

Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Parameter clustering and sharing in variable-parameter HMMs for noise robust speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

Discriminative training of variable-parameter HMMs for noise robust speech recognition.
Proceedings of the INTERSPEECH 2008, 2008

A minimum-mean-square-error noise reduction algorithm on Mel-frequency cepstra for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Adaptation of compressed HMM parameters for resource-constrained speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition.
IEEE Trans. Audio, Speech & Language Processing, 2007

High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Modeling Variance Variation in a Variable Parameter HMM Framework for Noise Robust Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2004
Can back-ends be more robust than front-ends? Investigation over the Aurora-2 database.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Model-space compensation of microphone and noise for speaker-independent speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Variable parameter Gaussian mixture hidden Markov modeling for speech recognition.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Noise-dependent Gaussian mixture classifiers for robust rejection decision.
IEEE Trans. Speech and Audio Processing, 2002

The effects of speech compression on speech recognition and text-to-speech synthesis.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Experiments on speaker-independent voice command recognition using in-vehicle hands free speech.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A comparative study of approximations for parallel model combination of static and dynamic parameters.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Noise-robust open-set speaker recognition using noise-dependent Gaussian mixture classifier.
Proceedings of the IEEE International Conference on Acoustics, 2002

2000
HMM adaptation and microphone array processing for distant speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

Implementing a high accuracy speaker-independent continuous speech recognizer on a fixed-point DSP.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
A minimum cross-entropy approach to hidden Markov model adaptation.
IEEE Signal Process. Lett., 1999

Speaker-dependent name dialing in a car environment with out-of-vocabulary rejection.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Speech-enabled information retrieval in the automobile environment.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
A general joint additive and convolutive bias compensation approach applied to noisy Lombard speech recognition.
IEEE Trans. Speech and Audio Processing, 1998

Assessing the importance of the segmentation probability in segment-based speech recognition.
Speech Communication, 1998

Environment normalization training and environment adaptation using mixture stochastic trajectory model.
Speech Communication, 1998

1997
Stochastic trajectory modeling and sentence searching for continuous speech recognition.
IEEE Trans. Speech and Audio Processing, 1997

Speaker normalization training for mixture stochastic trajectory model.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Source normalization training for HMM applied to noisy telephone speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

An acoustic subword unit approach to non-linguistic speech feature identification.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Correlation based predictive adaptation of hidden Markov models.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

The importance of segmentation probability in segment based speech recognizers.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Elimination of trajectory folding phenomenon: HMM, trajectory mixture HMM and mixture stochastic trajectory model.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

A unified maximum likelihood approach to acoustic mismatch compensation: application to noisy Lombard speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Comparative experiments of several adaptation approaches to noisy speech recognition using stochastic trajectory models.
Speech Communication, 1996

Estimation of mixtures of stochastic dynamic trajectories: application to continuous speech recognition.
Computer Speech & Language, 1996

A study on continuous Chinese speech recognition based on stochastic trajectory models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Improvement in n-best search for continuous speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Stochastic trajectory model with state-mixture for continuous speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Modelling long term variability information in mixture stochastic trajectory framework.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

A semi-continuous stochastic trajectory model for phoneme-based continuous speech recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Probabilistic mapping networks for speaker recognition.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Speech recognition in noisy environments: A survey.
Speech Communication, 1995

Noise adaptation using linear regression for continuous noisy speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Speaker recognition with temporal transition models.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

On MMI learning of Gaussian mixture for speaker models.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Evaluation of Bayes decision approach to automatic determination of thresholds for speaker verification.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Stochastic trajectory models for speech recognition: an extension to modelling time correlation.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Stochastic trajectory modeling for recognition of unconstrained handwritten words.
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

1994
Off-line Handwriting Recognition by Statistical Correlation.
Proceedings of IAPR Workshop on Machine Vision Applications, 1994

A comparison of three noisy speech recognition approaches.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Nonlinear time alignment in stochastic trajectory models for speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Noise independent speech recognition for a variety of noise types.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Stochastic trajectory modeling for speech recognition.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Plausibility functions in continuous speech recognition: The VINICS system.
Speech Communication, 1993

A Bayesian approach to phone duration adaptation for lombard speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Use of explicit context-dependent phonemic model in continuous speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Duration of phones as function of utterance length and its use in automatic speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Iterative transformation and alignment for speech labeling.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Base transformation for environment adaptation in continuous speech recognition.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Minimization of speech alignment error by iterative transformation for speaker adaptation.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

DTW-based phonetic labeling using explicit phoneme duration constraints.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

1991
Signal-to-String Conversion Based on High Likelihood Regions Using Embedded Dynamic Programming.
IEEE Trans. Pattern Anal. Mach. Intell., 1991

VINICS: a continuous speech recognizer based on a new robust formulation.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Comparing two phoneme identification methods using a continuous speech recognizer.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1989
Parallel construction of syntactic structure for continuous speech recognition.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

1987
Time domain harmonic matching pitch estimation using time-dependent speech modeling.
IEEE Trans. Acoustics, Speech, and Signal Processing, 1987

Phoneme-based continuous speech recognition without pre-segmentation.
Proceedings of the European Conference on Speech Technology, 1987


  Loading...