Zhijie Yan

CoRR, 2019

Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition.

Shiliang Zhang

Ming Lei

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Uncertainty analysis of dynamic thermal rating based on environmental parameter estimation.

EURASIP J. Wirel. Commun. Netw., 2018

A Study on Improving Acoustic Model for Robust and Far-Field Speech Recognition.

Proceedings of the 23rd IEEE International Conference on Digital Signal Processing, 2018

Deep-FSMN for Large Vocabulary Continuous Speech Recognition.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Linear Networks Based Speaker Adaptation for Speech Synthesis.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Deep Feed-Forward Sequential Memory Networks for Speech Synthesis.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Analysis on Ampacity of Overhead Transmission Lines Being Operated.

Yanling Wang

Likai Liang

J. Inf. Process. Syst., 2017

Improving latency-controlled BLSTM acoustic models for online speech recognition.

Shaofei Xue

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Rapid speaker adaptation based on D-code extracted from BLSTM-RNN in LVCSR.

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Unsupervised speaker adaptation of BLSTM-RNN for LVCSR based on speaker code.

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

2015

Training deep bidirectional LSTM acoustic model for LVCSR by a context-sensitive-chunk BPTT approach.

Kai Chen

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A context-sensitive-chunk BPTT approach to training deep LSTM/BLSTM recurrent neural networks for offline handwriting recognition.

Kai Chen

Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

2014

An Unsupervised Adaptation Approach to Leveraging Feedback Loop Data by Using i-Vector for Data Clustering and Selection.

IEEE ACM Trans. Audio Speech Lang. Process., 2014

2013

A Unified Trajectory Tiling Approach to High Quality Speech Rendering.

Yao Qian

IEEE Trans. Speech Audio Process., 2013

A scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Tied-state based discriminative training of context-expanded region-dependent feature transforms for LVCSR.

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Tip tap tones: mobile microtraining of Mandarin sounds.

Proceedings of the Mobile HCI '12, 2012

A feature-transform based approach to unsupervised task adaptation and personalization.

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A comparative study of fMPE and RDLT approaches to LVCSR.

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A study of discriminative feature extraction for i-vector based acoustic sniffing in IVN acoustic model training.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

A new i-vector approach and its application to irrelevant variability normalization based acoustic model training.

Yu Zhang

Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

An i-vector Based Approach to Training Data Clustering for Improved Speech Recognition.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An i-vector Based Approach to Acoustic Sniffing for Irrelevant Variability Normalization Based Acoustic Model Training and Speech Recognition.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A study of an irrelevant variability normalization based discriminative training approach for LVCSR.

Proceedings of the IEEE International Conference on Acoustics, 2011

Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model.

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

An HMM trajectory tiling (HTT) approach to high quality TTS.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A perceptual study of acceleration parameters in HMM-based TTS.

Yining Chen

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Cross-validation based decision tree clustering for HMM-based TTS.

Yu Zhang

Proceedings of the IEEE International Conference on Acoustics, 2010

Improved modeling for F0 generation and V/U decision in HMM-based TTS.

Proceedings of the IEEE International Conference on Acoustics, 2010

Rich-context Unit Selection (RUS) approach to high quality TTS.

Yao Qian

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Rich context modeling for high quality HMM-based TTS.

Yao Qian

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A trust region based optimization for maximum mutual information estimation of HMMs in speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Investigation on Adaptation Using Different Discriminative Training Criteria Based Linear Regression and MAP.

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Soft margin estimation with various separation levels for LVCSR.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Minimum word classification error training of HMMs for automatic speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Word Graph Based Feature Enhancement for Noisy Speech Recognition.
