Xunying Liu

According to our database1, Xunying Liu authored at least 111 papers between 2003 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2020
Cross-Domain Deep Visual Feature Generation for Mandarin Audio-Visual Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Replay and Synthetic Speech Detection with Res2net Architecture.
CoRR, 2020

Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling.
CoRR, 2020

Understanding the wiring evolution in differentiable neural architecture search.
CoRR, 2020

Neural Architecture Search for Speech Recognition.
CoRR, 2020

Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification.
CoRR, 2020

Audio-visual Multi-channel Recognition of Overlapped Speech.
CoRR, 2020

Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification.
CoRR, 2020

Deep segmental phonetic posterior-grams based discovery of non-categories in L2 English speech.
CoRR, 2020

Speaker-Aware Linear Discriminant Analysis in Speaker Verification.
Proceedings of the Interspeech 2020, 2020

Audio-Visual Multi-Channel Recognition of Overlapped Speech.
Proceedings of the Interspeech 2020, 2020

Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Transferring Source Style in Non-Parallel Voice Conversion.
Proceedings of the Interspeech 2020, 2020

Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification.
Proceedings of the Interspeech 2020, 2020

Investigation of Data Augmentation Techniques for Disordered Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

End-To-End Voice Conversion Via Cross-Modal Knowledge Distillation for Dysarthric Speech Reconstruction.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

End-To-End Accent Conversion Without Using Native Utterances.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Adversarial Attacks on GMM I-Vector Based Speaker Verification Systems.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Code-Switched Speech Synthesis Using Bilingual Phonetic Posteriorgram with Only Monolingual Corpora.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

DSNAS: Direct Neural Architecture Search Without Parameter Retraining.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models.
Proceedings of the Interspeech 2019, 2019

Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features.
Proceedings of the Interspeech 2019, 2019

Unsupervised Methods for Audio Classification from Lecture Discussion Recordings.
Proceedings of the Interspeech 2019, 2019

Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.
Proceedings of the Interspeech 2019, 2019

On the Use of Pitch Features for Disordered Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams.
Proceedings of the Interspeech 2019, 2019

Extract, Adapt and Recognize: An End-to-End Neural Network for Corrupted Monaural Speech Recognition.
Proceedings of the Interspeech 2019, 2019

LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition.
Proceedings of the Interspeech 2019, 2019

The CUHK Dysarthric Speech Recognition Systems for English and Cantonese.
Proceedings of the Interspeech 2019, 2019

Recurrent Neural Network Language Model Training Using Natural Gradient.
Proceedings of the IEEE International Conference on Acoustics, 2019

BLHUC: Bayesian Learning of Hidden Unit Contributions for Deep Neural Network Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Emotion Recognition Using Capsule Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

CNN-RNN-CTC Based End-to-end Mispronunciation Detection and Diagnosis.
Proceedings of the IEEE International Conference on Acoustics, 2019

Gaussian Process Lstm Recurrent Neural Network Language Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

End-to-end Code-switched TTS with Mix of Monolingual Recordings.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
The HCCL-CUHK System for the Voice Conversion Challenge 2018.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Investigation of Stacked Deep Neural Networks and Mixture Density Networks for Acoustic-to-Articulatory Inversion.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.
Proceedings of the Interspeech 2018, 2018

Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis.
Proceedings of the Interspeech 2018, 2018

Semi-supervised Cross-domain Visual Feature Learning for Audio-Visual Broadcast Speech Transcription.
Proceedings of the Interspeech 2018, 2018

Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance.
Proceedings of the Interspeech 2018, 2018

Unsupervised Discovery of Non-native Phonetic Patterns in L2 English Speech for Mispronunciation Detection and Diagnosis.
Proceedings of the Interspeech 2018, 2018

Gaussian Process Neural Networks for Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Feature Based Adaptation for Speaking Style Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Discovery of an Extended Phoneme Set in L2 English Speech for Mispronunciation Detection and Diagnosis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Limited-Memory BFGS Optimization of Recurrent Neural Network Language Models for Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Drawing-Based Automatic Dementia Screening Using Gaussian Process Markov Chains.
Proceedings of the 51st Hawaii International Conference on System Sciences, 2018

2017
Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem.
PLoS Comput. Biol., 2017

Future Word Contexts in Neural Network Language Models.
CoRR, 2017

RNN-LDA Clustering for Feature Based DNN Adaptation.
Proceedings of the Interspeech 2017, 2017

Multi-task learning of structured output layer bidirectional LSTMS for speech synthesis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multimodal learning using 3D audio-visual data for audio-visual speech recognition.
Proceedings of the 2017 International Conference on Asian Language Processing, 2017

2016
Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Deep Neural Network Based Acoustic-to-Articulatory Inversion Using Phone Sequence Information.
Proceedings of the Interspeech 2016, 2016

Convolutional neural network bottleneck features for bi-directional generalized variable parameter HMMs.
Proceedings of the IEEE International Conference on Information and Automation, 2016

Improved DNN-based segmentation for multi-genre broadcast audio.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Automatic Complexity Control of Generalized Variable Parameter HMMs for Noise Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Generalized variable parameter HMMs based acoustic-to-articulatory inversion.
Proceedings of the INTERSPEECH 2015, 2015

Efficient use of DNN bottleneck features in generalized variable parameter HMMs for noise robust speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

The Cambridge University 2014 BOLT conversational telephone Mandarin Chinese LVCSR system for speech translation.
Proceedings of the INTERSPEECH 2015, 2015

Recurrent neural network language model adaptation for multi-genre broadcast speech recognition.
Proceedings of the INTERSPEECH 2015, 2015

Investigations of low resource multi-accent mandarin speech recognition.
Proceedings of the IEEE International Conference on Information and Automation, 2015

Paraphrastic recurrent neural network language models.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Recurrent neural network language model training with noise contrastive estimation for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improving the training and evaluation efficiency of recurrent neural network language models.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Cambridge university transcription systems for the multi-genre broadcast challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The development of the cambridge university alignment systems for the multi-genre broadcast challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Speaker diarisation and longitudinal linking in multi-genre broadcast data.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Investigation of back-off based interpolation between recurrent neural network and n-gram language models.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

The MGB challenge: Evaluating multi-genre broadcast media recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Paraphrastic language models.
Comput. Speech Lang., 2014

Deep neural network bottleneck features for generalized variable parameter HMMs.
Proceedings of the INTERSPEECH 2014, 2014

Efficient GPU-based training of recurrent neural network language models using spliced sentence bunch.
Proceedings of the INTERSPEECH 2014, 2014

Efficient lattice rescoring using recurrent neural network language models.
Proceedings of the IEEE International Conference on Acoustics, 2014

Paraphrastic neural network language models.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Language model cross adaptation for LVCSR system combination.
Comput. Speech Lang., 2013

Use of contexts in language model interpolation and adaptation.
Comput. Speech Lang., 2013

Improving lightly supervised training for broadcast transcription.
Proceedings of the INTERSPEECH 2013, 2013

Cross-domain paraphrasing for improving language modelling using out-of-domain data.
Proceedings of the INTERSPEECH 2013, 2013

Feature space generalized variable parameter HMMs for noise robust recognition.
Proceedings of the INTERSPEECH 2013, 2013

Automatic Transcription of Multi-genre Media Archives.
Proceedings of the First Workshop on Speech, 2013

Paraphrastic language models and combination with neural network language models.
Proceedings of the IEEE International Conference on Acoustics, 2013

Automatic model complexity control for generalized variable parameter HMMs.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Transcription of multi-genre media archives using out-of-domain data.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Structured modeling based on generalized variable parameter HMMs and speaker adaptation.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2011
A flexible framework for HMM based noise robust speech recognition using generalized parametric space polynomial regression.
Sci. China Inf. Sci., 2011

Improving LVCSR System Combination Using Neural Network Language Model Cross Adaptation.
Proceedings of the INTERSPEECH 2011, 2011

Word Boundary Modelling and Full Covariance Gaussians for Arabic Speech-to-Text Systems.
Proceedings of the INTERSPEECH 2011, 2011

Generalized Variable Parameter HMMs for Noise Robust Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Investigation of acoustic units for LVCSR systems.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Improved neural network based language modelling and adaptation.
Proceedings of the INTERSPEECH 2010, 2010

Language model combination and adaptation usingweighted finite state transducers.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Exploiting Chinese character models to improve speech recognition performance.
Proceedings of the INTERSPEECH 2009, 2009

2008
Context dependent language model adaptation.
Proceedings of the INTERSPEECH 2008, 2008

2007
Automatic Model Complexity Control Using Marginalized Discriminative Growth Functions.
IEEE Trans. Speech Audio Process., 2007

Improving Speech Transcription for Mandarin-English Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speech Recognition System Combination for Machine Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Discriminative language model adaptation for Mandarin broadcast speech transcription and translation.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Corrections to "Automatic Transcription of Conversational Telephone Speech".
IEEE Trans. Speech Audio Process., 2006

The Cu-Htk Mandarin Broadcast News Transcription System.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Automatic transcription of conversational telephone speech.
IEEE Trans. Speech Audio Process., 2005

Investigation of Acoustic Modeling Techniques for LVCSR Systems.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Model complexity control and compression using discriminative growth functions.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Development of the 2003 CU-HTK conversational telephone speech transcription system.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Automatic complexity control for HLDA systems.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003


  Loading...