Kai Yu

According to our database1, Kai Yu authored at least 103 papers between 2005 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2019
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

AgentGraph: Toward Universal Dialogue Management With Structured Deep Reinforcement Learning.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Data Augmentation with Atomic Templates for Spoken Language Understanding.
CoRR, 2019

Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition.
CoRR, 2019

What does a Car-ssette tape tell?
CoRR, 2019

AgentGraph: Towards Universal Dialogue Management with Structured Deep Reinforcement Learning.
CoRR, 2019

Text-based Depression Detection: What Triggers An Alert.
CoRR, 2019

A Hierarchical Decoding Model For Spoken Language Understanding From Unaligned Data.
CoRR, 2019

Duration robust sound event detection.
CoRR, 2019

Audio Caption: Listen and Tell.
CoRR, 2019

A Hierarchical Decoding Model for Spoken Language Understanding from Unaligned Data.
Proceedings of the IEEE International Conference on Acoustics, 2019

Audio Caption: Listen and Tell.
Proceedings of the IEEE International Conference on Acoustics, 2019

Knowledge Distillation for Small Foot-print Deep Speaker Embedding.
Proceedings of the IEEE International Conference on Acoustics, 2019

End-to-end Monaural Multi-speaker ASR System without Pretraining.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Rich Short Text Conversation Using Semantic-Key-Controlled Sequence Generation.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Adaptive Very Deep Convolutional Residual Network for Noise Robust Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Investigating Raw Wave Deep Neural Networks for End-to-End Speaker Spoofing Detection.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Sequence discriminative training for deep learning based acoustic keyword spotting.
Speech Communication, 2018

End-to-End Monaural Multi-speaker ASR System without Pretraining.
CoRR, 2018

Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting.
CoRR, 2018

Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition.
CoRR, 2018

On Modular Training of Neural Acoustics-to-Word Model for LVCSR.
CoRR, 2018

Concept Transfer Learning for Adaptive Language Understanding.
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018

Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Joint Spoken Language Understanding and Domain Adaptive Language Modeling.
Proceedings of the Intelligence Science and Big Data Engineering, 2018

Covariance Based Deep Feature for Text-Dependent Speaker Verification.
Proceedings of the Intelligence Science and Big Data Engineering, 2018

Knowledge Distillation for Sequence Model.
Proceedings of the Interspeech 2018, 2018

Angular Softmax for Short-Duration Text-independent Speaker Verification.
Proceedings of the Interspeech 2018, 2018

MLN: Moment localization Network and Samples Selection for Moment Retrieval.
Proceedings of the 2nd International Conference on Video and Image Processing, 2018

Focal Kl-Divergence Based Dilated Convolutional Neural Networks for Co-Channel Speaker Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

On Modular Training of Neural Acoustics-to-Word Model for LVCSR.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Policy Adaptation for Deep Reinforcement Learning-Based Dialogue Management.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Phone Synchronous Speech Recognition With CTC Lattices.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2017

Concept Transfer Learning for Adaptive Language Understanding.
CoRR, 2017

A Unified Confidence Measure Framework Using Auxiliary Normalization Graph.
Proceedings of the Intelligence Science and Big Data Engineering, 2017

Binary Deep Neural Networks for Speech Recognition.
Proceedings of the Interspeech 2017, 2017

What Does the Speaker Embedding Encode?
Proceedings of the Interspeech 2017, 2017

Discrete Duration Model for Speech Synthesis.
Proceedings of the Interspeech 2017, 2017

Small-footprint convolutional neural network for spoofing detection.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

End-to-end spoofing detection with raw waveform CLDNNS.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Confidence measures for CTC-based phone synchronous decoding.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Multi-view LSTM Language Model with Word-Synchronized Auxiliary Feature for LVCSR.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

Future vector enhanced LSTM language model for LVCSR.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Cluster Adaptive Training for Deep Neural Network Based Acoustic Model.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2016

Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2016

Deep features for automatic spoofing detection.
Speech Communication, 2016

On Training Bi-directional Neural Network Language Model with Noise Contrastive Estimation.
CoRR, 2016

Multi-task joint-learning for robust voice activity detection.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Directed automatic speech transcription error correction using bidirectional LSTM.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Rich punctuations prediction using large-scale deep learning.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

On training bi-directional neural network language model with noise contrastive estimation.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Unrestricted Vocabulary Keyword Spotting Using LSTM-CTC.
Proceedings of the Interspeech 2016, 2016

Phone Synchronous Decoding with CTC Lattice.
Proceedings of the Interspeech 2016, 2016

Discriminatively trained joint speaker and environment representations for adaptation of deep neural network acoustic models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A comparative study of robustness of deep learning approaches for VAD.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Deep feature for text-dependent speaker verification.
Speech Communication, 2015

Paragraph vector based topic model for language model adaptation.
Proceedings of the INTERSPEECH 2015, 2015

Multi-task learning for text-dependent speaker verification.
Proceedings of the INTERSPEECH 2015, 2015

Robust deep feature for spoofing detection - the SJTU system for ASVspoof 2015 challenge.
Proceedings of the INTERSPEECH 2015, 2015

An investigation of context clustering for statistical speech synthesis with deep neural network.
Proceedings of the INTERSPEECH 2015, 2015

Very deep convolutional neural networks for LVCSR.
Proceedings of the INTERSPEECH 2015, 2015

Automatic model redundancy reduction for fast back-propagation for deep neural networks in speech recognition.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Cluster adaptive training for deep neural network.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Recurrent neural network language model with structured word embeddings for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A novel static parameter calculation method for model compensation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Local trajectory based speech enhancement for robust speech recognition with deep neural network.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

An investigation on DNN-derived bottleneck features for GMM-HMM based robust speech recognition.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Multi-task joint-learning of deep neural networks for robust speech recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Acoustic emotion recognition using deep neural network.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Tandem deep features for text-dependent speaker verification.
Proceedings of the INTERSPEECH 2014, 2014

A novel dynamic parameters calculation approach for model compensation.
Proceedings of the INTERSPEECH 2014, 2014

Speaker verification with deep features.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Reshaping deep neural network for fast decoding by node-pruning.
Proceedings of the IEEE International Conference on Acoustics, 2014

Stochastic data sweeping for fast DNN training.
Proceedings of the IEEE International Conference on Acoustics, 2014

Second order vector taylor series based robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Combination of data borrowing strategies for low-resource LVCSR.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Discriminative spoken language understanding using word confusion networks.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

The Effect of Cognitive Load on a Statistical Dialogue System.
Proceedings of the SIGDIAL 2012 Conference, 2012

Development of the 2012 SJTU HVR system.
Proceedings of the International Conference on Multimodal Interaction, 2012

ICMI'12 grand challenge: haptic voice recognition.
Proceedings of the International Conference on Multimodal Interaction, 2012

2011
Real User Evaluation of Spoken Dialogue Systems Using Amazon Mechanical Turk.
Proceedings of the INTERSPEECH 2011, 2011

On-line policy optimisation of spoken dialogue systems via live interaction with human subjects.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management.
Computer Speech & Language, 2010

Bayesian dialogue system for the Let's Go Spoken Dialogue Challenge.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Parameter learning for POMDP spoken dialogue models.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Parameter estimation for agenda-based user simulation.
Proceedings of the SIGDIAL 2010 Conference, 2010

Gaussian Processes for Fast Policy Optimisation of POMDP-based Dialogue Managers.
Proceedings of the SIGDIAL 2010 Conference, 2010

Natural belief-critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems.
Proceedings of the INTERSPEECH 2010, 2010

Phrase-Based Statistical Language Generation Using Graphical Models and Active Learning.
Proceedings of the ACL 2010, 2010

2009
k-Nearest Neighbor Monte-Carlo Control Algorithm for POMDP-Based Dialogue Systems.
Proceedings of the SIGDIAL 2009 Conference, 2009

Transformation-based learning for semantic parsing.
Proceedings of the INTERSPEECH 2009, 2009

Probablistic modelling of F0 in unvoiced regions in HMM based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2009

Spoken language understanding from unaligned data using discriminative classification models.
Proceedings of the IEEE International Conference on Acoustics, 2009

Back-off action selection in summary space-based POMDP dialogue systems.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Modelling user behaviour in the HIS-POMDP dialogue manager.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Training and Evaluation of the HIS POMDP Dialogue System in Noise.
Proceedings of the SIGDIAL 2008 Workshop, 2008

Evaluating semantic-level confidence scores with multiple hypotheses.
Proceedings of the INTERSPEECH 2008, 2008

User study of the Bayesian update of dialogue state approach to dialogue management.
Proceedings of the INTERSPEECH 2008, 2008

2007
Improving Speech Transcription for Mandarin-English Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speech Recognition System Combination for Machine Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

2005
Investigation of Acoustic Modeling Techniques for LVCSR Systems.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005


  Loading...