Yu Zhang

PhD thesis, 2017

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.

[BibT_eX]

[DOI]

Yannis Agiomyrgiannakis

Yonghui Wu

CoRR, 2017

Training RNNs as Fast as CNNs.

[BibT_eX]

[DOI]

Tao Lei

Yoav Artzi

CoRR, 2017

Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning Latent Representations for Speech Generation and Transformation.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2017, 2017

Advances in Joint CTC-Attention Based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2017, 2017

Latent Sequence Decompositions.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Advanced Recurrent Neural Networks for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Guoguo Chen

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Discriminative Beamforming with Phase-Aware Neural Networks for Speech Enhancement and Recognition.

[BibT_eX]

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Sequence-Discriminative Training of Neural Networks.

[BibT_eX]

[DOI]

Guoguo Chen

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016

Recurrent Neural Network Encoder with Attention for Community Question Answering.

[BibT_eX]

[DOI]

CoRR, 2016

A prioritized grid long short-term memory RNN for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

On training bi-directional neural network language model with noise contrastive estimation.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2016, 2016

Highway long short-term memory RNNS for distant speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Prediction-adaptation-correction recurrent neural networks for low-resource language speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep beamforming networks for multi-channel speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Speaker-aware training of LSTM-RNNS for acoustic modelling.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Integrated adaptation with multi-factor joint-learning for far-field speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multilingual data selection for training stacked bottleneck features.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Neural Attention for Learning to Rank Questions in Community Question Answering.

[BibT_eX]

[DOI]

Salvatore Romeo

Giovanni Da San Martino

Alberto Barrón-Cedeño

Proceedings of the COLING 2016, 2016

2015

The Computational Network Toolkit [Best of the Web].

[BibT_eX]

[DOI]

Kaisheng Yao

IEEE Signal Process. Mag., 2015

Speaker adaptation using the i-vector technique for bottleneck features.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2015, 2015

Speech recognition with prediction-adaptation-correction recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Spoken language understanding using long short-term memory neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Graph-based re-ranking using acoustic feature similarity between search results for spoken term detection on low-resource languages.

[BibT_eX]

[DOI]

Hung-yi Lee

Proceedings of the INTERSPEECH 2014, 2014

Language ID-based training of multilingual stacked bottleneck features.

[BibT_eX]

[DOI]

Anne Cutler

Proceedings of the INTERSPEECH 2014, 2014

Recent advances in ASR applied to an Arabic transcription system for Al-Jazeera.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2014, 2014

Extracting deep neural network bottleneck features using low-rank matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Joint Learning of Phonetic Units and Word Pronunciations for ASR.

[BibT_eX]

[DOI]

Chia-ying Lee