Zhen-Hua Ling

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Pre-training Language Model as a Multi-perspective Course Learner.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Denoising-and-Dereverberation Hierarchical Neural Vocoder for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Cognitive Diagnosis with Explicit Student Vector Estimation and Unsupervised Question Matrix Learning.

[BibT_eX]

[DOI]

CoRR, 2022

USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration Network for Multilingual Complex Named Entity Recognition.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Decoupled Pronunciation and Prosody Modeling in Meta-Learning-based Multilingual Speech Synthesis.

[BibT_eX]

[DOI]

Yukun Peng

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Integrating Discrete Word-Level Style Variations into Non-Autoregressive Acoustic Models for Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations.

[BibT_eX]

[DOI]

Chang Liu

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Who Says What to Whom: A Survey of Multi-Party Conversations.

[BibT_eX]

[DOI]

Chongyang Tao

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

PoNet: Pooling Network for Efficient Token Mixing in Long Sequences.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Pengyu Cheng

Proceedings of the ICDSP 2022: 6th International Conference on Digital Signal Processing, Chengdu, China, February 25, 2022

Discourse-Level Prosody Modeling with a Variational Autoencoder for Non-Autoregressive Expressive Speech Synthesis.

[BibT_eX]

[DOI]

Ning-Qian Wu

Zhaoci Liu

Proceedings of the IEEE International Conference on Acoustics, 2022

Dementia Detection by Fusing Speech and Eye-Tracking Representation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Using Multiple Reference Audios and Style Embedding Constraints for Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Grapheme-To-Phoneme Conversion with Pre-Trained Grapheme Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Recognition-Synthesis Based any-to-one Voice Conversion with Cyclic Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Wider & Closer: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Detecting Alzheimer's Disease Based on Acoustic Features Extracted from Pre-trained Models.

[BibT_eX]

[DOI]

Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

TegTok: Augmenting Text Generation via Task-specific and Open-world Knowledge.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party Conversations.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Conversation- and Tree-Structure Losses for Dialogue Disentanglement.

[BibT_eX]

[DOI]

Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

2021

UnitNet: A Sequence-to-Sequence Acoustic Model for Concatenative Speech Synthesis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Extracting and Predicting Word-Level Style Variations for Speech Synthesis.

[BibT_eX]

[DOI]

Yajie Zhang

IEEE ACM Trans. Audio Speech Lang. Process., 2021

A Multiple-Integration Encoder for Multi-Turn Text-to-SQL Semantic Parsing.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Deep Contextualized Utterance Representations for Response Selection and Dialogue Analysis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Robustness of Speech Spoofing Detectors Against Adversarial Post-Processing of Voice Conversion.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Compressed Network in Network Models for Traffic Classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Wireless Communications and Networking Conference, 2021

Denoising-and-Dereverberation Hierarchical Neural Vocoder for Robust Waveform Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Partner Matters! An Empirical Study on Fusing Personas for Personalized Response Selection in Retrieval-Based Chatbots.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning.

[BibT_eX]

[DOI]

Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

Phase Spectrum Recovery for Enhancing Low-Quality Speech Captured by Laser Microphones.

[BibT_eX]

[DOI]

Chang Liu

Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

UnitNet-Based Hybrid Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Adversarial Voice Conversion Against Neural Spoofing Detectors.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Neural-Network-Based Approach to Identifying Speakers in Novels.

[BibT_eX]

[DOI]

Yue Chen

Qing-Feng Liu

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Learning Deep and Wide Contextual Representations Using BERT for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Yajie Zhang

Proceedings of the ICDSP 2021: 5th International Conference on Digital Signal Processing, 2021

Voice spoofing detection with raw waveform based on Dual Path Res2net.

[BibT_eX]

[DOI]

Proceedings of the ICCSE '21: 5th International Conference on Crowd Science and Engineering, Jinan, China, October 16, 2021

Graph Attention and Interaction Network With Multi-Task Learning for Fact Verification.

[BibT_eX]

[DOI]

Rui Yang

Runze Wang

Proceedings of the IEEE International Conference on Acoustics, 2021

Patnet : A Phoneme-Level Autoregressive Transformer Network for Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Detecting Alzheimer's Disease from Speech Using Neural Networks with Bottleneck Features and Data Augmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Have You Made a Decision? Where? A Pilot Study on Interpretability of Polarity Analysis Based on Advising Problem.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Naturalness and Controllability of Sequence-to-Sequence Speech Synthesis by Learning Local Prosody Representations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Detecting Speaker Personas from Conversational Texts.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Selecting and Analyzing Speech Features for the Screening of Mild Cognitive Impairment.

[BibT_eX]

[DOI]

Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

The Blizzard Challenge 2021.

[BibT_eX]

[DOI]

Simon King

Proceedings of the Blizzard Challenge 2021, virtual, October 23, 2021, 2021

A Deep Analysis of Speech Separation Guided Diarization Under Realistic Conditions.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Tracking Interaction States for Multi-Turn Text-to-SQL Semantic Parsing.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Non-Parallel Sequence-to-Sequence Voice Conversion With Disentangled Linguistic and Speaker Representations.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Utterance-to-Utterance Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots.

[BibT_eX]

[DOI]

Quan Liu

IEEE ACM Trans. Audio Speech Lang. Process., 2020

A Neural Vocoder With Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Learning and Modeling Unit Embeddings Using Deep Neural Networks for Unit-Selection-Based Mandarin Speech Synthesis.

[BibT_eX]

[DOI]

ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Condition-Transforming Variational Autoencoder for Generating Diverse Short Text Conversations.

[BibT_eX]

[DOI]

Yu-Ping Ruan

Xiaodan Zhu

ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Bidirectional Attention for Text-Dependent Speaker Verification.

[BibT_eX]

[DOI]

Sensors, 2020

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2020

Generating diverse conversation responses by creating and ranking multiple candidates.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2020

Learning to Retrieve Entity-Aware Knowledge and Generate Responses with Copy Mechanism for Task-Oriented Dialogue Systems.

[BibT_eX]

[DOI]

CoRR, 2020

Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots.

[BibT_eX]

[DOI]

CoRR, 2020

DialBERT: A Hierarchical Pre-Trained Model for Conversation Disentanglement.

[BibT_eX]

[DOI]

CoRR, 2020

Pre-Trained and Attention-Based Neural Networks for Building Noetic Task-Oriented Dialogue Systems.

[BibT_eX]

[DOI]

CoRR, 2020

Fine-Tuning BERT for Schema-Guided Zero-Shot Dialogue State Tracking.

[BibT_eX]

[DOI]

CoRR, 2020

Encrypted Network Traffic Classification Using Deep and Parallel Network-in-Network Models.

[BibT_eX]

[DOI]

IEEE Access, 2020

Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

An Adaptive X-Vector Model for Text-Independent Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Adaptive Speaker Normalization for CTC-Based Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Regularization-Based Adaptive Training for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Reverberation Modeling for Source-Filter-Based Neural Vocoder.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

DCDT: A Digital Clock Drawing Test System for Cognitive Impairment Screening.

[BibT_eX]

[DOI]

Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

Extracting Unit Embeddings Using Sequence-To-Sequence Acoustic Models for Unit Selection Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

WaveFFJORD: FFJORD-Based Vocoder for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Ning-Qian Wu

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Joint Intent Detection and Entity Linking on Spatial Domain Queries.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Filtering before Iteratively Referring for Knowledge-Grounded Response Selection in Retrieval-Based Chatbots.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer's Disease Detection.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots.

[BibT_eX]

[DOI]

Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

The Blizzard Challenge 2020.

[BibT_eX]

[DOI]

Simon King

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer.

[BibT_eX]

[DOI]

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

Non-Parallel Voice Conversion with Autoregressive Conversion Model and Duration Adjustment.

[BibT_eX]

[DOI]

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions.

[BibT_eX]

[DOI]

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

Voice Conversion Challenge 2020 -- Intra-lingual semi-parallel and cross-lingual voice conversion --.

[BibT_eX]

[DOI]

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

Online Speaker Adaptation for WaveNet-based Neural Vocoders.

[BibT_eX]

[DOI]

Qiuchen Huang

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Adversarial Post-Processing of Voice Conversion against Spoofing Detection.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Sequence-to-Sequence Acoustic Modeling for Voice Conversion.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

The ASVspoof 2019 database.

[BibT_eX]

[DOI]

CoRR, 2019

Align, Mask and Select: A Simple Method for Incorporating Commonsense Knowledge into Language Representation Models.

[BibT_eX]

[DOI]

CoRR, 2019

Exploring Unsupervised Pretraining and Sentence Structure Modelling for Winograd Schema Challenge.

[BibT_eX]

[DOI]

CoRR, 2019

Promoting Diversity for End-to-End Conversation Response Generation.

[BibT_eX]

[DOI]

CoRR, 2019

Knowledge Base Question Answering With Attentive Pooling for Question Representation.

[BibT_eX]

[DOI]

Runze Wang

IEEE Access, 2019

Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions.

[BibT_eX]

[DOI]

Zhi-Xiu Ye

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Chinese Dataset for Identifying Speakers in Novels.

[BibT_eX]

[DOI]

Jia-Xiang Chen

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Neural Text Clustering with Document-Level Attention Based on Dynamic Soft Labels.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Classification Model for Spoken Language Understanding.

[BibT_eX]

[DOI]

Chaohong Tan

Proceedings of the International Conference on Multimodal Interaction, 2019

Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Sequence-to-sequence Voice Conversion by Adding Text-supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Condition-transforming Variational Autoencoder for Conversation Response Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Channel Adversarial Training for Cross-channel Text-independent Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Dnn-based Spectral Enhancement for Neural Waveform Generators with Low-bit Quantization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Dually Interactive Matching Network for Personalized Response Selection in Retrieval-Based Chatbots.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Interactive Matching Network for Multi-Turn Response Selection in Retrieval-Based Chatbots.

[BibT_eX]

[DOI]

Quan Liu

Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Linguistic Steganography by Sampling-based Language Generation.

[BibT_eX]

[DOI]

Rui Yang

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Dementia Detection by Analyzing Spontaneous Mandarin Speech.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Multi-Level Matching and Aggregation Network for Few-Shot Relation Classification.

[BibT_eX]

[DOI]

Zhi-Xiu Ye

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Improving the Decoding Efficiency of Deep Neural Network Acoustic Models by Cluster-Based Senone Selection.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2018

Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models.

[BibT_eX]

[DOI]

Zhi-Ping Zhou

J. Signal Process. Syst., 2018

A Sequential Neural Encoder With Latent Structured Description for Modeling Sentences.

[BibT_eX]

[DOI]

Yu-Ping Ruan

Qian Chen

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Extracting Spectral Features Using Deep Autoencoders With Binary Distributed Hidden Units for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Statistical Parametric Speech Synthesis Using Generalized Distillation Framework.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2018

Articulatory-to-acoustic conversion using BLSTM-RNNs with augmented input representation.

[BibT_eX]

[DOI]

Speech Commun., 2018

Building Sequential Inference Models for End-to-End Response Selection.

[BibT_eX]

[DOI]

CoRR, 2018

Improving Sequence-to-Sequence Acoustic Modeling by Adding Text-Supervision.

[BibT_eX]

[DOI]

CoRR, 2018

Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, 2018

The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods.

[BibT_eX]

[DOI]

Fernando Villavicencio

Tomi Kinnunen

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment.

[BibT_eX]

[DOI]

Fernando Villavicencio

Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

GTDNN-Based Voice Conversion Using DAEs with Binary Distributed Hidden Units.

[BibT_eX]

[DOI]

Yi-Yang Ding

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

WaveNet Vocoder with Limited Training Data for Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Forward Attention in Sequence- To-Sequence Acoustic Modeling for Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Pseudo-Supervised Approach for Text Clustering Based on Consensus Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Samplernn-Based Neural Vocoder for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Hong-Chuan Wu

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Enhancing Sentence Embedding with Generalized Pooling.

[BibT_eX]

[DOI]

Qian Chen

Xiaodan Zhu

Proceedings of the 27th International Conference on Computational Linguistics, 2018

A Study on Improving End-to-End Neural Coreference Resolution.

[BibT_eX]

[DOI]

Nitin Indurkhya

Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

An Analysis of Speaker Diarization Fusion Methods For The First DIHARD Challenge.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Hybrid semi-Markov CRF for Neural Sequence Labeling.

[BibT_eX]

[DOI]

Zhi-Xiu Ye

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Neural Natural Language Inference Models Enhanced with External Knowledge.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Natural Language Inference with External Knowledge.

[BibT_eX]

[DOI]

CoRR, 2017

Recurrent Neural Network-Based Sentence Encoder with Gated Attention for Natural Language Inference.

[BibT_eX]

[DOI]

Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, 2017

Waveform Modeling Using Stacked Dilated Convolutional Neural Networks for Speech Bandwidth Extension.

[BibT_eX]

[DOI]

Yu Gu

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Cause-Effect Knowledge Acquisition and Neural Association Model for Solving A Set of Winograd Schema Problems.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Extracting structural spectral features using what-where auto-encoders for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Question Answering with Character-Level LSTM Encoders and Model-Based Data Augmentation.

[BibT_eX]

[DOI]

Runze Wang

Chen-Di Zhan

Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

The iFLYTEK system for blizzard machine learning challenge 2017-ES1.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

The USTC system for blizzard machine learning challenge 2017-ES2.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Emotional statistical parametric speech synthesis using LSTM-RNNs.

[BibT_eX]

[DOI]

Shumin An

Lirong Dai

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Enhanced LSTM for Natural Language Inference.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Combing Context and Commonsense Knowledge Through Neural Networks for Solving Winograd Schema Problems.

[BibT_eX]

[DOI]

Proceedings of the 2017 AAAI Spring Symposia, 2017

2016

Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

DBN-based Spectral Feature Representation for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2016

Modeling F0 trajectories in hierarchically structured deep neural networks.

[BibT_eX]

[DOI]

Speech Commun., 2016

Concept-to-Speech generation with knowledge sharing for acoustic modelling and utterance filtering.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2016

Part-of-Speech Relevance Weights for Learning Word Embeddings.

[BibT_eX]

[DOI]

CoRR, 2016

Probabilistic Reasoning via Deep Learning: Neural Association Models.

[BibT_eX]

[DOI]

CoRR, 2016

Distraction-Based Neural Networks for Document Summarization.

[BibT_eX]

[DOI]

CoRR, 2016

Enhancing and Combining Sequential and Tree LSTM for Natural Language Inference.

[BibT_eX]

[DOI]

CoRR, 2016

Intra-Topic Variability Normalization based on Linear Projection for Topic Classification.

[BibT_eX]

[DOI]

Proceedings of the NAACL HLT 2016, 2016

DNN-based unit selection using frame-sized speech segments.

[BibT_eX]

[DOI]

Zhi-Ping Zhou

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Cluster-based senone selection for the efficient calculation of deep neural network acoustic models.

[BibT_eX]

[DOI]

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks.

[BibT_eX]

[DOI]

Yu Gu

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F<sub>0</sub> Conversion.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Distraction-Based Neural Networks for Modeling Document.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Modeling spectral envelopes using deep conditional restricted Boltzmann machines for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A full training framework of cross-stream dependence modelling for HMM-based singing voice synthesis.

[BibT_eX]

[DOI]

Minghui Dong

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Modulation spectrum compensation for HMM-based speech synthesis using line spectral pairs.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep belief network-based post-filtering for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploring Semantic Representation in Brain Activity Using Word Embeddings.

[BibT_eX]

[DOI]

Yu-Ping Ruan

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015

A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Cassia Valentini-Botinhao

Tuomo Raitio

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends.

[BibT_eX]

[DOI]

IEEE Signal Process. Mag., 2015

Statistical parametric speech synthesis using a hidden trajectory model.

[BibT_eX]

[DOI]

Ming-Qi Cai

Speech Commun., 2015

Integrate Document Ranking Information into Confidence Measure Calculation for Spoken Term Detection.

[BibT_eX]

[DOI]

Quan Liu

Wu Guo

CoRR, 2015

Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Restoring high frequency spectral envelopes using neural networks for speech bandwidth extension.

[BibT_eX]

[DOI]

Yu Gu

Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Spectral conversion using deep neural networks trained with multi-source speakers.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

LIP movement generation using restricted Boltzmann machines for visual speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014

Voice conversion using deep neural networks with layer-wise generative training.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

HMM-based unit selection speech synthesis using log likelihood ratios derived from perceptual data.

[BibT_eX]

[DOI]

Speech Commun., 2014

Unsupervised Prosodic Labeling of Speech Synthesis Databases Using Context-Dependent HMMs.

[BibT_eX]

[DOI]

Chen-Yu Yang

IEICE Trans. Inf. Syst., 2014

Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Improving F0 prediction using bidirectional associative memories and syllable-level F0 features for HMM-based Mandarin speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Concept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

DNN-based stochastic postfilter for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Cassia Valentini-Botinhao

Tuomo Raitio

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Voice conversion using generative trained deep neural networks with multiple frame spectral envelopes.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Formant-controlled speech synthesis using hidden trajectory model.

[BibT_eX]

[DOI]

Ming-Qi Cai

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Spectral modeling using neural autoregressive distribution estimators for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Xiang Yin

Proceedings of the IEEE International Conference on Acoustics, 2014

Using bidirectional associative memories for joint spectral envelope modeling in voice conversion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Articulatory Control of HMM-Based Parametric Speech Synthesis Using Feature-Space-Switched Multiple Regression.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2013

Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Li Deng

Dong Yu

IEEE Trans. Speech Audio Process., 2013

Mage - HMM-based speech synthesis reactively controlled by the articulators.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Mage - reactive articulatory feature control of HMM-based parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

On the evaluation of inversion mapping performance in the acoustic domain.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Joint spectral distribution modeling using restricted boltzmann machines for voice conversion.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Unsupervised prosodic phrase boundary labeling of Mandarin speech synthesis database using context-dependent HMM.

[BibT_eX]

[DOI]

Chen-Yu Yang

Proceedings of the IEEE International Conference on Acoustics, 2013

Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Li Deng

Dong Yu

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Minimum Kullback-Leibler Divergence Parameter Generation for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

Improved unit selection speech synthesis method utilizing subjective evaluation results on synthetic speech.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Considering Global Variance of the Log Power Spectrum Derived from Mel-Cepstrum in HMM-based Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Vowel Creation by Articulatory Control in HMM-based Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Formant-Controlled HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Preserve ordering property of generated LSPS for minimum generation error training in HMM-based speech synthesis.

[BibT_eX]

[DOI]

Ming Lei

Proceedings of the IEEE International Conference on Acoustics, 2011

Non-parallel training for voice conversion based on FT-GMM.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

An Analysis of HMM-based prediction of articulatory movements.

[BibT_eX]

[DOI]

Speech Commun., 2010

Cross-Validation and Minimum Generation Error based Decision Tree Pruning for HMM-based Speech Synthesis.

[BibT_eX]

[DOI]

Int. J. Comput. Linguistics Chin. Lang. Process., 2010

Minimum generation error training for HMM-based prediction of articulatory movements.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Automatic phrase boundary labeling for Mandarin TTS corpus using context-dependent HMM.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Statistical modeling of syllable-level F0 features for HMM-based unit selection speech synthesis.

[BibT_eX]

[DOI]

Zhiguo Wang

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

GMM-based voice conversion with explicit modelling on feature transform.

[BibT_eX]

[DOI]

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

HMM-based text-to-articulatory-movement prediction and analysis of critical articulators.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A hierarchical F0 modeling method for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Ming Lei

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2009

Asynchronous F0 and spectrum modeling for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Cheng-Cheng Wang

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Model Adaptation for HMM-Based Speech Synthesis under Minimum Generation Error Criterion.

[BibT_eX]

[DOI]

Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008

Multi-Layer F0 Modeling for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Cross-Stream Dependency Modeling for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Wei Zhang

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Heteronym Verification for Mandarin Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Robustness of HMM-based speech synthesis.

[BibT_eX]

[DOI]

Simon King

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Minimum generation error criterion considering global/local variance for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Minumum generation error linear regression based model adaptation for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Minimum unit selection error training for HMM-based unit selection speech synthesis system.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

The USTC System for Blizzard Challenge 2008.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2008, 2008

2007

HMM-Based Hierarchical Unit Selection Combining Kullback-Leibler Divergence with Likelihood Criterion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

The USTC and iflytek speech synthesis systems for Blizzard Challenge 2007.

[BibT_eX]

[DOI]

Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007

2006

HMM-Based Emotional Speech Synthesis Using Average Emotion Model.

[BibT_eX]

[DOI]

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

HMM-based unit selection using frame sized speech segments.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

USTC System for Blizzard Challenge 2006 an Improved HMM-based Speech Synthesis Method.

[BibT_eX]

[DOI]

Proceedings of the Blizzard Challenge 2006, Pittsburgh, PA, USA, September 16, 2006, 2006

2005

An Improved Spectral and Prosodic Transformation Method in STRAIGHT-based Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Emotional Speech Synthesis Based on Improved Codebook Mapping Voice Conversion.

[BibT_eX]

[DOI]

Yu-Ping Wang

Proceedings of the Affective Computing and Intelligent Interaction, 2005

A Novel Source Analysis Method by Matching Spectral Characters of LF Model with STRAIGHT Spectrum.

[BibT_eX]

[DOI]