Zhen-Hua Ling

According to our database, Zhen-Hua Ling authored at least 125 papers between 2002 and 2019.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2019
Sequence-to-Sequence Acoustic Modeling for Voice Conversion.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2019

Knowledge Base Question Answering With Attentive Pooling for Question Representation.
IEEE Access, 2019

Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2019

Improving Sequence-to-sequence Voice Conversion by Adding Text-supervision.
Proceedings of the IEEE International Conference on Acoustics, 2019

Condition-transforming Variational Autoencoder for Conversation Response Generation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Channel Adversarial Training for Cross-channel Text-independent Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

DNN-based Spectral Enhancement for Neural Waveform Generators with Low-bit Quantization.
Proceedings of the IEEE International Conference on Acoustics, 2019

Multi-Level Matching and Aggregation Network for Few-Shot Relation Classification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Improving the Decoding Efficiency of Deep Neural Network Acoustic Models by Cluster-Based Senone Selection.
Signal Processing Systems, 2018

Unit Selection Speech Synthesis Using Frame-Sized Speech Segments and Neural Network Based Acoustic Models.
Signal Processing Systems, 2018

A Sequential Neural Encoder With Latent Structured Description for Modeling Sentences.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Extracting Spectral Features Using Deep Autoencoders With Binary Distributed Hidden Units for Statistical Parametric Speech Synthesis.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2018

Statistical Parametric Speech Synthesis Using Generalized Distillation Framework.
IEEE Signal Process. Lett., 2018

Articulatory-to-acoustic conversion using BLSTM-RNNs with augmented input representation.
Speech Communication, 2018

The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

GTDNN-Based Voice Conversion Using DAEs with Binary Distributed Hidden Units.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis.
Proceedings of the Interspeech 2018, 2018

WaveNet Vocoder with Limited Training Data for Voice Conversion.
Proceedings of the Interspeech 2018, 2018

Forward Attention in Sequence-To-Sequence Acoustic Modeling for Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

SampleRNN-Based Neural Vocoder for Statistical Parametric Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Enhancing Sentence Embedding with Generalized Pooling.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

A Study on Improving End-to-End Neural Coreference Resolution.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

Hybrid semi-Markov CRF for Neural Sequence Labeling.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Neural Natural Language Inference Models Enhanced with External Knowledge.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Recurrent Neural Network-Based Sentence Encoder with Gated Attention for Natural Language Inference.
Proceedings of the 2nd Workshop on Evaluating Vector Space Representations for NLP, 2017

Waveform Modeling Using Stacked Dilated Convolutional Neural Networks for Speech Bandwidth Extension.
Proceedings of the Interspeech 2017, 2017

Cause-Effect Knowledge Acquisition and Neural Association Model for Solving A Set of Winograd Schema Problems.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Extracting structural spectral features using what-where auto-encoders for statistical parametric speech synthesis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Question Answering with Character-Level LSTM Encoders and Model-Based Data Augmentation.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2017

The iFLYTEK system for blizzard machine learning challenge 2017-ES1.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

The USTC system for blizzard machine learning challenge 2017-ES2.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Enhanced LSTM for Natural Language Inference.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Combing Context and Commonsense Knowledge Through Neural Networks for Solving Winograd Schema Problems.
Proceedings of the 2017 AAAI Spring Symposia, 2017

2016
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2016

DBN-based Spectral Feature Representation for Statistical Parametric Speech Synthesis.
IEEE Signal Process. Lett., 2016

Modeling F0 trajectories in hierarchically structured deep neural networks.
Speech Communication, 2016

Concept-to-Speech generation with knowledge sharing for acoustic modelling and utterance filtering.
Computer Speech & Language, 2016

Intra-Topic Variability Normalization based on Linear Projection for Topic Classification.
Proceedings of the NAACL HLT 2016, 2016

DNN-based unit selection using frame-sized speech segments.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Cluster-based senone selection for the efficient calculation of deep neural network acoustic models.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks.
Proceedings of the Interspeech 2016, 2016

Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks.
Proceedings of the Interspeech 2016, 2016

The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F0 Conversion.
Proceedings of the Interspeech 2016, 2016

Distraction-Based Neural Networks for Modeling Document.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Modeling spectral envelopes using deep conditional restricted Boltzmann machines for statistical parametric speech synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A full training framework of cross-stream dependence modelling for HMM-based singing voice synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Modulation spectrum compensation for HMM-based speech synthesis using line spectral pairs.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Deep belief network-based post-filtering for statistical parametric speech synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exploring Semantic Representation in Brain Activity Using Word Embeddings.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2015

Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends.
IEEE Signal Process. Mag., 2015

Statistical parametric speech synthesis using a hidden trajectory model.
Speech Communication, 2015

Automatic phrase boundary labeling of speech synthesis database using context-dependent HMMs and n-gram prior distributions.
Proceedings of the INTERSPEECH 2015, 2015

Restoring high frequency spectral envelopes using neural networks for speech bandwidth extension.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Spectral conversion using deep neural networks trained with multi-source speakers.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Lip movement generation using restricted Boltzmann machines for visual speech synthesis.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Learning Semantic Word Embeddings based on Ordinal Knowledge Constraints.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Voice conversion using deep neural networks with layer-wise generative training.
IEEE/ACM Trans. Audio, Speech & Language Processing, 2014

HMM-based unit selection speech synthesis using log likelihood ratios derived from perceptual data.
Speech Communication, 2014

Unsupervised Prosodic Labeling of Speech Synthesis Databases Using Context-Dependent HMMs.
IEICE Transactions, 2014

Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Improving F0 prediction using bidirectional associative memories and syllable-level F0 features for HMM-based Mandarin speech synthesis.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree.
Proceedings of the INTERSPEECH 2014, 2014

Concept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2014, 2014

DNN-based stochastic postfilter for HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2014, 2014

Voice conversion using generative trained deep neural networks with multiple frame spectral envelopes.
Proceedings of the INTERSPEECH 2014, 2014

Formant-controlled speech synthesis using hidden trajectory model.
Proceedings of the INTERSPEECH 2014, 2014

Spectral modeling using neural autoregressive distribution estimators for statistical parametric speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Using bidirectional associative memories for joint spectral envelope modeling in voice conversion.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Articulatory Control of HMM-Based Parametric Speech Synthesis Using Feature-Space-Switched Multiple Regression.
IEEE Trans. Audio, Speech & Language Processing, 2013

Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis.
IEEE Trans. Audio, Speech & Language Processing, 2013

Mage - HMM-based speech synthesis reactively controlled by the articulators.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Mage - reactive articulatory feature control of HMM-based parametric speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

On the evaluation of inversion mapping performance in the acoustic domain.
Proceedings of the INTERSPEECH 2013, 2013

Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion.
Proceedings of the INTERSPEECH 2013, 2013

Unsupervised prosodic phrase boundary labeling of Mandarin speech synthesis database using context-dependent HMM.
Proceedings of the IEEE International Conference on Acoustics, 2013

Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Minimum Kullback-Leibler Divergence Parameter Generation for HMM-Based Speech Synthesis.
IEEE Trans. Audio, Speech & Language Processing, 2012

Improved unit selection speech synthesis method utilizing subjective evaluation results on synthetic speech.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Cross-stream dependency modeling using continuous F0 model for HMM-based speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Considering Global Variance of the Log Power Spectrum Derived from Mel-Cepstrum in HMM-based Parametric Speech Synthesis.
Proceedings of the INTERSPEECH 2012, 2012

Vowel Creation by Articulatory Control in HMM-based Parametric Speech Synthesis.
Proceedings of the INTERSPEECH 2012, 2012

2011
Feature-Space Transform Tying in Unified Acoustic-Articulatory Modelling for Articulatory Control of HMM-Based Speech Synthesis.
Proceedings of the INTERSPEECH 2011, 2011

Formant-Controlled HMM-Based Speech Synthesis.
Proceedings of the INTERSPEECH 2011, 2011

Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis.
Proceedings of the INTERSPEECH 2011, 2011

Building HMM based unit-selection speech synthesis system using synthetic speech naturalness evaluation score.
Proceedings of the IEEE International Conference on Acoustics, 2011

Preserve ordering property of generated LSPs for minimum generation error training in HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011

Non-parallel training for voice conversion based on FT-GMM.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
An Analysis of HMM-based prediction of articulatory movements.
Speech Communication, 2010

Cross-Validation and Minimum Generation Error based Decision Tree Pruning for HMM-based Speech Synthesis.
IJCLCLP, 2010

Minimum generation error training for HMM-based prediction of articulatory movements.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Automatic phrase boundary labeling for Mandarin TTS corpus using context-dependent HMM.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Statistical modeling of syllable-level F0 features for HMM-based unit selection speech synthesis.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

GMM-based voice conversion with explicit modelling on feature transform.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Automatic error detection for unit selection speech synthesis using log likelihood ratio based SVM classifier.
Proceedings of the INTERSPEECH 2010, 2010

HMM-based text-to-articulatory-movement prediction and analysis of critical articulators.
Proceedings of the INTERSPEECH 2010, 2010

Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2010, 2010

A hierarchical F0 modeling method for HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2010, 2010

Minimum generation error training with weighted Euclidean distance on LSP for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis.
IEEE Trans. Audio, Speech & Language Processing, 2009

Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis.
IEEE Trans. Audio, Speech & Language Processing, 2009

Asynchronous F0 and spectrum modeling for HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2009, 2009

2008
Model Adaptation for HMM-Based Speech Synthesis under Minimum Generation Error Criterion.
Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008

Multi-Layer F0 Modeling for HMM-Based Speech Synthesis.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Cross-Stream Dependency Modeling for HMM-Based Speech Synthesis.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Heteronym Verification for Mandarin Speech Synthesis.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Robustness of HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2008, 2008

Articulatory control of HMM-based parametric speech synthesis driven by phonetic knowledge.
Proceedings of the INTERSPEECH 2008, 2008

Minimum generation error criterion considering global/local variance for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2008

Minimum generation error linear regression based model adaptation for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2008

Minimum unit selection error training for HMM-based unit selection speech synthesis system.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
HMM-Based Hierarchical Unit Selection Combining Kullback-Leibler Divergence with Likelihood Criterion.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
HMM-Based Emotional Speech Synthesis Using Average Emotion Model.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format.
Proceedings of the INTERSPEECH 2006, 2006

HMM-based unit selection using frame sized speech segments.
Proceedings of the INTERSPEECH 2006, 2006

2005
An Improved Spectral and Prosodic Transformation Method in STRAIGHT-based Voice Conversion.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Emotional Speech Synthesis Based on Improved Codebook Mapping Voice Conversion.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

A Novel Source Analysis Method by Matching Spectral Characters of LF Model with STRAIGHT Spectrum.
Proceedings of the Affective Computing and Intelligent Interaction, 2005

2004
Modeling glottal effect on the spectral envelop of STRAIGHT using mixture of Gaussians.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

A novel voice conversion system based on codebook mapping with phoneme-tied weighting.
Proceedings of the INTERSPEECH 2004, 2004

Compression of speech database by feature separation and pattern clustering using STRAIGHT.
Proceedings of the INTERSPEECH 2004, 2004

2002
A miniature Chinese TTS system based on tailored corpus.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

