Chengzhu Yu

According to our database, Chengzhu Yu authored at least 39 papers between 2013 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2022
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech.
CoRR, 2022

2021
Non-Parallel Any-to-Many Voice Conversion by Replacing Speaker Statistics.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
DurIAN-SC: Duration Informed Attention Network Based Singing Voice Conversion System.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

DurIAN: Duration Informed Attention Network for Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Peking Opera Synthesis via Duration Informed Attention Network.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Minimum Bayes Risk Training of RNN-Transducer for End-to-End Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Pitchnet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

The Tencent speech synthesis system for Blizzard Challenge 2020.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019
Synthesising Expressiveness in Peking Opera via Duration Informed Attention Network.
CoRR, 2019

Learning Singing From Speech.
CoRR, 2019

DurIAN: Duration Informed Attention Network For Multimodal Synthesis.
CoRR, 2019

The 2019 Inaugural Fearless Steps Challenge: A Giant Leap for Naturalistic Audio.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching.
Proceedings of the 7th International Conference on Learning Representations, 2019

Seq2Seq Attentional Siamese Neural Networks for Text-dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
An Exploration of Directly Using Word as Acoustic Modeling Unit for Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improving Attention-Based End-to-End ASR Systems with Sequence-Based Loss Functions.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

A Multistage Training Framework for Acoustic-to-Word Model.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Fearless Steps: Apollo-11 Corpus Advancements for Speech Technologies from Earth to the Moon.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Active Learning Based Constrained Clustering For Speaker Diarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Robust Harmonic Features for Classification-Based Pitch Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

An Investigation of Deep-Learning Frameworks for Speaker Verification Antispoofing.
IEEE J. Sel. Top. Signal Process., 2017

UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Advances in all-neural speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Text-Available Speaker Recognition System for Forensic Applications.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

UTD-CRSS system for the NIST 2015 language recognition i-vector machine learning challenge.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Language recognition using deep neural networks with very limited training data.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Toward Access to Multi-Perspective Archival Spoken Word Content.
Proceedings of the Digital Libraries: Knowledge, Information, and Data in an Open Access Society, 2016

2015
I-vector based physical task stress detection with different fusion strategies.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Robust i-vector extraction for neural network adaptation in noisy environment.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Utilization of unlabeled development data for speaker verification.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Investigating State-of-the-Art Speaker Verification in the case of Unlabeled Development Data.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Acoustic feature transformation using UBM-based LDA for speaker recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

'Houston, we have a solution': a case study of the analysis of astronaut speech during NASA Apollo 11 for long-term speaker modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Uncertainty propagation in front end factor analysis for noise robust speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
'Houston, we have a solution': using NASA Apollo program to advance speech and language processing technology.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A new mask-based objective measure for predicting the intelligibility of binary masked speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

