Chenpeng Du

Orcid: 0000-0001-5329-0847

According to our database, Chenpeng Du authored at least 24 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech.
CoRR, 2024

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Speaker Adaptive Text-to-Speech With Timbre-Normalized Vector-Quantized Feature.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder.
CoRR, 2023

Acoustic BPE for Speech Generation with Discrete Tokens.
CoRR, 2023

Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS.
CoRR, 2023

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching.
CoRR, 2023

DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech.
CoRR, 2023

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation.
CoRR, 2023

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding.
CoRR, 2023

DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Phone-Level Prosody Modelling With GMM-Based MDN for Diverse and Controllable Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Neural Fusion for Voice Cloning.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature.
Proceedings of the Interspeech 2022, 2022

Unsupervised Word-Level Prosody Tagging for Controllable Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Diverse and Controllable Speech Synthesis with GMM-Based Phone-Level Prosody Modelling.
CoRR, 2021

Mixture Density Network for Phone-Level Prosody Modelling in Speech Synthesis.
CoRR, 2021

Data Augmentation for End-to-End Code-Switching Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Rich Prosody Diversity Modelling with Phone-Level Mixture Density Network.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Towards Data Selection on TTS Data for Children's Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Speaker Augmentation for Low Resource Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

