Pengcheng Zhu

Affiliations:
  • Fuxi AI Lab, NetEase Inc., Hangzhou, China
  • Northwestern Polytechnical University, School of Software and Microelectronics, Xi'an, China


According to our database, Pengcheng Zhu authored at least 19 papers between 2014 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens.
CoRR, March, 2025

E1 TTS: Simple and Fast Non-Autoregressive TTS.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

2024
M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions.
Proceedings of the Social Robotics - 16th International Conference, 2024

DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023
Accent-VITS: accent transfer for end-to-end TTS.
CoRR, 2023

Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models.
CoRR, 2023

DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022
Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

One-Shot Voice Conversion For Style Transfer Based On Speaker Adaptation.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2019
Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2015
Head motion synthesis from speech using deep neural networks.
Multim. Tools Appl., 2015

Articulatory movement prediction using deep bidirectional long short-term memory based recurrent neural networks and word/phone embeddings.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

BLSTM neural networks for speech driven head motion synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Speech-driven head motion synthesis using neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

