Pengcheng Zhu

Affiliations:

Fuxi AI Lab, NetEase Inc., Hangzhou, China
Northwestern Polytechnical University, School of Software and Microelectronics, Xi'an, China

According to our database, Pengcheng Zhu authored at least 21 papers between 2014 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

MeanVC: Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows.

CoRR, October, 2025

Cross-Lingual F5-TTS: Towards Language-Agnostic Voice Cloning and Speech Synthesis.

CoRR, September, 2025

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens.

CoRR, March, 2025

E1 TTS: Simple and Fast Non-Autoregressive TTS.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

2024

M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions.

Shuai Wang

Pengcheng Zhu

Haizhou Li

Proceedings of the Social Robotics - 16th International Conference, 2024

DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion.

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

Accent-VITS: accent transfer for end-to-end TTS.

CoRR, 2023

Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models.

CoRR, 2023

DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022

Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

VISinger: Variational Inference with Adversarial Learning for End-to-End Singing Voice Synthesis.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

One-Shot Voice Conversion For Style Transfer Based On Speaker Adaptation.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2019

Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2015

Head motion synthesis from speech using deep neural networks.

Chuang Ding

Lei Xie

Pengcheng Zhu

Multim. Tools Appl., 2015

Articulatory movement prediction using deep bidirectional long short-term memory based recurrent neural networks and word/phone embeddings.

Pengcheng Zhu

Lei Xie

Yunlin Chen

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

BLSTM neural networks for speech driven head motion synthesis.

Chuang Ding

Pengcheng Zhu

Lei Xie

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Speech-driven head motion synthesis using neural networks.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
