Chaofan Ding

According to our database, Chaofan Ding authored at least 19 papers between 2024 and 2025.

Collaborative distances:

Dijkstra number of five.
Erdős number of four.

Bibliography

2025

R2-SVC: Towards Real-World Robust and Expressive Zero-shot Singing Voice Conversion.

CoRR, October, 2025

DiaMoE-TTS: A Unified IPA-Based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation.

CoRR, September, 2025

LD-LAudio-V1: Video-to-Long-Form-Audio Generation Extension with Dual Lightweight Adapters.

CoRR, August, 2025

DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis.

CoRR, July, 2025

Towards Video to Piano Music Generation with Chain-of-Perform Support Benchmarks.

CoRR, May, 2025

Towards Film-Making Production Dialogue, Narration, Monologue Adaptive Moving Dubbing Benchmarks.

CoRR, May, 2025

DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning Guidance.

CoRR, March, 2025

DeepAudio-V1: Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation.

CoRR, March, 2025

DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos.

CoRR, March, 2025

Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization.

CoRR, March, 2025

Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search.

CoRR, January, 2025

MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Audio-Driven Gesture Generation via Deviation Feature in the Latent Space.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio.

Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025

2024

Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning.

CoRR, 2024

Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models.

CoRR, 2024

YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls.

CoRR, 2024

Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation.

CoRR, 2024

Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation.

CoRR, 2024
