Chaofan Ding
According to our database1,
Chaofan Ding
authored at least 19 papers
between 2024 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
CoRR, October, 2025
DiaMoE-TTS: A Unified IPA-Based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation.
CoRR, September, 2025
LD-LAudio-V1: Video-to-Long-Form-Audio Generation Extension with Dual Lightweight Adapters.
CoRR, August, 2025
DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis.
CoRR, July, 2025
CoRR, May, 2025
Towards Film-Making Production Dialogue, Narration, Monologue Adaptive Moving Dubbing Benchmarks.
CoRR, May, 2025
DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning Guidance.
CoRR, March, 2025
DeepAudio-V1:Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation.
CoRR, March, 2025
CoRR, March, 2025
Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization.
CoRR, March, 2025
CoRR, January, 2025
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025
Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
2024
Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning.
CoRR, 2024
Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models.
CoRR, 2024
YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls.
CoRR, 2024
Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation.
CoRR, 2024
Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation.
CoRR, 2024