Chaofan Ding

According to our database1, Chaofan Ding authored at least 16 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
LD-LAudio-V1: Video-to-Long-Form-Audio Generation Extension with Dual Lightweight Adapters.
CoRR, August, 2025

DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis.
CoRR, July, 2025

Towards Video to Piano Music Generation with Chain-of-Perform Support Benchmarks.
CoRR, May, 2025

MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing.
CoRR, May, 2025

Towards Film-Making Production Dialogue, Narration, Monologue Adaptive Moving Dubbing Benchmarks.
CoRR, May, 2025

DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning Guidance.
CoRR, March, 2025

DeepAudio-V1:Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation.
CoRR, March, 2025

DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos.
CoRR, March, 2025

Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization.
CoRR, March, 2025

Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search.
CoRR, January, 2025

Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning.
CoRR, 2024

Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models.
CoRR, 2024

YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls.
CoRR, 2024

Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation.
CoRR, 2024

Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation.
CoRR, 2024


  Loading...