Sizhe Shan

According to our database, Sizhe Shan authored at least 4 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization.
CoRR, March, 2025

Do Less and Achieve More: Free Condition Video Outpainting with Diffusion Model.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

ProsodyFlow: High-fidelity Text-to-Speech through Conditional Flow Matching and Prosody Modeling with Large Speech Language Models.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls.
CoRR, 2024

