Songtao Zhou

Orcid: 0009-0008-5972-3955

According to our database, Songtao Zhou authored at least 9 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Towards Automatic Soccer Commentary Generation with Knowledge-Enhanced Visual Reasoning.
CoRR, April, 2026

From Natural Alignment to Conditional Controllability in Multimodal Dialogue.
CoRR, March, 2026

2025
HarmoniVox: Painting Voices to Match the Avatar's Soul.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

V-CASS: Vision-context-aware Expressive Speech Synthesis for Enhancing User Understanding of Videos.
Proceedings of the International Joint Conference on Neural Networks, 2025

2024
DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SpeechCraft: A Fine-Grained Expressive Speech Dataset with Natural Language Description.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2023
Speech-Driven 3D Face Animation with Composite and Regional Facial Movements.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2021
Open-Access Data and Toolbox for Tracking COVID-19 Impact on Power Systems.
CoRR, 2021

