Yuyue Wang

ORCID: 0009-0005-6987-1028

Affiliations:
  • Renmin University of China, China


According to our database, Yuyue Wang authored at least 12 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts.
CoRR, May, 2026

SyncDPO: Enhancing Temporal Synchronization in Video-Audio Joint Generation via Preference Learning.
CoRR, May, 2026

2025
ChronusOmni: Improving Time Awareness of Omni Large Language Models.
CoRR, December, 2025

VSpeechLM: A Visual Speech Language Model for Visual Text-to-Speech Task.
CoRR, November, 2025

VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning.
CoRR, September, 2025

A Visual Speech Language Model for Visual Text-to-Speech Task.
Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

VAFlow: Video-to-Audio Generation with Cross-Modality Flow Matching.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LoVA: Long-form Video-to-Audio Generation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Animate and Sound an Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition.
CoRR, 2024

TiVA: Time-Aligned Video-to-Audio Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2023
ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

