Xin Cheng

Orcid: 0009-0001-7581-8662

Affiliations:

Renmin University of China, Gaoling School of Artificial Intelligence, Beijing, China

According to our database¹, Xin Cheng authored at least 11 papers between 2025 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Unified Synthesis of Compositional Speech and Sound from Free-Form Text Prompts.

CoRR, May, 2026

SyncDPO: Enhancing Temporal Synchronization in Video-Audio Joint Generation via Preference Learning.

CoRR, May, 2026

2025

VSpeechLM: A Visual Speech Language Model for Visual Text-to-Speech Task.

CoRR, November, 2025

Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction.

CoRR, October, 2025

VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning.

CoRR, September, 2025

WildSpoof Challenge Evaluation Plan.

CoRR, August, 2025

A Visual Speech Language Model for Visual Text-to-Speech Task.

Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

VAFlow: Video-to-Audio Generation with Cross-Modality Flow Matching.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LoVA: Long-form Video-to-Audio Generation.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Animate and Sound an Image.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

EyEar: Learning Audio Synchronized Human Gaze Trajectory Based on Physics-Informed Dynamics.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
