Wenxiang Guo

Orcid: 0009-0006-7997-4140

According to our database¹, Wenxiang Guo authored at least 17 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2026

Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer.

CoRR, May, 2026

2025

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation.

CoRR, July, 2025

Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis.

CoRR, July, 2025

MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

A Multimodal Evaluation Framework for Spatial Audio Playback Systems: From Localization to Listener Preference.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

ASAudio: A Survey of Advanced Spatial Audio Research.

Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches.

Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Versatile Framework for Song Generation with Prompt-based Control.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching.

CoRR, 2024

GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2020

Anomaly Detection for Time Series Based on the Neural Networks Optimized by the Improved PSO Algorithm.

Proceedings of the Intelligent Computing Methodologies - 16th International Conference, 2020

2019

An Advanced Membrane Evolutionary Algorithm for Constrained Engineering Design Problems.

Proceedings of the Human Centered Computing - 5th International Conference, 2019