Guanrou Yang
Orcid: 0009-0008-3614-1346
According to our database1,
Guanrou Yang authored at least 19 papers
between 2023 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
SemanticVocoder: Bridging Audio Generation and Audio Understanding via Semantic Latents.
CoRR, February, 2026
SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing.
IEEE J. Sel. Top. Signal Process., January, 2026
2025
UltraVoice: Scaling Fine-Grained Style-Controlled Speech Conversations for Spoken Dialogue Models.
CoRR, October, 2025
DiSTAR: Diffusion over a Scalable Token Autoregressive Representation for Speech Generation.
CoRR, October, 2025
CoRR, September, 2025
CoRR, May, 2025
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix.
CoRR, May, 2025
CoRR, April, 2025
CoRR, January, 2025
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Fast-Hubert: an Efficient Training Framework for Self-Supervised Speech Representation Learning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023