Ruibin Yuan
According to our database, Ruibin Yuan authored at least 48 papers between 2020 and 2025.
Bibliography
2025
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix.
CoRR, May, 2025
CoRR, March, 2025
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens.
CoRR, March, 2025
CoRR, February, 2025
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages.
CoRR, February, 2025
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis.
CoRR, 2024
Modeling Analog Dynamic Range Compressors using Deep Learning and State-space Models.
CoRR, 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.
CoRR, 2024
CoRR, 2024
Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
2022
MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning.
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Parallel Adaptive Subspace Pursuit Algorithm for Multiuser Detection of Uplink Grant-Free NOMA.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2021
2020
CoRR, 2020