Haina Zhu

ORCID: 0009-0005-6286-5530

According to our database, Haina Zhu authored at least 13 papers between 2011 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Resonate: Reinforcing Text-to-Audio Generation via Online Feedback from Large Audio Language Models.
CoRR, March, 2026

The SJTU X-LANCE Lab System for MSR Challenge 2025.
CoRR, February, 2026

Audio ControlNet for Fine-Grained Audio Generation and Editing.
CoRR, February, 2026

SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing.
IEEE J. Sel. Top. Signal Process., January, 2026

2025
X-Talk: On the Underestimated Potential of Modular Speech-to-Speech Dialogue System.
CoRR, December, 2025

LeVo: High-Quality Song Generation with Multi-Preference Alignment.
CoRR, June, 2025

Layer-wise Investigation of Large-Scale Self-Supervised Music Representation Models.
CoRR, May, 2025

MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix.
CoRR, May, 2025

MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization.
CoRR, January, 2025

TVC-MusicGen: Time-Varying Structure Control for Background Music Generation via Self-Supervised Training.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Socially Aware Object Goal Navigation With Heterogeneous Scene Representation Learning.
IEEE Robotics Autom. Lett., August, 2024

2011
Study on the influences of quantum well structure on the performance of organic light emitting devices.
Displays, 2011

