Muzhi Dai

Orcid: 0009-0004-6070-8470

According to our database1, Muzhi Dai authored at least 5 papers in 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Stable Reinforcement Learning for Efficient Reasoning.
CoRR, May, 2025

S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models.
CoRR, May, 2025

From Captions to Rewards (CaReVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

CryoDomain: Sequence-free Protein Domain Identification from Low-resolution Cryo-EM Density Maps.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025


  Loading...