Muzhi Dai

According to our database¹, Muzhi Dai authored at least 5 papers in 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security.

[BibT_eX]

[DOI]

CoRR, July, 2025

Stable Reinforcement Learning for Efficient Reasoning.

[BibT_eX]

[DOI]

Muzhi Dai

Shixuan Liu

Qingyi Si

CoRR, May, 2025

S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models.

[BibT_eX]

[DOI]

Muzhi Dai

Chenxu Yang

Qingyi Si

CoRR, May, 2025

From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models.

[BibT_eX]

[DOI]

CoRR, March, 2025

CryoDomain: Sequence-free Protein Domain Identification from Low-resolution Cryo-EM Density Maps.

[BibT_eX]

[DOI]

Qiangfeng Cliff Zhang

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Muzhi Dai

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...