Muzhi Dai

According to our database1, Muzhi Dai authored at least 5 papers in 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security.
CoRR, July, 2025

Stable Reinforcement Learning for Efficient Reasoning.
CoRR, May, 2025

S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models.
CoRR, May, 2025

From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models.
CoRR, March, 2025

CryoDomain: Sequence-free Protein Domain Identification from Low-resolution Cryo-EM Density Maps.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025


  Loading...