Xu Wan

Orcid: 0000-0002-6253-3545

Affiliations:
  • Zhejiang University, Hangzhou, China
  • Alibaba, DAMO Acedemy, China


According to our database1, Xu Wan authored at least 6 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
AdapThink: Adaptive Thinking Preferences for Reasoning Language Model.
CoRR, June, 2025

Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making.
CoRR, June, 2025

SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance.
CoRR, 2024

2023
AdapSafe: Adaptive and Safe-Certified Deep Reinforcement Learning-Based Frequency Control for Carbon-Neutral Power Systems.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Exploring the Vulnerability of Deep Reinforcement Learning-based Emergency Control for Low Carbon Power Systems.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022


  Loading...