Xu Wan

ORCID: 0000-0002-6253-3545

Affiliations:
  • Zhejiang University, Hangzhou, China
  • Alibaba, DAMO Academy, China


According to our database, Xu Wan authored at least 8 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2026
Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty.
CoRR, February, 2026

2025
AdapThink: Adaptive Thinking Preferences for Reasoning Language Model.
CoRR, June, 2025

IVMR suite: An Industrial-scale Virtual Machine Rescheduling Dataset and Benchmark for Elastic Cloud Service.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance.
CoRR, 2024

2023
AdapSafe: Adaptive and Safe-Certified Deep Reinforcement Learning-Based Frequency Control for Carbon-Neutral Power Systems.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Exploring the Vulnerability of Deep Reinforcement Learning-based Emergency Control for Low Carbon Power Systems.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

