Zelai Xu
Orcid: 0000-0001-9052-9896
According to our database1,
Zelai Xu authored at least 25 papers
between 2022 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation.
CoRR, March, 2026
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning.
CoRR, February, 2026
Optimal Design of Magnetic Suspension Linear Synchronous Motor Based on Six Sigma Method.
IEEE Access, 2026
<i>Earl: </i> Efficient Agentic RL Post-Training for LLMs under Dynamic Context Lengths.
Proceedings of the Sixth European Workshop on Machine Learning and Systems, EuroMLSys 2026, 2026
2025
CoRR, November, 2025
MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games.
CoRR, October, 2025
CoRR, October, 2025
Latent Collective Preference Optimization: A General Framework for Robust LLM Alignment.
CoRR, September, 2025
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments.
CoRR, June, 2025
Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning.
CoRR, May, 2025
AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models.
CoRR, March, 2025
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play.
CoRR, February, 2025
Learning Global Nash Equilibrium in Team Competitive Games with Generalized Fictitious Cross-Play.
J. Mach. Learn. Res., 2025
Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025
2024
Fixed-time convergence of second-order nonlinear systems based on nonsingular fractional sliding mode.
Trans. Inst. Meas. Control, 2024
Fixed-time disturbance observer-based funnel control for controllable excitation linear synchronous motor with state constraints.
J. Syst. Control. Eng., 2024
Dual sliding mode control of linear maglev synchronous motor based on novel extended state observer.
J. Syst. Control. Eng., 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Multi-Agent Vulnerability Discovery for Autonomous Driving Policy by Finding AV-Responsible Scenarios.
Proceedings of the 20th IEEE International Conference on Automation Science and Engineering, 2024
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
2022
An arbitrary Lagrangian-Eulerian method for simulating interfacial dynamics between a hydrogel and a fluid.
J. Comput. Phys., 2022
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022