Zelai Xu

Orcid: 0000-0001-9052-9896

According to our database1, Zelai Xu authored at least 25 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation.
CoRR, March, 2026

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning.
CoRR, February, 2026

Optimal Design of Magnetic Suspension Linear Synchronous Motor Based on Six Sigma Method.
IEEE Access, 2026

<i>Earl: </i> Efficient Agentic RL Post-Training for LLMs under Dynamic Context Lengths.
Proceedings of the Sixth European Workshop on Machine Learning and Systems, EuroMLSys 2026, 2026

2025
Extending Test-Time Scaling: A 3D Perspective with Context, Batch, and Turn.
CoRR, November, 2025

MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games.
CoRR, October, 2025

EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models.
CoRR, October, 2025

Latent Collective Preference Optimization: A General Framework for Robust LLM Alignment.
CoRR, September, 2025

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments.
CoRR, June, 2025

Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning.
CoRR, May, 2025

AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models.
CoRR, March, 2025

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play.
CoRR, February, 2025

Learning Global Nash Equilibrium in Team Competitive Games with Generalized Fictitious Cross-Play.
J. Mach. Learn. Res., 2025

Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Fixed-time convergence of second-order nonlinear systems based on nonsingular fractional sliding mode.
Trans. Inst. Meas. Control, 2024

Fixed-time disturbance observer-based funnel control for controllable excitation linear synchronous motor with state constraints.
J. Syst. Control. Eng., 2024

Dual sliding mode control of linear maglev synchronous motor based on novel extended state observer.
J. Syst. Control. Eng., 2024

A Survey on Self-play Methods in Reinforcement Learning.
CoRR, 2024

Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Multi-Agent Vulnerability Discovery for Autonomous Driving Policy by Finding AV-Responsible Scenarios.
Proceedings of the 20th IEEE International Conference on Automation Science and Engineering, 2024

Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Fictitious Cross-Play: Learning Global Nash Equilibrium in Mixed Cooperative-Competitive Games.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
An arbitrary Lagrangian-Eulerian method for simulating interfacial dynamics between a hydrogel and a fluid.
J. Comput. Phys., 2022

Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Texture BERT for Cross-modal Texture Image Retrieval.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022


  Loading...