Bo Liu

Orcid: 0000-0001-5426-515X

Affiliations:
  • National University of Singapore, Singapore
  • Chinese Academy of Sciences, Institute of Automation, Beijing, China
  • Peking University, Beijing, China


According to our database, Bo Liu authored at least 17 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
GEM: A Gym for Agentic LLMs.
CoRR, October, 2025

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning.
CoRR, June, 2025

TextArena.
CoRR, April, 2025

Differentiable Information Enhanced Model-Based Reinforcement Learning.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Grasp Multiple Objects With One Hand.
IEEE Robotics Autom. Lett., May, 2024

Natural Language Reinforcement Learning.
CoRR, 2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search.
CoRR, 2024

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data.
CoRR, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding.
CoRR, 2024

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism.
CoRR, 2024

2023
TorchOpt: An Efficient Library for Differentiable Optimization.
J. Mach. Learn. Res., 2023

2022
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning.
CoRR, 2021

Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games.
CoRR, 2021

Neural Auto-Curricula in Two-Player Zero-Sum Games.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Correlated Communication Topology in Multi-Agent Reinforcement Learning.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

