Bo Liu

Orcid: 0000-0001-5426-515X

Affiliations:

National University of Singapore, Singapore
Chinese Academy of Sciences, Institute of Automation, Beijing, China
Peking Univerisity, Beijing, China

According to our database¹, Bo Liu authored at least 17 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2025

GEM: A Gym for Agentic LLMs.

[BibT_eX]

[DOI]

CoRR, October, 2025

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, June, 2025

TextArena.

[BibT_eX]

[DOI]

CoRR, April, 2025

Differentiable Information Enhanced Model-Based Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Grasp Multiple Objects With One Hand.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., May, 2024

Natural Language Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2024

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search.

[BibT_eX]

[DOI]

CoRR, 2024

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data.

[BibT_eX]

[DOI]

CoRR, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism.

[BibT_eX]

[DOI]

CoRR, 2024

2023

TorchOpt: An Efficient Library for Differentiable Optimization.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2023

2022

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, 2021

Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games.

[BibT_eX]

[DOI]

CoRR, 2021

Neural Auto-Curricula in Two-Player Zero-Sum Games.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Correlated Communication Topology in Multi-Agent Reinforcement learning.

[BibT_eX]

[DOI]

Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021

Bo Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...