Penghui Qi

According to our database1, Penghui Qi authored at least 10 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning.
CoRR, June, 2025

Optimizing Anytime Reasoning via Budget Relative Policy Optimization.
CoRR, May, 2025

Understanding R1-Zero-Like Training: A Critical Perspective.
CoRR, March, 2025

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization.
CoRR, March, 2025

2024
Balancing Pipeline Parallelism with Vocabulary Parallelism.
CoRR, 2024

Zero Bubble Pipeline Parallelism.
CoRR, 2024

Pipeline Parallelism with Controllable Memory.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Zero Bubble (Almost) Pipeline Parallelism.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2021
SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II.
Proceedings of the 38th International Conference on Machine Learning, 2021

2019
Artificial Intelligence for Prosthetics - challenge solutions.
CoRR, 2019


  Loading...