Penghui Qi

According to our database¹, Penghui Qi authored at least 11 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Defeating the Training-Inference Mismatch via FP16.

[BibT_eX]

[DOI]

CoRR, October, 2025

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, June, 2025

Optimizing Anytime Reasoning via Budget Relative Policy Optimization.

[BibT_eX]

[DOI]

CoRR, May, 2025

Understanding R1-Zero-Like Training: A Critical Perspective.

[BibT_eX]

[DOI]

CoRR, March, 2025

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization.

[BibT_eX]

[DOI]

Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024

Balancing Pipeline Parallelism with Vocabulary Parallelism.

[BibT_eX]

[DOI]

CoRR, 2024

Zero Bubble Pipeline Parallelism.

[BibT_eX]

[DOI]

CoRR, 2024

Pipeline Parallelism with Controllable Memory.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Zero Bubble (Almost) Pipeline Parallelism.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2021

SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

2019

Artificial Intelligence for Prosthetics - challenge solutions.

[BibT_eX]

[DOI]

CoRR, 2019

Penghui Qi

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...