Pengfei Li

Orcid: 0009-0006-1880-1297

Affiliations:

Harbin Institute of Technology, School of Mathematics, Harbin, China

According to our database¹, Pengfei Li authored at least 9 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

MARS<sup>2</sup>: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation.

[BibT_eX]

[DOI]

CoRR, April, 2026

WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement.

[BibT_eX]

[DOI]

CoRR, March, 2026

MARTI-MARS<sup>2</sup>: Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

A Survey of Reinforcement Learning for Large Reasoning Models.

[BibT_eX]

[DOI]

CoRR, September, 2025

Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration.

[BibT_eX]

[DOI]

CoRR, June, 2025

Fast and Slow Gradient Approximation for Binary Neural Network Optimization.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring Adversarial Robustness of Deep State Space Models.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring Adversarial Robustness of Deep State Space Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Pengfei Li

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...