Pengfei Li

Orcid: 0009-0006-1880-1297

Affiliations:
  • Harbin Institute of Technology, School of Mathematics, Harbin, China


According to our database1, Pengfei Li authored at least 9 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MARS<sup>2</sup>: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation.
CoRR, April, 2026

WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement.
CoRR, March, 2026

MARTI-MARS<sup>2</sup>: Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation.
CoRR, February, 2026

2025
A Survey of Reinforcement Learning for Large Reasoning Models.
CoRR, September, 2025

Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration.
CoRR, June, 2025

Fast and Slow Gradient Approximation for Binary Neural Network Optimization.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing.
CoRR, 2024

Exploring Adversarial Robustness of Deep State Space Models.
CoRR, 2024

Exploring Adversarial Robustness of Deep State Space Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024


  Loading...