Bingxiang He

According to our database, Bingxiang He authored at least 22 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe.
CoRR, April, 2026

Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization.
CoRR, April, 2026

How Far Can Unsupervised RLVR Scale LLM Training?
CoRR, March, 2026

CPMobius: Iterative Coach-Player Reasoning for Data-Free Reinforcement Learning.
CoRR, February, 2026

Current Agents Fail to Leverage World Model as Tool for Foresight.
CoRR, January, 2026

2025
CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations.
CoRR, December, 2025

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe.
CoRR, December, 2025

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents.
CoRR, November, 2025

Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning.
CoRR, October, 2025

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe.
CoRR, September, 2025

A Survey of Reinforcement Learning for Large Reasoning Models.
CoRR, September, 2025

MiniCPM4: Ultra-Efficient LLMs on End Devices.
CoRR, June, 2025

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset.
CoRR, April, 2025

Process Reinforcement through Implicit Rewards.
CoRR, February, 2025

The Right Time Matters: Data Arrangement Affects Zero-Shot Generalization in Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
EscapeBench: Pushing Language Models to Think Outside the Box.
CoRR, 2024

Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity.
CoRR, 2024

ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Beat LLMs at Their Own Game: Zero-Shot LLM-Generated Text Detection via Querying ChatGPT.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

