Bolian Li

Orcid: 0000-0002-1977-0764

According to our database1, Bolian Li authored at least 21 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Uniform-Correct Policy Optimization: Breaking RLVR's Indifference to Diversity.
CoRR, May, 2026

Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control.
CoRR, April, 2026

SARL: Label-Free Reinforcement Learning by Rewarding Reasoning Topology.
CoRR, March, 2026

Learning Self-Correction in Vision-Language Models via Rollout Augmentation.
CoRR, February, 2026

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents.
CoRR, January, 2026

2025
Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning.
CoRR, October, 2025

DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning.
CoRR, October, 2025

From Personal to Collective: On the Role of Local and Global Memory in LLM Personalization.
CoRR, September, 2025

Stacey: Promoting Stochastic Steepest Descent via Accelerated ℓ<sub>p</sub>-Smooth Nonconvex Optimization.
CoRR, June, 2025

More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment.
CoRR, April, 2025

Bayesian Computation in Deep Learning.
CoRR, February, 2025

Making Reliable and Flexible Decisions in Long-tailed Classification.
Trans. Mach. Learn. Res., 2025

Stacey: Promoting Stochastic Steepest Descent via Accelerated ℓp-Smooth Nonconvex Optimization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Cascade Reward Sampling for Efficient Decoding-Time Alignment.
CoRR, 2024

Entropy-MCMC: Sampling from Flat Basins with Ease.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Long-tailed Classification from a Bayesian-decision-theory Perspective.
CoRR, 2023

2022
Graph Communal Contrastive Learning.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Trustworthy Long-Tailed Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Identifying Incorrect Classifications with Balanced Uncertainty.
CoRR, 2021


  Loading...