Clive Bai

According to our database1, Clive Bai authored at least 5 papers in 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
ADWIN: Adaptive Windows for Horizon-Aware On-Policy Distillation.
CoRR, May, 2026

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex.
CoRR, May, 2026

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models.
CoRR, February, 2026

Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models.
CoRR, February, 2026

ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning.
CoRR, January, 2026


  Loading...