Clive Bai

According to our database¹, Clive Bai authored at least 5 papers in 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

ADWIN: Adaptive Windows for Horizon-Aware On-Policy Distillation.

CoRR, May, 2026

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex.

CoRR, May, 2026

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models.

CoRR, February, 2026

Small Generalizable Prompt Predictive Models Can Steer Efficient RL Post-Training of Large Reasoning Models.

CoRR, February, 2026

ORBIT: On-policy Exploration-Exploitation for Controllable Multi-Budget Reasoning.

CoRR, January, 2026
