Derek Li
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Omni-Thinker: Scaling Cross-Domain Generalization in LLMs via Multi-Task RL with Hybrid Rewards.
CoRR, July, 2025
Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs.
CoRR, July, 2025
Proceedings of the International Conference on Computing, Networking and Communications, 2025