Chloe Li

According to our database1, Chloe Li authored at least 4 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Model Spec Midtraining: Improving How Alignment Training Generalizes.
CoRR, May, 2026

2025
Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives.
CoRR, November, 2025

LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring.
CoRR, August, 2025

LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring.
Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2025


  Loading...