Daniel Kokotajlo

According to our database, Daniel Kokotajlo authored at least 4 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety.
CoRR, July, 2025

2024
Towards evaluations-based safety cases for AI scheming.
CoRR, 2024

2023
Taken out of context: On measuring situational awareness in LLMs.
CoRR, 2023

Model evaluation for extreme risks.
CoRR, 2023

