Jacob Drori

According to our database1, Jacob Drori authored at least 4 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Recontextualization Mitigates Specification Gaming without Modifying the Specification.
CoRR, December, 2025

Output Supervision Can Obfuscate the Chain of Thought.
CoRR, November, 2025

Towards a Unified and Verified Understanding of Group-Operation Networks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Unifying and Verifying Mechanistic Interpretations: A Case Study with Group Operations.
CoRR, 2024


  Loading...