Rohan Subramani

According to our database1, Rohan Subramani authored at least 7 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
How does information access affect LLM monitors' ability to detect sabotage?
CoRR, January, 2026

2025
Password-Activated Shutdown Protocols for Misaligned Frontier Agents.
CoRR, December, 2025

Higher-Order Belief in Incomplete Information MAIDs.
Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

The Partially Observable Off-Switch Game.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Will an AI with Private Information Allow Itself to Be Switched Off?
CoRR, 2024

On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains.
CoRR, 2023


  Loading...