Rohan Subramani

According to our database¹, Rohan Subramani authored at least 7 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

How does information access affect LLM monitors' ability to detect sabotage?

[BibT_eX]

[DOI]

CoRR, January, 2026

2025

Password-Activated Shutdown Protocols for Misaligned Frontier Agents.

[BibT_eX]

[DOI]

Kai Williams

Rohan Subramani

Francis Rhys Ward

CoRR, December, 2025

Higher-Order Belief in Incomplete Information MAIDs.

[BibT_eX]

[DOI]

Jack Foxabbott

Rohan Subramani

Francis Rhys Ward

Proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems, 2025

The Partially Observable Off-Switch Game.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Will an AI with Private Information Allow Itself to Be Switched Off?

[BibT_eX]

[DOI]

CoRR, 2024

On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning.

[BibT_eX]

[DOI]

Joar Max Viktor Skalse

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains.

[BibT_eX]

[DOI]

CoRR, 2023

Rohan Subramani

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...