Ram Potham
Bibliography
2025
Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models.
CoRR, June, 2025
Evaluating LLM Agent Adherence to Hierarchical Safety Principles: A Lightweight Benchmark for Probing Foundational Controllability Components.
CoRR, June, 2025