Subramanyam Sahoo
According to our database1,
Subramanyam Sahoo authored at least 19 papers
between 2024 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Calibration Collapse Under Sycophancy Fine-Tuning: How Reward Hacking Breaks Uncertainty Quantification in LLMs.
CoRR, April, 2026
The Reasoning Trap - Logical Reasoning as a Mechanistic Pathway to Situational Awareness.
CoRR, March, 2026
SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement.
CoRR, March, 2026
CoRR, March, 2026
When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning.
CoRR, March, 2026
Policy myopia as a mechanism of gradual disempowerment in Post-AGI governance, Circa 2049.
CoRR, March, 2026
I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift.
CoRR, March, 2026
CoRR, March, 2026
CoRR, February, 2026
2025
The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds.
CoRR, December, 2025
The Double Life of Code World Models: Provably Unmasking Malicious Behavior Through Execution Traces.
CoRR, December, 2025
Catch Me If You Can: How Smaller Reasoning Models Pretend to Reason with Mathematical Fidelity.
CoRR, December, 2025
CoRR, November, 2025
The Horcrux: Mechanistically Interpretable Task Decomposition for Detecting and Mitigating Reward Hacking in Embodied AI Systems.
CoRR, November, 2025
CoRR, November, 2025
The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training.
CoRR, November, 2025
Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations.
CoRR, November, 2025
2024
CoRR, 2024
DUNE: Decoding Unified Naive Bayes Explainability through Gaussian methods for a Heart Disease Diagnostic.
Proceedings of the 15th International Conference on Computing Communication and Networking Technologies, 2024