Subramanyam Sahoo

According to our database, Subramanyam Sahoo authored at least 19 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Calibration Collapse Under Sycophancy Fine-Tuning: How Reward Hacking Breaks Uncertainty Quantification in LLMs.

Subramanyam Sahoo

CoRR, April, 2026

The Reasoning Trap - Logical Reasoning as a Mechanistic Pathway to Situational Awareness.

CoRR, March, 2026

SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement.

CoRR, March, 2026

The Controllability Trap: A Governance Framework for Military AI Agents.

Subramanyam Sahoo

CoRR, March, 2026

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning.

CoRR, March, 2026

Policy myopia as a mechanism of gradual disempowerment in Post-AGI governance, Circa 2049.

Subramanyam Sahoo

CoRR, March, 2026

I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift.

CoRR, March, 2026

Dial E for Ethical Enforcement: institutional VETO power as a governance primitive.

CoRR, March, 2026

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation.

CoRR, February, 2026

2025

The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds.

Subramanyam Sahoo

Jared Junkin

CoRR, December, 2025

The Double Life of Code World Models: Provably Unmasking Malicious Behavior Through Execution Traces.

Subramanyam Sahoo

Jared Junkin

CoRR, December, 2025

Catch Me If You Can: How Smaller Reasoning Models Pretend to Reason with Mathematical Fidelity.

CoRR, December, 2025

Position: The Complexity of Perfect AI Alignment - Formalizing the RLHF Trilemma.

CoRR, November, 2025

The Horcrux: Mechanistically Interpretable Task Decomposition for Detecting and Mitigating Reward Hacking in Embodied AI Systems.

Subramanyam Sahoo

Jared Junkin

CoRR, November, 2025

The Last Vote: A Multi-Stakeholder Framework for Language Model Governance.

Subramanyam Sahoo

Aditi Chhawacharia

CoRR, November, 2025

The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training.

Subramanyam Sahoo

CoRR, November, 2025

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations.

CoRR, November, 2025

2024

Boardwalk Empire: How Generative AI is Revolutionizing Economic Paradigms.

Subramanyam Sahoo

Kamlesh Dutta

CoRR, 2024

DUNE: Decoding Unified Naive Bayes Explainability through Gaussian methods for a Heart Disease Diagnostic.

Subramanyam Sahoo

Kamlesh Dutta

Proceedings of the 15th International Conference on Computing Communication and Networking Technologies, 2024
