Subramanyam Sahoo

According to our database, Subramanyam Sahoo authored at least 19 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Calibration Collapse Under Sycophancy Fine-Tuning: How Reward Hacking Breaks Uncertainty Quantification in LLMs.
CoRR, April, 2026

The Reasoning Trap - Logical Reasoning as a Mechanistic Pathway to Situational Awareness.
CoRR, March, 2026

SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement.
CoRR, March, 2026

The Controllability Trap: A Governance Framework for Military AI Agents.
CoRR, March, 2026

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning.
CoRR, March, 2026

Policy myopia as a mechanism of gradual disempowerment in Post-AGI governance, Circa 2049.
CoRR, March, 2026

I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift.
CoRR, March, 2026

Dial E for Ethical Enforcement: institutional VETO power as a governance primitive.
CoRR, March, 2026

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation.
CoRR, February, 2026

2025
The Deepfake Detective: Interpreting Neural Forensics Through Sparse Features and Manifolds.
CoRR, December, 2025

The Double Life of Code World Models: Provably Unmasking Malicious Behavior Through Execution Traces.
CoRR, December, 2025

Catch Me If You Can: How Smaller Reasoning Models Pretend to Reason with Mathematical Fidelity.
CoRR, December, 2025

Position: The Complexity of Perfect AI Alignment - Formalizing the RLHF Trilemma.
CoRR, November, 2025

The Horcrux: Mechanistically Interpretable Task Decomposition for Detecting and Mitigating Reward Hacking in Embodied AI Systems.
CoRR, November, 2025

The Last Vote: A Multi-Stakeholder Framework for Language Model Governance.
CoRR, November, 2025

The Good, The Bad, and The Hybrid: A Reward Structure Showdown in Reasoning Models Training.
CoRR, November, 2025

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations.
CoRR, November, 2025

2024
Boardwalk Empire: How Generative AI is Revolutionizing Economic Paradigms.
CoRR, 2024

DUNE: Decoding Unified Naive Bayes Explainability through Gaussian methods for a Heart Disease Diagnostic.
Proceedings of the 15th International Conference on Computing Communication and Networking Technologies, 2024

