Satvik Golechha

Orcid: 0009-0000-5274-1060

According to our database¹, Satvik Golechha authored at least 13 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Building Better Deception Probes Using Targeted Instruction Pairs.

CoRR, February, 2026

2025

ABBEL: LLM Agents Acting through Belief Bottlenecks Expressed in Language.

CoRR, December, 2025

Auditing Games for Sandbagging.

CoRR, December, 2025

Who's the Evil Twin? Differential Auditing for Undesired Behavior.

Ishwar Balappanawar, Venkata Hasith Vattikuti, Greta Kintzley, Ronan Azimi-Mancel, Satvik Golechha

CoRR, August, 2025

CataractBot: An LLM-powered Expert-in-the-Loop Chatbot for Cataract Patients.

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., June, 2025

Among Us: A Sandbox for Agentic Deception.

Satvik Golechha, Adrià Garriga-Alonso

CoRR, April, 2025

Auditing language models for hidden objectives.

CoRR, March, 2025

Modular Training of Neural Networks aids Interpretability.

CoRR, February, 2025

A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

2024

Progress Measures for Grokking on Real-world Datasets.

Satvik Golechha

CoRR, 2024

Position Paper: Toward New Frameworks for Studying Model Representations.

Satvik Golechha, James Dao

CoRR, 2024

NICE: To Optimize In-Context Examples or Not?

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2022

Predicting Treatment Adherence of Tuberculosis Patients at Scale.

Proceedings of the Machine Learning for Health, 2022
