Anton Korznikov

According to our database¹, Anton Korznikov authored at least 5 papers between 2025 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of five.

Bibliography

2026

Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines?

CoRR, February, 2026

Feature Drift: How Fine-Tuning Repurposes Representations in LLMs.

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

Out of Distribution, Out of Luck: Process Rewards Misguide Reasoning Models.

Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025

The Rogue Scalpel: Activation Steering Compromises LLM Safety.

CoRR, September, 2025

OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features.

CoRR, September, 2025
