Anton Korznikov

According to our database, Anton Korznikov authored at least 5 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2026
Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines?
CoRR, February, 2026

Feature Drift: How Fine-Tuning Repurposes Representations in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

Out of Distribution, Out of Luck: Process Rewards Misguide Reasoning Models.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
The Rogue Scalpel: Activation Steering Compromises LLM Safety.
CoRR, September, 2025

OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features.
CoRR, September, 2025

