Yaroslav Aksenov

According to our database, Yaroslav Aksenov authored at least 9 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2025
Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors.
CoRR, September, 2025

Teach Old SAEs New Domain Tricks with Boosting.
CoRR, July, 2025

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy.
CoRR, May, 2025

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation.
CoRR, May, 2025

You Do Not Fully Utilize Transformer's Representation Capacity.
CoRR, February, 2025

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Learn Your Reference Model for Real Good Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Linear Transformers with Learnable Kernel Functions are Better In-Context Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

