Daniil Gavrilov

According to our database, Daniil Gavrilov authored at least 20 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2025
Teach Old SAEs New Domain Tricks with Boosting.
CoRR, July, 2025

ESSA: Evolutionary Strategies for Scalable Alignment.
CoRR, July, 2025

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy.
CoRR, May, 2025

Train Sparse Autoencoders Efficiently by Utilizing Features Correlation.
CoRR, May, 2025

Steering LLM Reasoning Through Bias-Only Adaptation.
CoRR, May, 2025

You Do Not Fully Utilize Transformer's Representation Capacity.
CoRR, February, 2025

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models.
CoRR, February, 2025

The Differences Between Direct Alignment Algorithms are a Blur.
CoRR, February, 2025

Learn Your Reference Model for Real Good Alignment.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Mechanistic Permutability: Match Features Across Layers.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Linear Transformers with Learnable Kernel Functions are Better In-Context Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Ahead-of-Time P-Tuning.
CoRR, 2023

Democratized Diffusion Language Model.
CoRR, 2023

2022
Linear Interpolation In Parameter Space is Good Enough for Fine-Tuned Language Models.
CoRR, 2022

Classifiers are Better Experts for Controllable Text Generation.
CoRR, 2022

FastRPB: a Scalable Relative Positional Encoding for Long Sequence Tasks.
CoRR, 2022

PALBERT: Teaching ALBERT to Ponder.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Implicit Unlikelihood Training: Improving Neural Text Generation with Reinforcement Learning.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020
Weight Squeezing: Reparameterization for Compression and Fast Inference.
CoRR, 2020

2019
Self-attentive Model for Headline Generation.
Proceedings of the Advances in Information Retrieval, 2019

