Andrey Zhmoginov

According to our database1, Andrey Zhmoginov authored at least 27 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Contextually Guided Transformers via Low-Rank Adaptation.
CoRR, June, 2025

Projectable Models: One-Shot Generation of Small Specialized Transformers from Large Ones.
CoRR, June, 2025

Long Context In-Context Compression by Getting to the Gist of Gisting.
CoRR, April, 2025

How new data permeates LLM knowledge and how to dilute it.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MELODI: Exploring Memory Compression for Long Contexts.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Continual HyperTransformer: A Meta-Learner for Continual Few-Shot Learning.
Trans. Mach. Learn. Res., 2024

Learning and Unlearning of Fabricated Knowledge in Language Models.
CoRR, 2024

Narrowing the Focus: Learned Optimizers for Pretrained Models.
CoRR, 2024

2023
Continual Few-Shot Learning Using HyperTransformers.
CoRR, 2023

Training trajectories, mini-batch losses and the curious role of the learning rate.
CoRR, 2023

Transformers Learn In-Context by Gradient Descent.
Proceedings of the International Conference on Machine Learning, 2023

Decentralized Learning with Multi-Headed Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning.
Proceedings of the International Conference on Machine Learning, 2022

Fine-tuning Image Transformers using Learnable Memory.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks.
CoRR, 2021

Meta-Learning Bidirectional Update Rules.
Proceedings of the 38th International Conference on Machine Learning, 2021

BasisNet: Two-Stage Model Synthesis for Efficient Inference.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
Large-Scale Generative Data-Free Distillation.
CoRR, 2020

Image segmentation via Cellular Automata.
CoRR, 2020

Information-Bottleneck Approach to Salient Region Discovery.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

2019
K for the Price of 1: Parameter-efficient Multi-task and Transfer Learning.
Proceedings of the 7th International Conference on Learning Representations, 2019

Non-Discriminative Data or Weak Model? On the Relative Importance of Data and Model Resolution.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018
Inverted Residuals and Linear Bottlenecks: Mobile Networks for Classification, Detection and Segmentation.
CoRR, 2018

MobileNetV2: Inverted Residuals and Linear Bottlenecks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
CycleGAN, a Master of Steganography.
CoRR, 2017

The Power of Sparsity in Convolutional Neural Networks.
CoRR, 2017

2016
Inverting face embeddings with convolutional neural networks.
CoRR, 2016


  Loading...