Andrey Zhmoginov

According to our database¹, Andrey Zhmoginov authored at least 27 papers between 2016 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Contextually Guided Transformers via Low-Rank Adaptation.

[BibT_eX]

[DOI]

CoRR, June, 2025

Projectable Models: One-Shot Generation of Small Specialized Transformers from Large Ones.

[BibT_eX]

[DOI]

Andrey Zhmoginov

Jihwan Lee

Mark Sandler

CoRR, June, 2025

Long Context In-Context Compression by Getting to the Gist of Gisting.

[BibT_eX]

[DOI]

CoRR, April, 2025

How new data permeates LLM knowledge and how to dilute it.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MELODI: Exploring Memory Compression for Long Contexts.

[BibT_eX]

[DOI]

Jesper Sparre Andersen

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Continual HyperTransformer: A Meta-Learner for Continual Few-Shot Learning.

[BibT_eX]

[DOI]

Max Vladymyrov

Andrey Zhmoginov

Mark Sandler

Trans. Mach. Learn. Res., 2024

Learning and Unlearning of Fabricated Knowledge in Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Narrowing the Focus: Learned Optimizers for Pretrained Models.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Continual Few-Shot Learning Using HyperTransformers.

[BibT_eX]

[DOI]

Max Vladymyrov

Andrey Zhmoginov

Mark Sandler

CoRR, 2023

Training trajectories, mini-batch losses and the curious role of the learning rate.

[BibT_eX]

[DOI]

CoRR, 2023

Transformers Learn In-Context by Gradient Descent.

[BibT_eX]

[DOI]

Alexander Mordvintsev

Andrey Zhmoginov

Max Vladymyrov

Proceedings of the International Conference on Machine Learning, 2023

Decentralized Learning with Multi-Headed Distillation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning.

[BibT_eX]

[DOI]

Andrey Zhmoginov

Mark Sandler

Maksym Vladymyrov

Proceedings of the International Conference on Machine Learning, 2022

Fine-tuning Image Transformers using Learnable Memory.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks.

[BibT_eX]

[DOI]

Andrey Zhmoginov

Dina Bashkirova

Mark Sandler

CoRR, 2021

Meta-Learning Bidirectional Update Rules.

[BibT_eX]

[DOI]

Blaise Agüera y Arcas

Proceedings of the 38th International Conference on Machine Learning, 2021

BasisNet: Two-Stage Model Synthesis for Efficient Inference.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020

Large-Scale Generative Data-Free Distillation.

[BibT_eX]

[DOI]

CoRR, 2020

Image segmentation via Cellular Automata.

[BibT_eX]

[DOI]

Mark Sandler

Andrey Zhmoginov

Liangcheng Luo

Alexander Mordvintsev

Ettore Randazzo

Blaise Agüera y Arcas

CoRR, 2020

Information-Bottleneck Approach to Salient Region Discovery.

[BibT_eX]

[DOI]

Andrey Zhmoginov

Ian Fischer

Mark Sandler

Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

2019

K for the Price of 1: Parameter-efficient Multi-task and Transfer Learning.

[BibT_eX]

[DOI]

Pramod Kaushik Mudrakarta

Mark Sandler

Andrey Zhmoginov

Andrew G. Howard

Proceedings of the 7th International Conference on Learning Representations, 2019

Non-Discriminative Data or Weak Model? On the Relative Importance of Data and Model Resolution.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018

Inverted Residuals and Linear Bottlenecks: Mobile Networks for Classification, Detection and Segmentation.

[BibT_eX]

[DOI]

CoRR, 2018

MobileNetV2: Inverted Residuals and Linear Bottlenecks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

CycleGAN, a Master of Steganography.

[BibT_eX]

[DOI]

Casey Chu

Andrey Zhmoginov

Mark Sandler

CoRR, 2017

The Power of Sparsity in Convolutional Neural Networks.

[BibT_eX]

[DOI]

Soravit Changpinyo

Mark Sandler

Andrey Zhmoginov

CoRR, 2017

2016

Inverting face embeddings with convolutional neural networks.

[BibT_eX]

[DOI]

Andrey Zhmoginov

Mark Sandler

CoRR, 2016

Andrey Zhmoginov

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...