Andrey Kuznetsov

Orcid: 0000-0001-6446-8663

According to our database1, Andrey Kuznetsov authored at least 81 papers between 2010 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
SHIFT: Steering Hidden Intermediates in Flow Transformers.
CoRR, April, 2026

CADReasoner: Iterative Program Editing for CAD Reverse Engineering.
CoRR, March, 2026

Calibri: Enhancing Diffusion Transformers via Parameter-Efficient Calibration.
CoRR, March, 2026

Marchuk: Efficient Global Weather Forecasting from Mid-Range to Sub-Seasonal Scales via Flow Matching.
CoRR, March, 2026

CADEvolve: Creating Realistic CAD via Program Evolution.
CoRR, February, 2026

Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities.
CoRR, February, 2026

CoMa: Contextual Massing Generation with Vision-Language Models.
CoRR, January, 2026

GHOST 2.0: Generative high-fidelity one shot transfer of heads.
AI Open, 2026

ESQA: Event Sequences Question Answering.
IEEE Access, 2026

SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

Bring the Apple, Not the Sofa: Impact of Irrelevant Context in Embodied AI Commands on VLA Models.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

T-LoRA: Single Image Diffusion Model Customization Without Overfitting.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

BREPS: Bounding-Box Robustness Evaluation of Promptable Segmentation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

NoReGeo: Non-Reasoning Geometry Benchmark.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
MindShift: Analyzing Language Models' Reactions to Psychological Prompts.
CoRR, December, 2025

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms.
CoRR, November, 2025

Simple Vision-Language Math Reasoning via Rendered Text.
CoRR, November, 2025

RoboBenchMart: Benchmarking Robots in Retail Environment.
CoRR, November, 2025

Multi-Agent GraphRAG: A Text-to-Cypher Framework for Labeled Property Graphs.
CoRR, November, 2025

Sentence-Anchored Gist Compression for Long-Context LLMs.
CoRR, November, 2025

Real-World Transferable Adversarial Attack on Face-Recognition Systems.
CoRR, September, 2025

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens.
CoRR, August, 2025

Speech-to-LaTeX: New Models and Datasets for Converting Spoken Equations and Sentences.
CoRR, August, 2025

Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback.
CoRR, July, 2025

Listener-Rewarded Thinking in VLMs for Image Preferences.
CoRR, June, 2025

Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models.
CoRR, June, 2025

Inverting Black-Box Face Recognition Systems via Zero-Order Optimization in Eigenface Space.
CoRR, June, 2025

Image Reconstruction as a Tool for Feature Analysis.
CoRR, June, 2025

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models.
CoRR, June, 2025

ImageReFL: Balancing Quality and Diversity in Human-Aligned Diffusion Models.
CoRR, May, 2025

FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention.
CoRR, May, 2025

DreamBoothDPO: Improving Personalized Generation using Direct Preference Optimization.
CoRR, May, 2025

Generalized Fisher-Weighted SVD: Scalable Kronecker-Factored Fisher Approximation for Compressing Large Language Models.
CoRR, May, 2025

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation.
CoRR, March, 2025

MOVE: A Mixture-of-Vision-Encoders Approach for Domain-Focused Vision-Language Processing.
CoRR, February, 2025

Toward Cybersecurity Testing and Monitoring of IoT Ecosystems.
CoRR, February, 2025

Universal Adversarial Attack on Aligned Multimodal LLMs.
CoRR, February, 2025

MaterialFusion: High-Quality, Zero-Shot, and Controllable Material Transfer with Diffusion Models.
CoRR, February, 2025

MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding.
CoRR, February, 2025

ImproveYourVideos: Architectural Improvements for Text-to-Video Generation Pipeline.
IEEE Access, 2025

SODAOpt: Socio-Demographic and Textual Adaptive Fusion for Optimizing Developer Task Assignment.
Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering, 2025

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Understanding the Limitations of Deep Transformer Models for Sea Ice Forecasting.
Proceedings of the Computational Science - ICCS 2025, 2025

2024
Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality.
CoRR, 2024

Unleashing the power of novel conditional generative approaches for new materials discovery.
CoRR, 2024

ESQA: Event Sequences Question Answering.
CoRR, 2024

OmniFusion Technical Report.
CoRR, 2024

Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Your Transformer is Secretly Linear.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
MineralImage5k: A benchmark for zero-shot raw mineral visual recognition and description.
Comput. Geosci., September, 2023

Kandinsky 3.0 Technical Report.
CoRR, 2023

FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline.
CoRR, 2023

Revising deep learning methods in parking lot occupancy detection.
CoRR, 2023

RusTitW: Russian Language Text Dataset for Visual Text in-the-Wild Recognition.
CoRR, 2023

Kandinsky: An Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
RuCLIP - new models and experiments: a technical report.
CoRR, 2022

A new face swap method for image and video domains: a technical report.
CoRR, 2022

GHOST - A New Face Swap Approach for Image and Video Domains.
IEEE Access, 2022

2020
Remote Sensing Image Inpainting with Generative Adversarial Networks.
Proceedings of the 8th International Symposium on Digital Forensics and Security, 2020

Two-Stage Classification Model for Feather Images Identification.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

A New Sport Teams Logo Dataset for Detection Tasks.
Proceedings of the Computer Vision and Graphics - International Conference, 2020

2019
Digital video forgery detection based on statistical features calculation.
Proceedings of the Twelfth International Conference on Machine Vision, 2019

Face Recognition Using DELF Feature Descriptors on RGB-D Data.
Proceedings of the Analysis of Images, Social Networks and Texts, 2019

2018
Person reidentification on video surveillance data.
Proceedings of the Eleventh International Conference on Machine Vision, 2018

Camera Sensor Traces Analysis in Image Forgery Detection Problem.
Proceedings of the Computer Vision and Graphics - International Conference, 2018

Copy-Move Detection Based on Different Forms of Local Binary Patterns.
Proceedings of the Analysis of Images, Social Networks and Texts, 2018

2017
A Copy-Move Detection Algorithm Based on Geometric Local Binary Pattern.
Proceedings of the Digital Communication. Towards a Smart and Secure Future Internet, 2017

Satellite Image Forgery Detection Based on Buildings Shadows Analysis.
Proceedings of the Analysis of Images, Social Networks and Texts, 2017

2016
A Copy-Move Detection Algorithm Using Binary Gradient Contours.
Proceedings of the Image Analysis and Recognition - 13th International Conference, 2016

Remote Sensing Data Copy-Move Forgery Protection Algorithm.
Proceedings of the Computer Vision and Graphics - International Conference, 2016

Using Efficient Linear Local Features in the Copy-Move Forgery Detection Task.
Proceedings of the Analysis of Images, Social Networks and Texts, 2016

2015
An evaluation of popular hyperspectral images classification approaches.
Proceedings of the Eighth International Conference on Machine Vision, 2015

Approach to building a web-based expert system interface and its application for software provisioning in clouds.
Proceedings of the 2015 Federated Conference on Computer Science and Information Systems, 2015

Understanding Software Provisioning: An Ontological View.
Proceedings of the Databases in Networked Information Systems, 2015

Remote Sensing Data Verification Using Model-Oriented Descriptors.
Proceedings of the Analysis of Images, Social Networks and Texts, 2015

2014
Leveraging User Experience through Input Style Transformation to Improve Access to Music Search Services.
Informatica (Slovenia), 2014

A Provisioning Service for Automatic Command Line Applications Deployment in Computing Clouds.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

2013
An Approach for Developing a Mobile Accessed Music Search Integration Platform.
Proceedings of the 2013 Federated Conference on Computer Science and Information Systems, 2013

2012
Function-based and circuit-based symbolic music representation, or back to Beethoven.
Proceedings of the Joint International Conference on Human-Centered Computer Environments, 2012

2010
Searching for music: from melodies in mind to the resources on the web.
Proceedings of the International Conference on Humans and Computers, 2010


  Loading...