Ivan Skorokhodov

ORCID: 0000-0002-7611-9310

According to our database, Ivan Skorokhodov authored at least 47 papers between 2018 and 2025.

Bibliography

2025
Taming Diffusion Transformer for Real-Time Mobile Video Generation.
CoRR, July, 2025

Improving Progressive Generation with Decomposable Flow Matching.
CoRR, June, 2025

4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation.
CoRR, June, 2025

DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models.
CoRR, June, 2025

H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models.
CoRR, April, 2025

Dynamic Concepts Personalization from Single Videos.
CoRR, February, 2025

Improving the Diffusability of Autoencoders.
CoRR, February, 2025

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Mind the Time: Temporally-Controlled Multi-Event Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Factored-NeuS: Reconstructing Surfaces, Illumination, and Materials of Possibly Glossy Objects.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Multi-subject Open-set Personalization in Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation.
CoRR, 2024

VIMI: Grounding Video Generation through Multi-modal Instruction.
CoRR, 2024

VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing.
CoRR, 2024

AToM: Amortized Text-to-Mesh using 2D Diffusion.
CoRR, 2024

SF-V: Single Forward Video Generation Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

VIMI: Grounding Video Generation through Multi-modal Instruction.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

TC4D: Trajectory-Conditioned Text-to-4D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hierarchical Patch Diffusion Models for High-Resolution Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Adversarial Text to Continuous Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Factored-NeuS: Reconstructing Surfaces, Illumination, and Materials of Possibly Glossy Objects.
CoRR, 2023

3D generation on ImageNet.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Continual Zero-Shot Learning through Semantically Guided Generative Random Walks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SATR: Zero-Shot Semantic Segmentation of 3D Shapes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-aware Scene Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Volumetric Animation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Improved surface reconstruction using high-frequency details.
CoRR, 2022

3DRefTransformer: Fine-Grained Object Identification in Real-World Scenes Using Natural Language.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

EpiGRAF: Rethinking training of 3D GANs.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HF-NeuS: Improved Surface Reconstruction Using High-Frequency Details.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Creative Walk Adversarial Networks: Novel Art Generation with Probabilistic Random Walk Deviation from Style Norms.
Proceedings of the 13th International Conference on Computational Creativity, 2022

StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Imaginative Walks: Generative Random Walk Deviation Loss for Improved Unseen Learning Representation.
CoRR, 2021

Class Normalization for (Continual)? Generalized Zero-Shot Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Aligning Latent and Image Spaces to Connect the Unconnectable.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Adversarial Generation of Continuous Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Interpolating Points on a Non-Uniform Grid using a Mixture of Gaussians.
CoRR, 2020

Normalization Matters in Zero-Shot Learning.
CoRR, 2020

2019
Loss Landscape Sightseeing with Multi-Point Optimization.
CoRR, 2019

2018
Semi-Supervised Neural Machine Translation with Language Models.
Proceedings of the Workshop on Technologies for MT of Low Resource Languages, 2018

