Willi Menapace

ORCID: 0000-0002-0715-9300

According to our database, Willi Menapace authored at least 40 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers.

CoRR, October, 2025

LayerComposer: Interactive Personalized T2I via Spatially-Aware Layered Canvas.

Kuan-Chieh Jackson Wang

CoRR, October, 2025

AlphaFlow: Understanding and Improving MeanFlow Models.

CoRR, October, 2025

Taming Diffusion Transformer for Real-Time Mobile Video Generation.

CoRR, July, 2025

Improving Progressive Generation with Decomposable Flow Matching.

CoRR, June, 2025

4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation.

CoRR, June, 2025

DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models.

CoRR, June, 2025

H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models.

CoRR, April, 2025

Dynamic Concepts Personalization from Single Videos.

CoRR, February, 2025

Collaborative Neural Painting.

Comput. Vis. Image Underst., 2025

Improving the Diffusability of Autoencoders.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Can Text-to-Video Generation help Video-Language Alignment?

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Mind the Time: Temporally-Controlled Multi-Event Video Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Multi-subject Open-set Personalization in Video Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Promptable Game Models: Text-guided Game Simulation via Masked Diffusion Models.

ACM Trans. Graph., April, 2024

Delving into CLIP latent space for Video Anomaly Recognition.

Comput. Vis. Image Underst., 2024

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation.

CoRR, 2024

VIMI: Grounding Video Generation through Multi-modal Instruction.

CoRR, 2024

Taming Data and Transformers for Audio Generation.

CoRR, 2024

SF-V: Single Forward Video Generation Model.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

VIMI: Grounding Video Generation through Multi-modal Instruction.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Harnessing Large Language Models for Training-Free Video Anomaly Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Hierarchical Patch Diffusion Models for High-Resolution Video Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Interactive Neural Painting.

Comput. Vis. Image Underst., October, 2023

Plotting Behind the Scenes: Towards Learnable Game Engines.

CoRR, 2023

InfiniCity: Infinite-Scale City Synthesis.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Volumetric Animation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Quantum Multi-Model Fitting.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Quantum Motion Segmentation.

Federica Arrigoni, Willi Menapace, Marcel Seelbach Benkner, Elisa Ricci, Vladislav Golyanik

Proceedings of the Computer Vision - ECCV 2022, 2022

Playable Environments: Video Manipulation in Space and Time.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Playable Video Generation.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Deep Learning for Classification and Localization of COVID-19 Markers in Point-of-Care Lung Ultrasound.

IEEE Trans. Medical Imaging, 2020

Learning to Cluster Under Domain Shift.

Willi Menapace, Stéphane Lathuilière, Elisa Ricci

Proceedings of the Computer Vision - ECCV 2020, 2020
