Willi Menapace

Orcid: 0000-0002-0715-9300

According to our database, Willi Menapace authored at least 37 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Taming Diffusion Transformer for Real-Time Mobile Video Generation.
CoRR, July, 2025

Improving Progressive Generation with Decomposable Flow Matching.
CoRR, June, 2025

4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation.
CoRR, June, 2025

DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models.
CoRR, June, 2025

H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models.
CoRR, April, 2025

Dynamic Concepts Personalization from Single Videos.
CoRR, February, 2025

Improving the Diffusability of Autoencoders.
CoRR, February, 2025

Collaborative Neural Painting.
Comput. Vis. Image Underst., 2025

VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Can Text-to-Video Generation help Video-Language Alignment?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Mind the Time: Temporally-Controlled Multi-Event Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Multi-subject Open-set Personalization in Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Promptable Game Models: Text-guided Game Simulation via Masked Diffusion Models.
ACM Trans. Graph., April, 2024

Delving into CLIP latent space for Video Anomaly Recognition.
Comput. Vis. Image Underst., 2024

AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation.
CoRR, 2024

VIMI: Grounding Video Generation through Multi-modal Instruction.
CoRR, 2024

Taming Data and Transformers for Audio Generation.
CoRR, 2024

SF-V: Single Forward Video Generation Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

VIMI: Grounding Video Generation through Multi-modal Instruction.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Harnessing Large Language Models for Training-Free Video Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Hierarchical Patch Diffusion Models for High-Resolution Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Interactive Neural Painting.
Comput. Vis. Image Underst., October, 2023

Plotting Behind the Scenes: Towards Learnable Game Engines.
CoRR, 2023

InfiniCity: Infinite-Scale City Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Volumetric Animation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Quantum Multi-Model Fitting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Quantum Motion Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Playable Environments: Video Manipulation in Space and Time.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Playable Video Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Deep Learning for Classification and Localization of COVID-19 Markers in Point-of-Care Lung Ultrasound.
IEEE Trans. Medical Imaging, 2020

Learning to Cluster Under Domain Shift.
Proceedings of the Computer Vision - ECCV 2020, 2020

