Wen Wang

Orcid: 0009-0007-4349-5532

Affiliations:
  • Zhejiang University, College of Information Science and Electronic Engineering, Hangzhou, China


According to our database1, Wen Wang authored at least 26 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MMControl: Unified Multi-Modal Control for Joint Audio-Video Generation.
CoRR, April, 2026

Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration.
CoRR, March, 2026

FreerCustom: Training-Free Multi-Concept Customization for Image and Video Generation.
Int. J. Comput. Vis., January, 2026

Improving Diffusion Language Model Decoding through Joint Search in Generation Order and Token Space.
CoRR, January, 2026

Beyond Hard Masks: Progressive Token Evolution for Diffusion Language Models.
CoRR, January, 2026

2025
Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality.
CoRR, December, 2025

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models.
CoRR, August, 2025

AutoStory: Generating Diverse Storytelling Images with Minimal Human Efforts.
Int. J. Comput. Vis., June, 2025

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration.
CoRR, May, 2025

Efficient Multiple-Precision Floating-Point Multiply-Add Architecture for Deep Learning Applications.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Framer: Interactive Frame Interpolation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence.
CoRR, 2024

Mantissa-Aware Floating-Point Eight-Term Fused Dot Product Unit.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

Object-Aware Inversion and Reassembly for Image Editing.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior.
Proceedings of the Computer Vision - ECCV 2024, 2024

FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
GenDeF: Learning Generative Deformation Field for Video Generation.
CoRR, 2023

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort.
CoRR, 2023

Object-aware Inversion and Reassembly for Image Editing.
CoRR, 2023

SegGPT: Segmenting Everything In Context.
CoRR, 2023

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models.
CoRR, 2023

SegGPT: Towards Segmenting Everything In Context.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Images Speak in Images: A Generalist Painter for In-Context Visual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023


  Loading...