Duomin Wang

Orcid: 0009-0004-9507-6741

According to our database1, Duomin Wang authored at least 17 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
PuYun-LDM: A Latent Diffusion Model for High-Resolution Ensemble Weather Forecasts.
CoRR, February, 2026

2025
LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence.
CoRR, September, 2025

UniVerse-1: Unified Audio-Video Generation via Stitching of Experts.
CoRR, September, 2025

Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer.
CoRR, August, 2025

SpeakerVid-5M: A Large-Scale High-Quality Dataset for Audio-Visual Dyadic Interactive Human Generation.
CoRR, July, 2025

Taming Teacher Forcing for Masked Autoregressive Video Generation.
CoRR, January, 2025

Taming Teacher Forcing for Masked Autoregressive Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer.
CoRR, 2024

Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Portrait4D-V2: Pseudo Multi-view Data Creates Better 4D Head Synthesizer.
Proceedings of the Computer Vision - ECCV 2024, 2024

PICTURE: PhotorealistIC Virtual Try-on from UnconstRained dEsigns.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data.
CoRR, 2023

AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents.
CoRR, 2023

Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2020
The $2^\mathrm{nd}$ 106-Point Lightweight Facial Landmark Localization Grand Challenge.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020


  Loading...