Yanhong Zeng

Orcid: 0000-0003-3596-5163

According to our database, Yanhong Zeng authored at least 34 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
StyleShot: A Snapshot on Any Style.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2026

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds.
Int. J. Comput. Vis., January, 2026

Advancing Open-source World Models.
CoRR, January, 2026

PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text.
CoRR, December, 2025

Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation.
CoRR, December, 2025

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues.
CoRR, December, 2025

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives.
CoRR, October, 2025

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset.
CoRR, October, 2025

CharacterShot: Controllable and Consistent 4D Character Animation.
CoRR, August, 2025

AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models.
CoRR, May, 2025

WORLDMEM: Long-term Consistent World Simulation with Memory.
CoRR, April, 2025

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?
CoRR, March, 2025

Multi-Identity Human Image Animation with Structural Video Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Exposure-Limited Image Enhancement with Generative Diffusion Prior.
Proceedings of the IEEE International Conference on Computational Photography, 2025

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models.
CoRR, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds.
CoRR, 2024

StyleShot: A Snapshot on Any Style.
CoRR, 2024

Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior.
CoRR, 2024

MotionBooth: Motion-Aware Customized Text-to-Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

A Task Is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting.
Proceedings of the Computer Vision - ECCV 2024, 2024

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Aggregated Contextual Transformations for High-Resolution Image Inpainting.
IEEE Trans. Vis. Comput. Graph., July, 2023

2022
Degradation-Guided Meta-Restoration Network for Blind Super-Resolution.
CoRR, 2022

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Learning Semantic-aware Normalization for Generative Adversarial Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Joint Spatial-Temporal Transformations for Video Inpainting.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2017
3D Human Body Reshaping with Anthropometric Modeling.
Proceedings of the Internet Multimedia Computing and Service, 2017
