Yanhong Zeng

Orcid: 0000-0003-3596-5163

According to our database¹, Yanhong Zeng authored at least 35 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives.

[BibT_eX]

[DOI]

CoRR, May, 2026

StyleShot: A Snapshot on Any Style.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., February, 2026

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., January, 2026

Advancing Open-source World Models.

[BibT_eX]

[DOI]

CoRR, January, 2026

PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

2025

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text.

[BibT_eX]

[DOI]

CoRR, December, 2025

Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation.

[BibT_eX]

[DOI]

CoRR, December, 2025

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues.

[BibT_eX]

[DOI]

CoRR, December, 2025

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives.

[BibT_eX]

[DOI]

CoRR, October, 2025

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset.

[BibT_eX]

[DOI]

CoRR, October, 2025

CharacterShot: Controllable and Consistent 4D Character Animation.

[BibT_eX]

[DOI]

CoRR, August, 2025

AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models.

[BibT_eX]

[DOI]

CoRR, May, 2025

WORLDMEM: Long-term Consistent World Simulation with Memory.

[BibT_eX]

[DOI]

CoRR, April, 2025

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

[BibT_eX]

[DOI]

CoRR, March, 2025

Multi-Identity Human Image Animation with Structural Video Diffusion.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Exposure-Limited Image Enhancement with Generative Diffusion Prior.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computational Photography, 2025

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds.

[BibT_eX]

[DOI]

CoRR, 2024

StyleShot: A Snapshot on Any Style.

[BibT_eX]

[DOI]

CoRR, 2024

Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior.

[BibT_eX]

[DOI]

CoRR, 2024

MotionBooth: Motion-Aware Customized Text-to-Video Generation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

A Task Is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Aggregated Contextual Transformations for High-Resolution Image Inpainting.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., July, 2023

2022

Degradation-Guided Meta-Restoration Network for Blind Super-Resolution.

[BibT_eX]

[DOI]

CoRR, 2022

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020

Learning Semantic-aware Normalization for Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Joint Spatial-Temporal Transformations for Video Inpainting.

[BibT_eX]

[DOI]

Yanhong Zeng

Jianlong Fu

Hongyang Chao

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2017

3D Human Body Reshaping with Anthropometric Modeling.

[BibT_eX]

[DOI]

Yanhong Zeng

Jianlong Fu

Hongyang Chao

Proceedings of the Internet Multimedia Computing and Service, 2017

Yanhong Zeng

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...