Peng Zhang

Affiliations:
  • Alibaba Group, Damo Academy, Hangzhou, China
  • University of Science and Technology of China, Department of Electronic Engineering and Information Science, Hefei, China (former)


According to our database1, Peng Zhang authored at least 29 papers between 2008 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment.
CoRR, April, 2026

SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval.
CoRR, April, 2026

2025
SyncAnyone: Implicit Disentanglement via Progressive Self-Correction for Lip-Syncing in the wild.
CoRR, December, 2025

Knot Forcing: Taming Autoregressive Video Diffusion Models for Real-time Infinite Interactive Portrait Animation.
CoRR, December, 2025

Co-speech Gesture Video Generation via Motion-Based Graph Retrieval.
CoRR, December, 2025

Wan-Animate: Unified Character Animation and Replacement with Holistic Replication.
CoRR, September, 2025

Wan-S2V: Audio-Driven Cinematic Video Generation.
CoRR, August, 2025

MirrorMe: Towards Realtime and High Fidelity Audio-Driven Halfbody Animation.
CoRR, June, 2025

Controllable and Expressive One-Shot Video Head Swapping.
CoRR, June, 2025

MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation.
CoRR, June, 2025

OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication.
CoRR, April, 2025

ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model.
CoRR, March, 2025

Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Exploring Timeline Control for Facial Motion Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior.
Proceedings of the International Conference on 3D Vision, 2025

2024
DanceMeld: Unraveling Dance Phrases with Hierarchical Latent Codes for Music-to-Dance Synthesis.
CoRR, 2024

Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing.
CoRR, 2023

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation.
CoRR, 2023

2022
DART: Articulated Hand Model with Diverse Accessories and Rich Textures.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Text/Speech-Driven Full-Body Animation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

2021
A Virtual Character Generation and Animation System for E-Commerce Live Streaming.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning Position and Target Consistency for Memory-Based Video Object Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2015
SOM: Semantic obviousness metric for image quality assessment.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2011
Peak Tree: A New Tool for Multiscale Hierarchical Representation and Peak Detection of Mass Spectrometry Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2011

A novel tracking-by-encoding scheme based on linear programming matching.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2011), 2011

2010
MAP spatial pyramid mean shift for object tracking.
Proceedings of the Visual Communications and Image Processing 2010, 2010

2008
Peak detection using peak tree approach for mass spectrometry data.
Int. J. Hybrid Intell. Syst., 2008


  Loading...