Zerong Zheng

Orcid: 0000-0003-1339-2480

According to our database1, Zerong Zheng authored at least 48 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
SpeechAct: Towards Generating Whole-Body Motion From Speech.
IEEE Trans. Vis. Comput. Graph., October, 2025

OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation.
CoRR, August, 2025

DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework.
CoRR, August, 2025

DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers.
CoRR, June, 2025

InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions.
CoRR, June, 2025

ProbIBR: Fast Image-Based Rendering With Learned Probability-Guided Sampling.
IEEE Trans. Vis. Comput. Graph., March, 2025

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models.
CoRR, February, 2025

CyberHost: A One-stage Diffusion Framework for Audio-driven Talking Body Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
360-degree Human Video Generation with 4D Diffusion Transformer.
ACM Trans. Graph., December, 2024

HVTR++: Image and Pose Driven Human Avatars Using Hybrid Volumetric-Textural Rendering.
IEEE Trans. Vis. Comput. Graph., August, 2024

Implicit Surface Representation Using Epanechnikov Mixture Regression.
IEEE Signal Process. Lett., 2024

Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer.
CoRR, 2024

LayGA: Layered Gaussian Avatars for Animatable Clothing Transfer.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

3D Gaussian Parametric Head Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

MeshAvatar: Learning High-Quality Triangular Human Avatars from Multi-view Videos.
Proceedings of the Computer Vision - ECCV 2024, 2024

Gaussian Head Avatar: Ultra High-Fidelity Head Avatar via Dynamic Gaussians.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-Based Human Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Control4D: Efficient 4D Portrait Editing With Text.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Animatable Gaussians: Learning Pose-Dependent Gaussian Maps for High-Fidelity Human Avatar Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RAM-Avatar: Real-time Photo-Realistic Avatar from Monocular Videos with Full-body Control.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
AvatarReX: Real-time Expressive Full-body Avatars.
ACM Trans. Graph., August, 2023

Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor.
CoRR, 2023

PoseVocab: Learning Joint-structured Pose Embeddings for Human Avatar Modeling.
Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023

Leveraging Intrinsic Properties for Non-Rigid Garment Alignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Tensor4D: Efficient Neural 4D Decomposition for High-Fidelity Dynamic Reconstruction and Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
PaMIR: Parametric Model-Conditioned Implicit Representation for Image-Based Human Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Robust and Accurate 3D Self-Portraits in Seconds.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

ProbNVS: Fast Novel View Synthesis with Learned Probability-Guided Sampling.
CoRR, 2022

FloRen: Real-time High-quality Human Performance Rendering via Appearance Flow Using Sparse RGB Cameras.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Implicit Templates for Point-Based Clothed Human Modeling.
Proceedings of the Computer Vision - ECCV 2022, 2022

AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture.
Proceedings of the Computer Vision - ECCV 2022, 2022

Structured Local Radiance Fields for Human Avatar Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

High-Fidelity Human Avatars from a Single RGB Camera.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VERTEX: VEhicle Reconstruction and TEXture Estimation from a Single Image Using Deep Implicit Semantic Template Mapping.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

HVTR: Hybrid Volumetric-Textural Rendering for Human Avatars.
Proceedings of the International Conference on 3D Vision, 2022

2021
DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Deep Implicit Templates for 3D Shape Representation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

POSEFusion: Pose-Guided Selective Fusion for Single-View Human Volumetric Capture.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Function4D: Real-Time Human Volumetric Capture From Very Sparse Consumer RGBD Sensors.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Vehicle Reconstruction and Texture Estimation Using Deep Implicit Semantic Template Mapping.
CoRR, 2020

RobustFusion: Human Volumetric Capture with Data-Driven Visual Cues Using a RGBD Camera.
Proceedings of the Computer Vision - ECCV 2020, 2020

Robust 3D Self-Portraits in Seconds.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
DeepHuman: 3D Human Reconstruction From a Single Image.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SimulCap : Single-View Human Performance Capture With Cloth Simulation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
HybridFusion: Real-Time Performance Capture Using a Single Depth Sensor and Sparse IMUs.
Proceedings of the Computer Vision - ECCV 2018, 2018

DoubleFusion: Real-Time Capture of Human Performances With Inner Body Shapes From a Single Depth Sensor.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018


  Loading...