Zhongang Cai

Orcid: 0000-0002-1810-3855

According to our database, Zhongang Cai authored at least 51 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Robust Partial-to-Partial Point Cloud Registration in a Full Range.
IEEE Robotics Autom. Lett., 2024

AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation.
CoRR, 2024

WHAC: World-grounded Humans and Cameras.
CoRR, 2024

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Variational Relational Point Completion Network for Robust 3D Classification.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing.
CoRR, 2023

Towards Robust and Expressive Whole-body Human Pose and Shape Estimation.
CoRR, 2023

Digital Life Project: Autonomous 3D Characters with Social Intelligence.
CoRR, 2023

AttriHuman-3D: Editable 3D Human Avatar Generation with Attribute Decomposition and Indexing.
CoRR, 2023

GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting.
CoRR, 2023

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation.
CoRR, 2023

PointHPS: Cascaded 3D Human Pose and Shape Estimation from Point Clouds.
CoRR, 2023

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis.
CoRR, 2023

Learning Dense UV Completion for Human Mesh Recovery.
CoRR, 2023

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering.
CoRR, 2023

ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model.
CoRR, 2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling.
CoRR, 2023

Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction.
CoRR, 2023

Story-to-Motion: Synthesizing Infinite and Controllable Character Animation from Long Text.
Proceedings of the SIGGRAPH Asia 2023 Technical Communications, 2023

FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Robust and Expressive Whole-body Human Pose and Shape Estimation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

BiBench: Benchmarking and Analyzing Network Binarization.
Proceedings of the International Conference on Machine Learning, 2023

ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
AvatarCLIP: zero-shot text-driven generation and animation of 3D avatars.
ACM Trans. Graph., 2022

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model.
CoRR, 2022

Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Monocular 3D Object Reconstruction with GAN Inversion.
Proceedings of the Computer Vision - ECCV 2022, 2022

HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling.
Proceedings of the Computer Vision - ECCV 2022, 2022

PTTR: Relational 3D Point Cloud Object Tracking with Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Versatile Multi-Modal Pre-Training for Human-Centric Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results.
CoRR, 2021

Playing for 3D Human Recovery.
CoRR, 2021

Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.
CoRR, 2021

Garment4D: Garment Reconstruction from Point Cloud Sequences.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

BiPointNet: Binary Neural Network for Point Clouds.
Proceedings of the 9th International Conference on Learning Representations, 2021

CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Unsupervised 3D Shape Completion Through GAN Inversion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Variational Relational Point Completion Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

REFINE: Prediction Fusion Network for Panoptic Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Balanced Activation for Long-tailed Visual Recognition.
CoRR, 2020

Leveraging Localization for Multi-camera Association.
CoRR, 2020

Leveraging Temporal Information for 3D Detection and Domain Adaptation.
CoRR, 2020

MessyTable: Instance Association in Multiple Camera Views.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Siamese Convolutional Neural Network for Sub-millimeter-accurate Camera Pose Estimation and Visual Servoing.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

2018
3D Convolution on RGB-D Point Clouds for Accurate Model-free Object Pose Estimation.
CoRR, 2018

