Zhiyang Dou

ORCID: 0000-0003-0186-8269

According to our database, Zhiyang Dou authored at least 70 papers between 2021 and 2026.

Bibliography

2026
QuadLink: Autoregressive Quad-Dominant Mesh Generation via Point-Relation Learning.
CoRR, May, 2026

RigidFormer: Learning Rigid Dynamics using Transformers.
CoRR, May, 2026

Soft Anisotropic Diagrams for Differentiable Image Representation.
CoRR, April, 2026

TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation.
CoRR, April, 2026

GaussiAnimate: Reconstruct and Rig Animatable Categories with Level of Dynamics.
CoRR, April, 2026

UNIC: Neural Garment Deformation Field for Real-time Clothed Character Animation.
CoRR, March, 2026

UMO: Unified In-Context Learning Unlocks Motion Foundation Model Priors.
CoRR, March, 2026

Wonder3D++: Cross-Domain Diffusion for High-Fidelity 3D Generation From a Single Image.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2026

GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training.
CoRR, February, 2026

CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos.
CoRR, January, 2026

2025
CFC: Simulating Character-Fluid Coupling using a Two-Level World Model.
ACM Trans. Graph., December, 2025

KISSColor: Kinetic and Intuitive Stroke Stretching for Vector Drawing Colorization.
ACM Trans. Graph., December, 2025

EgoReAct: Egocentric Video-Driven 3D Human Reaction Generation.
CoRR, December, 2025

Switch-JustDance: Benchmarking Whole Body Motion Tracking Policies Using a Commercial Console Game.
CoRR, November, 2025

Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation.
CoRR, October, 2025

PartSAM: A Scalable Promptable Part Segmentation Model Trained on Native 3D Data.
CoRR, September, 2025

MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly.
CoRR, September, 2025

PDT: Point Distribution Transformation with Diffusion Models.
CoRR, July, 2025

MOSPA: Human Motion Generation Driven by Spatial Audio.
CoRR, July, 2025

CoDA: Coordinated Diffusion Noise Optimization for Whole-Body Manipulation of Articulated Objects.
CoRR, May, 2025

SymbioSim: Human-in-the-loop Simulation Platform for Bidirectional Continuing Learning in Human-Robot Interaction.
CoRR, February, 2025

ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking.
CoRR, January, 2025

WonderHuman: Hallucinating Unseen Parts in Dynamic 3D Human Reconstruction.
IEEE Trans. Vis. Comput. Graph., 2025

ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts.
IEEE Trans. Image Process., 2025

Motion2Motion: Cross-topology Motion Transfer with Sparse Correspondence.
Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

SymBridge: A Human-in-the-Loop Cyber-Physical Interactive System for Adaptive Human-Robot Symbiosis.
Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

PDT: Point Distribution Transformation with Diffusion Models.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

🎧MOSPA: Human Motion Generation Driven by Spatial Audio.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

CityAnchor: City-scale 3D Visual Grounding with Multi-modality LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ModSkill: Physical Character Skill Modularization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Go to Zero: Towards Zero-Shot Motion Generation with Million-Scale Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Align3R: Aligned Monocular Depth Estimation for Dynamic Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Boosting Segment Anything Model Towards Open-Vocabulary Learning.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
CBIL: Collective Behavior Imitation Learning for Fish from Real Videos.
ACM Trans. Graph., December, 2024

Coverage Axis++: Efficient Inner Point Selection for 3D Shape Skeletonization.
Comput. Graph. Forum, August, 2024

GaGA: Towards Interactive Global Geolocation Assistant.
CoRR, 2024

SIMS: Simulating Human-Scene Interactions with Real World Script Planning.
CoRR, 2024

MotionWavelet: Human Motion Prediction via Wavelet Manifold Learning.
CoRR, 2024

ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts.
CoRR, 2024

Dynamic Realms: 4D Content Analysis, Recovery and Generation with Geometric, Topological and Physical Priors.
CoRR, 2024

AutoCV: Empowering Reasoning with Automated Process Labeling via Confidence Variation.
CoRR, 2024

LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment.
CoRR, 2024

Coverage Axis++: Efficient Inner Point Selection for 3D Shape Skeletonization.
CoRR, 2024

Part123: Part-aware 3D Reconstruction from a Single-view Image.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

AutoPSV: Automated Process-Supervised Verifier.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

EMDM: Efficient Motion Diffusion Model for Fast and High-Quality Motion Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Disentangled Clothed Avatar Generation from Text Descriptions.
Proceedings of the Computer Vision - ECCV 2024, 2024

TLControl: Trajectory and Language Control for Human Motion Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024

Wonder3D: Single Image to 3D Using Cross-Domain Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Globally Consistent Normal Orientation for Point Clouds by Regularizing the Winding-Number Field.
ACM Trans. Graph., August, 2023

Disentangled Clothed Avatar Generation from Text Descriptions.
CoRR, 2023

Boosting Segment Anything Model Towards Open-Vocabulary Learning.
CoRR, 2023

EMDM: Efficient Motion Diffusion Model for Fast, High-Quality Motion Generation.
CoRR, 2023

Surf-D: High-Quality Surface Generation for Arbitrary Topologies using Diffusion Models.
CoRR, 2023

C·ASE: Learning Conditional Adversarial Skill Embeddings for Physics-based Characters.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
RFEPS: Reconstructing Feature-Line Equipped Polygonal Surface.
ACM Trans. Graph., 2022

Coverage Axis: Inner Point Selection for 3D Shape Skeletonization.
Comput. Graph. Forum, 2022

2021
Top-Down Shape Abstraction Based on Greedy Pole Selection.
IEEE Trans. Vis. Comput. Graph., 2021

