Xiaowei Zhou

Orcid: 0000-0003-1926-5597

Affiliations:
  • Zhejiang University, Hangzhou, China
  • University of Pennsylvania, Department of Computer and Information Science, Philadelphia, PA, USA
  • Hong Kong University of Science and Technology, Tai Po Tsai, Hong Kong (PhD)


According to our database1, Xiaowei Zhou authored at least 158 papers between 2010 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Precise Action-to-Video Generation Through Visual Action Prompts.
CoRR, August, 2025

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models.
CoRR, July, 2025

SpatialTrackerV2: 3D Point Tracking Made Easy.
CoRR, July, 2025

Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation.
CoRR, July, 2025

FreeTimeGS: Free Gaussian Primitives at Anytime and Anywhere for Dynamic Scene Reconstruction.
CoRR, June, 2025

HumanRAM: Feed-forward Human Reconstruction and Animation Model using Transformers.
CoRR, June, 2025

MASH: Masked Anchored SpHerical Distances for 3D Shape Representation and Generation.
CoRR, April, 2025

BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation.
CoRR, April, 2025

MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space.
CoRR, March, 2025

Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture.
CoRR, March, 2025

Acquisition through My Eyes and Steps: A Joint Predictive Agent Model in Egocentric Worlds.
CoRR, February, 2025

MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training.
CoRR, January, 2025

Dyn-E: Local appearance editing of dynamic neural radiance fields.
Comput. Graph., 2025

UniRestore3D: A Scalable Framework For General Shape Restoration.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

EnvGS: Modeling View-Dependent Appearance with Environment Gaussian.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Reconstructing Humans with a Biomechanically Accurate Skeleton.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Glossy Object Reconstruction with Cost-effective Polarized Acquisition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Multi-view Reconstruction via SfM-guided Monocular Depth Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Representing Long Volumetric Video with Temporal Gaussian Hierarchy.
ACM Trans. Graph., December, 2024

NeuralRecon: Real-Time Coherent 3D Scene Reconstruction From Monocular Video.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

PlaneStereo: Plane-aware Multi-view Stereo.
Mach. Intell. Res., December, 2024

Efficient High-Quality Vectorized Modeling of Large-Scale Scenes.
Int. J. Comput. Vis., October, 2024

Neural 3D Scene Reconstruction With Indoor Planar Priors.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Animatable Implicit Neural Representations for Creating Realistic Avatars From Videos.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Motion-2-to-3: Leveraging 2D Motion Data to Boost 3D Motion Generation.
CoRR, 2024

AniDress: Animatable Loose-Dressed Avatar from Sparse Views Using Garment Rigging Model.
CoRR, 2024

Street Gaussians for Modeling Dynamic Urban Scenes.
CoRR, 2024

World-Grounded Human Motion Recovery via Gravity-View Coordinates.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

MaPa: Text-driven Photorealistic Material Painting for 3D Shapes.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

Neural Polynomial Gabor Fields for Macro Motion Analysis.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting.
Proceedings of the Computer Vision - ECCV 2024, 2024

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance.
Proceedings of the Computer Vision - ECCV 2024, 2024

SAM-Guided Graph Cut for 3D Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination.
Proceedings of the Computer Vision - ECCV 2024, 2024

SpatialTracker: Tracking Any 2D Pixels in 3D Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EfficientDreamer: High-Fidelity and Stable 3D Creation via Orthogonal-view Diffusion Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Detector-Free Structure from Motion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Generating Human Motion in 3D Scenes from Text Descriptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

4K4D: Real-Time 4D View Synthesis at 4K Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Relightable and Animatable Neural Avatar from Sparse-View Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
NerfCap: Human Performance Capture With Dynamic Neural Radiance Fields.
IEEE Trans. Vis. Comput. Graph., December, 2023

Reconstructing Close Human Interactions from Multiple Views.
ACM Trans. Graph., December, 2023

EasyHeC: Accurate and Automatic Hand-Eye Calibration Via Differentiable Rendering and Space Exploration.
IEEE Robotics Autom. Lett., November, 2023

Implicit Neural Representations With Structured Latent Codes for Human Body Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Semi-Dense Feature Matching With Transformers and its Applications in Multiple-View Geometry.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes.
CoRR, 2023

PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes.
CoRR, 2023

Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields.
CoRR, 2023

Ponder: Point Cloud Pre-training via Neural Rendering.
CoRR, 2023

EasyVolcap: Accelerating Neural Volumetric Video Research.
Proceedings of the SIGGRAPH Asia 2023 Technical Communications, 2023

High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Compact Neural Volumetric Video Representations with Dynamic Codebooks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Perceiving Unseen 3D Objects by Poking the Objects.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Deep Active Contours for Real-time 6-DoF Object Tracking.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Ponder: Point Cloud Pre-training via Neural Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Painting 3D Nature in 2D: View Synthesis of Natural Scenes from a Single Semantic Mask.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Long-Term Visual Localization with Mobile Sensors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AutoRecon: Automated 3D Object Discovery and Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Human Mesh Recovery in 3D Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Representing Volumetric Videos as Dynamic MLP Maps.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Scene Chronology.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

TensoIR: Tensorial Inverse Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Neural Volumetric Representations of Dynamic Humans in Minutes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Semantic keypoint-based pose estimation from single RGB frames.
Field Robotics, March, 2022

PVNet: Pixel-Wise Voting Network for 6DoF Object Pose Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Fast and Robust Multi-Person 3D Pose Estimation and Tracking From Multiple Views.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Shape Prior Guided Instance Disparity Estimation for 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

iMoCap: Motion Capture from Internet Videos.
Int. J. Comput. Vis., 2022

PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular Videos.
CoRR, 2022

Animatable Neural Implicit Surfaces for Creating Avatars from Videos.
CoRR, 2022

Efficient Neural Radiance Fields for Interactive Free-viewpoint Video.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

Reconstructing Hand-Held Objects from Monocular Video.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

QuickPose: Real-time Multi-view Multi-person Pose Estimation in Crowded Scenes.
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022

Neural 3D Reconstruction in the Wild.
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022

Novel View Synthesis of Human Interactions from Sparse Multi-view Videos.
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022

OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

TotalSelfScan: Learning Full-body Avatars from Self-Portrait Videos of Faces, Hands, and Bodies.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning to Estimate Object Poses without Real Image Annotations.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Ray Priors through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Modeling Indirect Illumination for Inverse Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

PlanarRecon: Realtime 3D Plane Detection and Reconstruction from Posed Monocular Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OnePose: One-Shot Object Pose Estimation without CAD Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Rays for Occlusion-aware Image-based Rendering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural 3D Scene Reconstruction with the Manhattan-world Assumption.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Visual Sound Localization in the Wild by Cross-Modal Interference Erasing.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation.
Proceedings of the International Conference on 3D Vision, 2022

2021
Towards efficient and photorealistic 3D human reconstruction: A brief survey.
Vis. Informatics, 2021

Efficient Neural Radiance Fields with Learned Depth-Guided Sampling.
CoRR, 2021

Animatable Neural Radiance Fields for Human Body Modeling.
CoRR, 2021

The present and future of mixed reality in China.
Commun. ACM, 2021

SuperPlane: 3D Plane Detection and Description from a Single Image.
Proceedings of the IEEE Virtual Reality and 3D User Interfaces, 2021

You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

NeuralRecon: Real-Time Coherent 3D Reconstruction From Monocular Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

LoFTR: Detector-Free Local Feature Matching With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

VS-Net: Voting With Segmentation for Visual Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Reconstructing 3D Human Pose by Watching Humans in the Mirror.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
A survey on monocular 3D human pose estimation.
Virtual Real. Intell. Hardw., 2020

NIID-Net: Adapting Surface Normal Knowledge for Intrinsic Image Decomposition in Indoor Scenes.
IEEE Trans. Vis. Comput. Graph., 2020

EllipBody: A Light-weight and Part-based Representation for Human Pose and Shape Recovery.
CoRR, 2020

Deep Snake for Real-Time Instance Segmentation.
CoRR, 2020

Monocular Human Pose and Shape Reconstruction using Part Differentiable Rendering.
Comput. Graph. Forum, 2020

Learning Hybrid Representations for Automatic 3D Vessel Centerline Extraction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

A Low-Rank Matrix Approximation Approach to Multiway Matching with Applications in Multi-Sensory Data Association.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

SMAP: Single-Shot Multi-person Absolute 3D Pose Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Feature Descriptors Using Camera Pose Supervision.
Proceedings of the Computer Vision - ECCV 2020, 2020

Motion Capture from Internet Videos.
Proceedings of the Computer Vision - ECCV 2020, 2020

Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Deep Snake for Real-Time Instance Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Coherent Reconstruction of Multiple Humans From a Single Image.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Prior Guided Dropout for Robust Visual Localization in Dynamic Environments.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Path-Invariant Map Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Extreme Relative Pose Estimation for RGB-D Scans via Scene Completion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

PVNet: Pixel-Wise Voting Network for 6DoF Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Transformation Synchronization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Human Motion Capture Using a Drone.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Polar Transformer Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Multi-Image Semantic Matching by Mining Consistent Features.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning to Estimate 3D Human Pose and Shape From a Single Color Image.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Ordinal Depth Supervision for 3D Human Pose Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Sparse Representation for 3D Shape Estimation: A Convex Relaxation Approach.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

LLR: a latent low-rank approach to colocalizing genetic risk variants in multiple GWAS.
Bioinform., 2017

6-DoF object pose from semantic keypoints.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Distributed consistent data association via permutation synchronization.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Fast Multi-image Matching via Density-Based Clustering.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Distributed Consistent Data Association.
CoRR, 2016

Articulated motion estimation from a monocular image sequence using spherical tangent bundles.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

A Survey on Rotation Optimization in Structure from Motion.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

2015
Pose and Shape Estimation with Discriminatively Learned Parts.
CoRR, 2015

Single Image Pop-Up from Discriminatively Learned Parts.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Multi-image Matching via Fast Alternating Minimization.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

3D shape estimation from 2D landmarks: A convex relaxation approach.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Low-Rank Modeling and Its Applications in Image Analysis.
ACM Comput. Surv., 2014

3D Shape Reconstruction from 2D Landmarks: A Convex Formulation.
CoRR, 2014

Piecewise-constant and low-rank approximation for identification of recurrent copy number variations.
Bioinform., 2014

2013
Multisample aCGH Data Analysis via Total Variation and Spectral Regularization.
IEEE ACM Trans. Comput. Biol. Bioinform., 2013

Moving Object Detection by Detecting Contiguous Outliers in the Low-Rank Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Active Contours with Group Similarity.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Automatic mitral leaflet tracking in echocardiography by outlier detection in the low-rank representation.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Identifying disease-associated SNP clusters via contiguous outlier detection.
Bioinform., 2011

2010
Accurate segmentation of ultrasound images using the motion cue.
Proceedings of the 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2010


  Loading...