Lan Xu

Orcid: 0000-0002-8807-7787

Affiliations:
  • ShanghaiTech University, Shanghai, China


According to our database, Lan Xu authored at least 96 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
4DGCPro: Efficient Hierarchical 4D Gaussian Compression for Progressive Volumetric Video Streaming.
CoRR, September, 2025

4D-MoDe: Towards Editable and Scalable Volumetric Streaming via Motion-Decoupled 4D Gaussian Compression.
CoRR, September, 2025

Topology-Aware Optimization of Gaussian Primitives for Human-Centric Volumetric Videos.
CoRR, September, 2025

BANG: Dividing 3D Assets via Generative Exploded Dynamics.
ACM Trans. Graph., August, 2025

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image.
ACM Trans. Graph., August, 2025

Facial Appearance Capture at Home with Patch-Level Reflectance Prior.
ACM Trans. Graph., August, 2025

Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis.
CoRR, August, 2025

CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians.
CoRR, May, 2025

Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics.
CoRR, March, 2025

BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video.
CoRR, February, 2025

TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints.
CoRR, February, 2025

RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

HeadGAP: Few-Shot 3D Head Avatar via Generalizable Gaussian Priors.
Proceedings of the International Conference on 3D Vision, 2025

2024
V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians.
ACM Trans. Graph., December, 2024

Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos.
ACM Trans. Graph., December, 2024

LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives.
ACM Trans. Graph., December, 2024

InterGen: Diffusion-Based Multi-human Motion Generation Under Complex Interactions.
Int. J. Comput. Vis., September, 2024

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets.
ACM Trans. Graph., July, 2024

Implicit Swept Volume SDF: Enabling Continuous Collision-Free Trajectory Generation for Arbitrary Shapes.
ACM Trans. Graph., July, 2024

DressCode: Autoregressively Sewing and Generating Garments from Text Guidance.
ACM Trans. Graph., July, 2024

LLaVA-SLT: Visual Language Tuning for Sign Language Translation.
CoRR, 2024

CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings.
CoRR, 2024

AerialGo: Walking-through City View Generation from Aerial Perspectives.
CoRR, 2024

SMGDiff: Soccer Motion Generation using diffusion probabilistic models.
CoRR, 2024

LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives.
CoRR, 2024

HOI-M3: Capture Multiple Humans and Objects Interaction within Contextual Environment.
CoRR, 2024

Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method.
CoRR, 2024

THOR: Text to Human-Object Interaction Diffusion via Relation Intervention.
CoRR, 2024

GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians.
CoRR, 2024

Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

Sophia-in-Audition: Virtual Production with a Robot Performer.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Gait Recognition in Large-scale Free Environment via Single LiDAR.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

HOI-M^3: Capture Multiple Humans and Objects Interaction within Contextual Environment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HACK: Learning a Parametric Head and Neck Model for High-fidelity Animation.
ACM Trans. Graph., August, 2023

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance.
ACM Trans. Graph., August, 2023

RobustFusion: Robust Volumetric Performance Reconstruction Under Human-Object Interactions From Monocular RGBD Stream.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics.
CoRR, 2023

HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models.
CoRR, 2023

NEPHELE: A Neural Platform for Highly Realistic Cloud Radiance Rendering.
CoRR, 2023

ChatAvatar: Creating Hyper-realistic Physically-based 3D Facial Assets through AI-Driven Conversations.
Proceedings of the ACM SIGGRAPH 2023 Real-Time Live!, 2023

Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Rendering.
Proceedings of the IEEE International Conference on Computational Photography, 2023

Relightable Neural Human Assets from Multi-view Gradient Illuminations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

IKOL: Inverse Kinematics Optimization Layer for 3D Human Pose and Shape Estimation via Gauss-Newton Differentiation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Human Performance Modeling and Rendering via Neural Animated Mesh.
ACM Trans. Graph., 2022

Video-Driven Neural Physically-Based Facial Asset for Production.
ACM Trans. Graph., 2022

Artemis: articulated neural pets with appearance and motion synthesis.
ACM Trans. Graph., 2022

TightCap: 3D Human Shape Capture with Clothing Tightness Field.
ACM Trans. Graph., 2022

BuildingFusion: Semantic-Aware Structural Building-Scale 3D Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning Variational Motion Prior for Video-based Motion Capture.
CoRR, 2022

NARRATE: A Normal Assisted Free-View Portrait Stylizer.
CoRR, 2022

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Implicit Representation.
CoRR, 2022

NeuralFusion: Neural Volumetric Rendering under Human-object Interactions.
CoRR, 2022

NeuVV: Neural Volumetric Videos with Immersive Rendering and Editing.
CoRR, 2022

Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Fourier PlenOctrees for Dynamic Radiance Field Rendering in Real-time.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Anisotropic Fourier Features for Neural Image-Based Rendering and Relighting.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
FlyFusion: Realtime Dynamic Scene Reconstruction Using a Flying Depth Camera.
IEEE Trans. Vis. Comput. Graph., 2021

Editable free-viewpoint video using a layered neural representation.
ACM Trans. Graph., 2021

Boosting Single Image Super-Resolution Learnt From Implicit Multi-Image Prior.
IEEE Trans. Image Process., 2021

SportsCap: Monocular 3D Human Motion Capture and Fine-Grained Understanding in Challenging Sports Videos.
Int. J. Comput. Vis., 2021

HumanNeRF: Generalizable Neural Human Radiance Field from Sparse Inputs.
CoRR, 2021

Relightable Neural Video Portrait.
CoRR, 2021

IREM: High-Resolution Magnetic Resonance (MR) Image Reconstruction via Implicit Neural Representation.
CoRR, 2021

Towards Controllable and Photorealistic Region-wise Image Manipulation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

IREM: High-Resolution Magnetic Resonance Image Reconstruction via Implicit Neural Representation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Few-shot Neural Human Performance Rendering from Sparse RGBD Videos.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

PIANO: A Parametric Hand Bone Model from Magnetic Resonance Imaging.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Neural Video Portrait Relighting in Real-time via Consistency Modeling.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

GNeRF: GAN-based Neural Radiance Field without Posed Camera.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MirrorNeRF: One-shot Neural Portrait Radiance Field from Multi-mirror Catadioptric Imaging.
Proceedings of the IEEE International Conference on Computational Photography, 2021

Convolutional Neural Opacity Radiance Fields.
Proceedings of the IEEE International Conference on Computational Photography, 2021

NeuralHumanFVV: Real-Time Neural Volumetric Human Performance Rendering Using RGB Cameras.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Live Semantic 3D Perception for Immersive Augmented Reality.
IEEE Trans. Vis. Comput. Graph., 2020

UnstructuredFusion: Realtime 4D Geometry and Texture Reconstruction Using Commercial RGBD Cameras.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Neural3D: Light-weight Neural Portrait Scanning via Context-aware Correspondence Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multiscale-VR: Multiscale Gigapixel 3D Panoramic Videography for Virtual Reality.
Proceedings of the 2020 IEEE International Conference on Computational Photography, 2020

RobustFusion: Human Volumetric Capture with Data-Driven Visual Cues Using a RGBD Camera.
Proceedings of the Computer Vision - ECCV 2020, 2020

EventCap: Monocular 3D Capture of High-Speed Human Motions Using an Event Camera.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

OccuSeg: Occupancy-Aware 3D Instance Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Real-Time Global Registration for Globally Consistent RGB-D SLAM.
IEEE Trans. Robotics, 2019

2018
FlyCap: Markerless Motion Capture Using Multiple Autonomous Flying Cameras.
IEEE Trans. Vis. Comput. Graph., 2018

iHuman3D: Intelligent Human Body 3D Reconstruction using a Single Flying Camera.
Proceedings of the 2018 ACM Multimedia Conference, 2018

2017
Beyond SIFT using binary features in Loop Closure Detection.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

