Lan Xu

Orcid: 0000-0002-8807-7787

Affiliations:
  • ShanghaiTech University, Shanghai, China


According to our database, Lan Xu authored at least 143 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
GraSP: Graph-Structured Skill Compositions for LLM Agents.
CoRR, April, 2026

Strips as Tokens: Artist Mesh Generation with Native UV Segmentation.
CoRR, April, 2026

Director: Instance-aware Gaussian Splatting for Dynamic Scene Modeling and Understanding.
CoRR, April, 2026

FlashCap: Millisecond-Accurate Human Motion Capture via Flashing LEDs and Event-Based Vision.
CoRR, March, 2026

TAPESTRY: From Geometry to Appearance via Consistent Turntable Videos.
CoRR, March, 2026

ManiTwin: Scaling Data-Generation-Ready Digital Object Dataset to 100K.
CoRR, March, 2026

Towards Motion Turing Test: Evaluating Human-Likeness in Humanoid Robots.
CoRR, March, 2026

A general framework for Gaussian Splatting-based human-centric volumetric videos.
Vis. Intell., 2026

Artificial intelligence for virtual reality: a review.
Sci. China Inf. Sci., 2026

2025
WildCap: Facial Appearance Capture in the Wild via Hybrid Inverse Rendering.
CoRR, December, 2025

InterAgent: Physics-based Multi-agent Command Execution via Diffusion on Interaction Graphs.
CoRR, December, 2025

Kinematify: Open-Vocabulary Synthesis of High-DoF Articulated Objects.
CoRR, November, 2025

PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding.
CoRR, October, 2025

4DGCPro: Efficient Hierarchical 4D Gaussian Compression for Progressive Volumetric Video Streaming.
CoRR, September, 2025

4D-MoDe: Towards Editable and Scalable Volumetric Streaming via Motion-Decoupled 4D Gaussian Compression.
CoRR, September, 2025

BANG: Dividing 3D Assets via Generative Exploded Dynamics.
ACM Trans. Graph., August, 2025

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image.
ACM Trans. Graph., August, 2025

Facial Appearance Capture at Home with Patch-Level Reflectance Prior.
ACM Trans. Graph., August, 2025

Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics.
CoRR, March, 2025

Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens.
CoRR, February, 2025

BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video.
CoRR, February, 2025

TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints.
CoRR, February, 2025

CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians.
Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

Topology-Aware Optimization of Gaussian Primitives for Human-Centric Volumetric Videos.
Proceedings of the SIGGRAPH Asia 2025 Conference Papers, 2025

Dynamic Gaussian Streams for Volumetric Video via Codebook-Based Quantization.
Proceedings of the IEEE International Workshop on Multimedia Signal Processing, 2025

Generating 3D Hair Strands from Images with Diverse Styles and Viewpoints.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

THOR: Text to Human-Object Interaction Diffusion via Relation Intervention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

SMGDiff: Soccer Motion Generation using Diffusion Probabilistic Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

SCOPE: Sign Language Contextual Processing with Embedding from LLMs.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

HeadGAP: Few-Shot 3D Head Avatar via Generalizable Gaussian Priors.
Proceedings of the International Conference on 3D Vision, 2025

2024
V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians.
ACM Trans. Graph., December, 2024

Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos.
ACM Trans. Graph., December, 2024

LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives.
ACM Trans. Graph., December, 2024

HiSC4D: Human-Centered Interaction and 4D Scene Capture in Large-Scale Space Using Wearable IMUs and LiDAR.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

InterGen: Diffusion-Based Multi-human Motion Generation Under Complex Interactions.
Int. J. Comput. Vis., September, 2024

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets.
ACM Trans. Graph., July, 2024

Implicit Swept Volume SDF: Enabling Continuous Collision-Free Trajectory Generation for Arbitrary Shapes.
ACM Trans. Graph., July, 2024

DressCode: Autoregressively Sewing and Generating Garments from Text Guidance.
ACM Trans. Graph., July, 2024

LiDARCapV2: 3D human pose estimation with human-object interaction from LiDAR point clouds.
Pattern Recognit., 2024

MP-HAR: A Novel Motion-Powered Real-Time Human Activity Recognition System.
IEEE Internet Things J., 2024

LLaVA-SLT: Visual Language Tuning for Sign Language Translation.
CoRR, 2024

CADSpotting: Robust Panoptic Symbol Spotting on Large-Scale CAD Drawings.
CoRR, 2024

AerialGo: Walking-through City View Generation from Aerial Perspectives.
CoRR, 2024

Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering.
CoRR, 2024

LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives.
CoRR, 2024

HOI-M3: Capture Multiple Humans and Objects Interaction within Contextual Environment.
CoRR, 2024

Gaze-guided Hand-Object Interaction Synthesis: Benchmark and Method.
CoRR, 2024

GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians.
CoRR, 2024

IMUSIC: IMU-based Facial Expression Capture.
CoRR, 2024

Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

Sophia-in-Audition: Virtual Production with a Robot Performer.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

HmPEAR: A Dataset for Human Pose Estimation and Action Recognition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Gait Recognition in Large-scale Free Environment via Single LiDAR.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

I'M HOI: Inertia-Aware Monocular Capture of 3D Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HOI-M^3: Capture Multiple Humans and Objects Interaction within Contextual Environment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LiveHPS: LiDAR-Based Scene-Level Human Pose and Shape Estimation in Free Environment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

OMG: Towards Open-vocabulary Motion Generation via Mixture of Controllers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HACK: Learning a Parametric Head and Neck Model for High-fidelity Animation.
ACM Trans. Graph., August, 2023

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance.
ACM Trans. Graph., August, 2023

RobustFusion: Robust Volumetric Performance Reconstruction Under Human-Object Interactions From Monocular RGBD Stream.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Free-view Face Relighting Using a Hybrid Parametric Neural Model on a SMALL-OLAT Dataset.
Int. J. Comput. Vis., April, 2023

LiDAR-aid Inertial Poser: Large-scale Human Motion Capture by Sparse Inertial and LiDAR Sensors.
IEEE Trans. Vis. Comput. Graph., 2023

MVHuman: Tailoring 2D Diffusion with Multi-view Sampling For Realistic 3D Human Generation.
CoRR, 2023

HandDiffuse: Generative Controllers for Two-Hand Interactions via Diffusion Models.
CoRR, 2023

CryoFormer: Continuous Reconstruction of 3D Structures from Cryo-EM Data using Transformer-based Neural Representations.
CoRR, 2023

NEPHELE: A Neural Platform for Highly Realistic Cloud Radiance Rendering.
CoRR, 2023

ChatAvatar: Creating Hyper-realistic Physically-based 3D Facial Assets through AI-Driven Conversations.
Proceedings of the ACM SIGGRAPH 2023 Real-Time Live!, 2023

Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

NeuralGiga: Neural Giga-Image Representation with Anti-Aliasing and Continuous Viewing.
Proceedings of the 49th Annual Conference of the IEEE Industrial Electronics Society, 2023

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Rendering.
Proceedings of the IEEE International Conference on Computational Photography, 2023

Relightable Neural Human Assets from Multi-view Gradient Illuminations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CIMI4D: A Large Multimodal Climbing Motion Dataset under Human-scene Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Instant-NVR: Instant Neural Volumetric Rendering for Human-object Interactions from Monocular RGBD Stream.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

IKOL: Inverse Kinematics Optimization Layer for 3D Human Pose and Shape Estimation via Gauss-Newton Differentiation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

HybridCap: Inertia-Aid Monocular Capture of Challenging Human Motions.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Weakly Supervised 3D Multi-Person Pose Estimation for Large-Scale Scenes Based on Monocular Camera and Single LiDAR.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Human Performance Modeling and Rendering via Neural Animated Mesh.
ACM Trans. Graph., 2022

Video-Driven Neural Physically-Based Facial Asset for Production.
ACM Trans. Graph., 2022

SCULPTOR: Skeleton-Consistent Face Creation Using a Learned Parametric Generator.
ACM Trans. Graph., 2022

Artemis: articulated neural pets with appearance and motion synthesis.
ACM Trans. Graph., 2022

NIMBLE: a non-rigid hand model with bones and muscles.
ACM Trans. Graph., 2022

TightCap: 3D Human Shape Capture with Clothing Tightness Field.
ACM Trans. Graph., 2022

BuildingFusion: Semantic-Aware Structural Building-Scale 3D Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

LiCamGait: Gait Recognition in the Wild by Using LiDAR and Camera Multi-modal Visual Sensors.
CoRR, 2022

Learning Variational Motion Prior for Video-based Motion Capture.
CoRR, 2022

NARRATE: A Normal Assisted Free-View Portrait Stylizer.
CoRR, 2022

LiDARCap: Long-range Marker-less 3D Human Motion Capture with LiDAR Point Clouds.
CoRR, 2022

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Implicit Representation.
CoRR, 2022

NeuralFusion: Neural Volumetric Rendering under Human-object Interactions.
CoRR, 2022

NeuVV: Neural Volumetric Videos with Immersive Rendering and Editing.
CoRR, 2022

Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Fourier PlenOctrees for Dynamic Radiance Field Rendering in Real-time.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

LiDARCap: Long-range Markerless 3D Human Motion Capture with LiDAR Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

HSC4D: Human-centered 4D Scene Capture in Large-scale Indoor-outdoor Space Using Wearable IMUs and LiDAR.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Anisotropic Fourier Features for Neural Image-Based Rendering and Relighting.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
FlyFusion: Realtime Dynamic Scene Reconstruction Using a Flying Depth Camera.
IEEE Trans. Vis. Comput. Graph., 2021

Editable free-viewpoint video using a layered neural representation.
ACM Trans. Graph., 2021

Boosting Single Image Super-Resolution Learnt From Implicit Multi-Image Prior.
IEEE Trans. Image Process., 2021

SportsCap: Monocular 3D Human Motion Capture and Fine-Grained Understanding in Challenging Sports Videos.
Int. J. Comput. Vis., 2021

HumanNeRF: Generalizable Neural Human Radiance Field from Sparse Inputs.
CoRR, 2021

Relightable Neural Video Portrait.
CoRR, 2021

IREM: High-Resolution Magnetic Resonance (MR) Image Reconstruction via Implicit Neural Representation.
CoRR, 2021

MirrorNeRF: One-shot Neural Portrait RadianceField from Multi-mirror Catadioptric Imaging.
CoRR, 2021

Towards Controllable and Photorealistic Region-wise Image Manipulation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

IREM: High-Resolution Magnetic Resonance Image Reconstruction via Implicit Neural Representation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Few-shot Neural Human Performance Rendering from Sparse RGBD Videos.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

PIANO: A Parametric Hand Bone Model from Magnetic Resonance Imaging.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Neural Video Portrait Relighting in Real-time via Consistency Modeling.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

GNeRF: GAN-based Neural Radiance Field without Posed Camera.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MirrorNeRF: One-shot Neural Portrait Radiance Field from Multi-mirror Catadioptric Imaging.
Proceedings of the IEEE International Conference on Computational Photography, 2021

Convolutional Neural Opacity Radiance Fields.
Proceedings of the IEEE International Conference on Computational Photography, 2021

NeuralHumanFVV: Real-Time Neural Volumetric Human Performance Rendering Using RGB Cameras.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Live Semantic 3D Perception for Immersive Augmented Reality.
IEEE Trans. Vis. Comput. Graph., 2020

UnstructuredFusion: Realtime 4D Geometry and Texture Reconstruction Using Commercial RGBD Cameras.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Neural3D: Light-weight Neural Portrait Scanning via Context-aware Correspondence Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multiscale-VR: Multiscale Gigapixel 3D Panoramic Videography for Virtual Reality.
Proceedings of the 2020 IEEE International Conference on Computational Photography, 2020

RobustFusion: Human Volumetric Capture with Data-Driven Visual Cues Using a RGBD Camera.
Proceedings of the Computer Vision - ECCV 2020, 2020

EventCap: Monocular 3D Capture of High-Speed Human Motions Using an Event Camera.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

OccuSeg: Occupancy-Aware 3D Instance Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Real-Time Global Registration for Globally Consistent RGB-D SLAM.
IEEE Trans. Robotics, 2019

2018
FlyCap: Markerless Motion Capture Using Multiple Autonomous Flying Cameras.
IEEE Trans. Vis. Comput. Graph., 2018

iHuman3D: Intelligent Human Body 3D Reconstruction using a Single Flying Camera.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017
Beyond SIFT using binary features in Loop Closure Detection.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

