Yue Wang

Affiliations:
  • University of Southern California, CA, USA
  • NVIDIA
  • Massachusetts Institute of Technology, Cambridge, MA, USA (Ph.D.)


According to our database1, Yue Wang authored at least 54 papers between 2019 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions.
CoRR, July, 2025

SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill Blending.
CoRR, June, 2025

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning.
CoRR, April, 2025

SIRE: SE(3) Intrinsic Rigidity Embeddings.
CoRR, March, 2025

DreamDrive: Generative 4D Scene Modeling from Street View Images.
CoRR, January, 2025

LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Language-Image Models with 3D Understanding.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OmniRe: Omni Urban Scene Reconstruction.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

STORM: Spatio-TempOral Reconstruction Model For Large-Scale Outdoor Scenes.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Learning Temporally Consistent Video Depth from Video Diffusion Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Learning from Massive Human Videos for Universal Humanoid Pose Control.
CoRR, 2024

Extrapolated Urban View Synthesis Benchmark.
CoRR, 2024

InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models.
CoRR, 2024

Large Spatial Model: End-to-end Unposed Images to Semantic 3D.
CoRR, 2024

InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds.
CoRR, 2024

Parallelized Spatiotemporal Binding.
CoRR, 2024

Denoising Vision Transformers.
CoRR, 2024

StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map Construction.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Memorize What Matters: Emergent Scene Decomposition from Multitraverse.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Augmenting Lane Perception and Topology Understanding with Standard Definition Navigation Maps.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Parallelized Spatiotemporal Slot Binding for Videos.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Denoising Vision Transformers.
Proceedings of the Computer Vision - ECCV 2024, 2024

PARA-Drive: Parallelized Architecture for Real-Time Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Realistic Scene Generation with LiDAR Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Driving Everywhere with Large Language Model Policy Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving.
Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

Q-SLAM: Quadric Representations for Monocular SLAM.
Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

2023
Rethinking Directional Integration in Neural Radiance Fields.
CoRR, 2023

A Language Agent for Autonomous Driving.
CoRR, 2023

FreeNeRF: Improving Few-Shot Neural Rendering with Free Frequency Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neural Map Prior for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FUTR3D: A Unified Sensor Fusion Framework for 3D Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Learning 3D Representations from Data
PhD thesis, 2022

VectorMapNet: End-to-end Vectorized HD Map Learning.
CoRR, 2022

HDMapNet: An Online HD Map Construction and Evaluation Framework.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Representation Learning for Object Detection from Unlabeled Point Cloud Sequences.
Proceedings of the Conference on Robot Learning, 2022

2021
Improving Multi-Modal Learning with Uni-Modal Teachers.
CoRR, 2021

Object DGCNN: 3D Object Detection using Dynamic Graphs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Few-shot Learning with Online Self-Distillation.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

On Feature Decorrelation in Self-Supervised Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries.
Proceedings of the Conference on Robot Learning, 8-11 November 2021, London, UK., 2021

2020
Multi-Frame to Single-Frame: Knowledge Distillation for 3D Object Detection.
CoRR, 2020

Pillar-Based Object Detection for Autonomous Driving.
Proceedings of the Computer Vision - ECCV 2020, 2020

Rethinking Few-Shot Image Classification: A Good Embedding is All You Need?
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Dynamic Graph CNN for Learning on Point Clouds.
ACM Trans. Graph., 2019

PRNet: Self-Supervised Learning for Partial-to-Partial Registration.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Deep Closest Point: Learning Representations for Point Cloud Registration.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...