Hangjun Ye

According to our database1, Hangjun Ye authored at least 46 papers between 2003 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
PerlAD: Towards Enhanced Closed-Loop End-to-End Autonomous Driving With Pseudo-Simulation-Based Reinforcement Learning.
IEEE Robotics Autom. Lett., May, 2026

Walk With Me: Long-Horizon Social Navigation for Human-Centric Outdoor Assistance.
CoRR, April, 2026

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation.
CoRR, April, 2026

XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments.
CoRR, April, 2026

DriveVA: Video Action Models are Zero-Shot Drivers.
CoRR, April, 2026

UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving.
CoRR, April, 2026

Toward Physically Consistent Driving Video World Models under Challenging Trajectories.
CoRR, March, 2026

Learning from Mistakes: Post-Training for Driving VLA with Takeover Data.
CoRR, March, 2026

LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving.
CoRR, March, 2026

Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving.
CoRR, February, 2026

SEF-MAP: Subspace-Decomposed Expert Fusion for Robust Multimodal HD Map Prediction.
CoRR, February, 2026

UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene Modeling.
CoRR, February, 2026

VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving.
CoRR, February, 2026

From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection.
CoRR, February, 2026

MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Autonomous Driving.
CoRR, February, 2026

DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving.
CoRR, February, 2026

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution.
CoRR, February, 2026

DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving.
CoRR, February, 2026

SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning.
CoRR, January, 2026

Pixel-Perfect Visual Geometry Estimation.
CoRR, January, 2026

ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking.
CoRR, January, 2026

Dichotomous Diffusion Policy Optimization.
CoRR, January, 2026

LGNet: Explicit local-global feature modeling for cloud removal.
Appl. Soft Comput., 2026

2025
Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes.
CoRR, December, 2025

DriveLaW:Unifying Planning and Video Generation in a Latent Driving World.
CoRR, December, 2025

TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning.
CoRR, December, 2025

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images.
CoRR, December, 2025

SimScale: Learning to Drive via Real-World Simulation at Scale.
CoRR, November, 2025

MiMo-Embodied: X-Embodied Foundation Model Technical Report.
CoRR, November, 2025

Is Your VLM for Autonomous Driving Safety-Ready? A Comprehensive Benchmark for Evaluating External and In-Cabin Risks.
CoRR, November, 2025

Is your VLM Sky-Ready? A Comprehensive Spatial Intelligence Benchmark for UAV Navigation.
CoRR, November, 2025

RoboAfford++: A Generative AI-Enhanced Dataset for Multimodal Affordance Learning in Robotic Manipulation and Navigation.
CoRR, November, 2025

SocialNav-Map: Dynamic Mapping with Human Trajectory Prediction for Zero-Shot Social Navigation.
CoRR, November, 2025

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks.
CoRR, October, 2025

ViSE: A Systematic Approach to Vision-Only Street-View Extrapolation.
CoRR, October, 2025

Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track.
CoRR, October, 2025

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers.
CoRR, October, 2025

Team Xiaomi EV-AD VLA: Caption-Guided Retrieval System for Cross-Modal Drone Navigation - Technical Report for IROS 2025 RoboSense Challenge Track 4.
CoRR, October, 2025

WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving.
CoRR, September, 2025

ExtraGS: Geometric-Aware Trajectory Extrapolation with Uncertainty-Guided Generative Priors.
CoRR, August, 2025

DriveMRP: Enhancing Vision-Language Models with Synthetic Motion Data for Motion Risk Prediction.
CoRR, July, 2025

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving.
CoRR, June, 2025

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency.
CoRR, June, 2025

2017
Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning.
CoRR, 2017

2003
Similarity measure learning for image retrieval using binary component discriminating function.
Proceedings of the 2003 International Conference on Image Processing, 2003

Fast Search in Large-Scale Image Database Using Vector Quantization.
Proceedings of the Image and Video Retrieval, Second International Conference, 2003


  Loading...