We stand with Ukraine

We stand with Ukraine

Hangjun Ye

Orcid: 0009-0001-9570-7927

According to our database¹, Hangjun Ye authored at least 51 papers between 2003 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

PerlAD: Towards Enhanced Closed-Loop End-to-End Autonomous Driving With Pseudo-Simulation-Based Reinforcement Learning.

[DOI]

,

,

,

,

,

,

,

,

,

,

IEEE Robotics Autom. Lett., May, 2026

LVDrive: Latent Visual Representation Enhanced Vision-Language-Action Autonomous Driving Model.

[DOI]

,

,

,

,

,

CoRR, May, 2026

Beyond Imitation: Learning Safe End-to-End Autonomous Driving from Hard Negatives.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

RotVLA: Rotational Latent Action for Vision-Language-Action Model.

[DOI]

,

,

,

,

,

,

,

CoRR, May, 2026

PointForward: Feedforward Driving Reconstruction through Point-Aligned Representations.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

Thinking in Text and Images: Interleaved Vision-Language Reasoning Traces for Long-Horizon Robot Manipulation.

[DOI]

,

,

,

,

,

,

,

,

CoRR, May, 2026

Walk With Me: Long-Horizon Social Navigation for Human-Centric Outdoor Assistance.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, April, 2026

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, April, 2026

XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, April, 2026

DriveVA: Video Action Models are Zero-Shot Drivers.

[DOI]

,

,

,

,

,

,

,

Michael Ying Yang

,

,

CoRR, April, 2026

UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, April, 2026

Toward Physically Consistent Driving Video World Models under Challenging Trajectories.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2026

Learning from Mistakes: Post-Training for Driving VLA with Takeover Data.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, March, 2026

LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2026

Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, February, 2026

SEF-MAP: Subspace-Decomposed Expert Fusion for Robust Multimodal HD Map Prediction.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, February, 2026

UFO: Unifying Feed-Forward and Optimization-based Methods for Large Driving Scene Modeling.

[DOI]

,

,

,

,

,

,

,

CoRR, February, 2026

VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving.

[DOI]

,

,

,

,

,

,

CoRR, February, 2026

From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection.

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, February, 2026

MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Autonomous Driving.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, February, 2026

DriveFine: Refining-Augmented Masked Diffusion VLA for Precise and Robust Driving.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, February, 2026

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, February, 2026

DriveWorld-VLA: Unified Latent-Space World Modeling with Vision-Language-Action for Autonomous Driving.

[DOI]

,

,

,

,

,

,

CoRR, February, 2026

SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning.

[DOI]

,

,

,

,

,

,

,

,

CoRR, January, 2026

Pixel-Perfect Visual Geometry Estimation.

[DOI]

,

,

,

,

,

,

,

,

CoRR, January, 2026

ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, January, 2026

Dichotomous Diffusion Policy Optimization.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, January, 2026

LGNet: Explicit local-global feature modeling for cloud removal.

[DOI]

,

,

,

,

,

,

,

Appl. Soft Comput., 2026

2025

Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes.

[DOI]

,

,

,

,

CoRR, December, 2025

DriveLaW:Unifying Planning and Video Generation in a Latent Driving World.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, December, 2025

TrajMoE: Scene-Adaptive Trajectory Planning with Mixture of Experts and Reinforcement Learning.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, December, 2025

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, December, 2025

SimScale: Learning to Drive via Real-World Simulation at Scale.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, November, 2025

MiMo-Embodied: X-Embodied Foundation Model Technical Report.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, November, 2025

Is Your VLM for Autonomous Driving Safety-Ready? A Comprehensive Benchmark for Evaluating External and In-Cabin Risks.

[DOI]

,

,

,

,

,

,

,

Guangfeng Jiang

,

,

,

,

,

,

CoRR, November, 2025

Is your VLM Sky-Ready? A Comprehensive Spatial Intelligence Benchmark for UAV Navigation.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, November, 2025

RoboAfford++: A Generative AI-Enhanced Dataset for Multimodal Affordance Learning in Robotic Manipulation and Navigation.

[DOI]

,

,

,

,

,

,

,

,

CoRR, November, 2025

SocialNav-Map: Dynamic Mapping with Human Trajectory Prediction for Zero-Shot Social Navigation.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, November, 2025

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

ViSE: A Systematic Approach to Vision-Only Street-View Extrapolation.

[DOI]

,

,

,

,

,

CoRR, October, 2025

Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception - Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

Team Xiaomi EV-AD VLA: Caption-Guided Retrieval System for Cross-Modal Drone Navigation - Technical Report for IROS 2025 RoboSense Challenge Track 4.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving.

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

ExtraGS: Geometric-Aware Trajectory Extrapolation with Uncertainty-Guided Generative Priors.

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

DriveMRP: Enhancing Vision-Language Models with Synthetic Motion Data for Motion Risk Prediction.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, July, 2025

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

2017

Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning.

[DOI]

,

CoRR, 2017

2003

Similarity measure learning for image retrieval using binary component discriminating function.

[DOI]

,

Proceedings of the 2003 International Conference on Image Processing, 2003

Fast Search in Large-Scale Image Database Using Vector Quantization.

[DOI]

,

Proceedings of the Image and Video Retrieval, Second International Conference, 2003

Loading...