Haoang Li

Orcid: 0000-0002-1576-9408

According to our database1, Haoang Li authored at least 85 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Efficient Feature-Free Initialization for Monocular Visual-Inertial Systems Using a Feed-Forward 3D Model.
CoRR, May, 2026

SEDualVLN: A Spatially-Enhanced Dual-System for Vision-Language Navigation.
CoRR, May, 2026

HCSG: Human-Centric Semantic-Geometric Reasoning for Vision-Language Navigation.
CoRR, May, 2026

RoboMemArena: A Comprehensive and Challenging Robotic Memory Benchmark.
CoRR, May, 2026

CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models.
CoRR, May, 2026

Robust Energy-Aware Routing for Air-Ground Cooperative Multi-UAV Delivery in Wind-Uncertain Environments.
CoRR, April, 2026

VLA-OPD: Bridging Offline SFT and Online RL for Vision-Language-Action Models via On-Policy Distillation.
CoRR, March, 2026

DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching.
CoRR, March, 2026

Fast-dVLA: Accelerating Discrete Diffusion VLA to Real-Time Performance.
CoRR, March, 2026

DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models.
CoRR, March, 2026

DyGeoVLN: Infusing Dynamic Geometry Foundation Model into Vision-Language Navigation.
CoRR, March, 2026

VAMPO: Policy Optimization for Improving Visual Dynamics in Video Action Models.
CoRR, March, 2026

AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation.
CoRR, March, 2026

P<sup>3</sup>Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation.
CoRR, March, 2026

S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight.
CoRR, March, 2026

A Unified Calibration Framework for Coordinate and Kinematic Parameters in Dual-Arm Robots.
CoRR, March, 2026

Global Truncated Loss Minimization for Robust and Threshold-Resilient Geometric Estimation.
CoRR, March, 2026

RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer.
CoRR, March, 2026

OmniDP: Beyond-FOV Large-Workspace Humanoid Manipulation with Omnidirectional 3D Perception.
CoRR, March, 2026

Enhancing Vision-Language Navigation with Multimodal Event Knowledge from Real-World Indoor Tour Videos.
CoRR, February, 2026

Rethinking the Practicality of Vision-language-action Model: A Comprehensive Benchmark and An Improved Baseline.
CoRR, February, 2026

Dream-SLAM: Dreaming the Unseen for Active SLAM in Dynamic Environments.
CoRR, February, 2026

FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment.
CoRR, February, 2026

Advances in Global Solvers for 3D Vision.
CoRR, February, 2026

Neural Predictor-Corrector: Solving Homotopy Problems with Reinforcement Learning.
CoRR, February, 2026

SCSV: Spatial-Temporal Consistent Dynamic 3D Scene Generation From Sparse Views.
IEEE Trans. Image Process., 2026

ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives.
CoRR, December, 2025

Open-world Hand-Object Interaction Video Generation Based on Structure and Contact-aware Representation.
CoRR, December, 2025

IRPO: Boosting Image Restoration via Post-training GRPO.
CoRR, December, 2025

Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process.
CoRR, November, 2025

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model.
CoRR, October, 2025

Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey.
CoRR, October, 2025

SLAM-Free Visual Navigation with Hierarchical Vision-Language Perception and Coarse-to-Fine Semantic Topological Planning.
CoRR, September, 2025

Embracing Bulky Objects with Humanoid Robots: Whole-Body Manipulation with Reinforcement Learning.
CoRR, September, 2025

G2-SDF: Geometry-Guided Neural Signed Distance Fields for Scalable and Detailed Reconstruction.
IEEE Robotics Autom. Lett., August, 2025

FlowVLA: Thinking in Motion with a Visual Chain of Thought.
CoRR, August, 2025

CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.
CoRR, June, 2025

RationalVLA: A Rational Vision-Language-Action Model with Dual System.
CoRR, June, 2025

Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding.
CoRR, March, 2025

San Francisco World: Leveraging Structural Regularities of Slope for 3-DoF Visual Compass.
IEEE Robotics Autom. Lett., January, 2025

Place Recognition for Place: Toward Point Cloud Descriptor Stability Across Viewpoints.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2025

EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

L2COcc: Lightweight Camera-Centric Semantic Scene Completion via Distillation of LiDAR Model.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

PD-VLA: Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

RoboDexVLM: Visual Language Model-Enabled Task Planning and Motion Control for Dexterous Robot Manipulation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

RMG: Real-Time Expressive Motion Generation with Self-collision Avoidance for 6-DOF Companion Robotic Arms.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

STG-Avatar: Animatable Human Avatars via Spacetime Gaussian.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Interactive Navigation for Legged Manipulators with Learned Arm-Pushing Controller.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Efficient and Robust Point Cloud Registration via Heuristics-Guided Parameter Search.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair.
CoRR, 2024

PG-SLAM: Photo-realistic and Geometry-aware RGB-D SLAM in Dynamic Environments.
CoRR, 2024

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition.
CoRR, 2024

Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Structured-NeRF: Hierarchical Scene Graph with Neural Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024

GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Hong Kong World: Leveraging Structural Regularity for Line-Based SLAM.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

DDIT: Semantic Scene Completion via Deformable Deep Implicit Templates.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Accurate 3D Shape Based on Stereo Polarimetric Imaging.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Integrated Task Allocation and Path Coordination for Large-Scale Robot Networks With Uncertainties.
IEEE Trans Autom. Sci. Eng., 2022

Quasi-Globally Optimal and Real-Time Visual Compass in Manhattan Structured Environments.
IEEE Robotics Autom. Lett., 2022

Quasi-Globally Optimal and Near/True Real-Time Vanishing Point Estimation in Manhattan World.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deterministic Point Cloud Registration via Novel Transformation Decomposition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Task-Space Trajectory Tracking Control for Coordinated Manipulation Using Sampled Coupling Data.
IEEE Robotics Autom. Lett., 2021

Learning Icosahedral Spherical Probability Map Based on Bingham Mixture Model for Vanishing Point Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning To Identify Correct 2D-2D Line Correspondences on Sphere.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Robust Estimation of Absolute Camera Pose via Intersection Constraint and Flow Consensus.
IEEE Trans. Image Process., 2020

Deep Learning-Based Localization and Perception Systems: Approaches for Autonomous Cargo Transportation Vehicles in Large-Scale, Semiclosed Environments.
IEEE Robotics Autom. Mag., 2020

End-to-End 3D Point Cloud Learning for Registration Task Using Virtual Correspondences.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

CUHK-AHU Dataset: Promoting Practical Self-Driving Applications in the Complex Airport Logistics, Hill and Urban Environments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

A Synchronization Approach for Achieving Cooperative Adaptive Cruise Control Based Non-Stop Intersection Passing.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Robust and Efficient Estimation of Absolute Camera Pose for Monocular Visual Odometry.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Globally Optimal and Efficient Vanishing Point Estimation in Atlanta World.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Line-based Absolute and Relative Camera Pose Estimation in Structured Environments.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

A Hierarchical Framework for Coordinating Large-Scale Robot Networks.
Proceedings of the International Conference on Robotics and Automation, 2019

Leveraging Structural Regularity of Atlanta World for Monocular SLAM.
Proceedings of the International Conference on Robotics and Automation, 2019

LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Quasi-Globally Optimal and Efficient Vanishing Point Estimation in Manhattan World.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Robust Camera Pose Estimation via Consensus on Ray Bundle and Vector Field.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

A Monocular SLAM System Leveraging Structural Regularity in Manhattan World.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

2017
Optimal seamline detection in dynamic scenes via graph cuts for image mosaicking.
Mach. Vis. Appl., 2017

2-Line Exhaustive Searching for Real-Time Vanishing Point Estimation in Manhattan World.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Combining points and lines for camera pose estimation and optimization in monocular visual odometry.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017


  Loading...