Han Zhao

Orcid: 0000-0003-3948-0649

Affiliations:
  • Westlake University, Hangzhou, China
  • Zhejiang University, Hangzhou, China


According to our database1, Han Zhao authored at least 38 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
RoboMemArena: A Comprehensive and Challenging Robotic Memory Benchmark.
CoRR, May, 2026

CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models.
CoRR, May, 2026

Fast-dVLA: Accelerating Discrete Diffusion VLA to Real-Time Performance.
CoRR, March, 2026

MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation.
CoRR, March, 2026

VAMPO: Policy Optimization for Improving Visual Dynamics in Video Action Models.
CoRR, March, 2026

Rethinking the Practicality of Vision-language-action Model: A Comprehensive Benchmark and An Improved Baseline.
CoRR, February, 2026

FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment.
CoRR, February, 2026

CRL-VLA: Continual Vision-Language-Action Learning.
CoRR, February, 2026

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives.
CoRR, December, 2025

Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process.
CoRR, November, 2025

VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation.
CoRR, October, 2025

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model.
CoRR, October, 2025

Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey.
CoRR, October, 2025

CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.
CoRR, June, 2025

RationalVLA: A Rational Vision-Language-Action Model with Dual System.
CoRR, June, 2025

Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions.
CoRR, May, 2025

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation.
CoRR, May, 2025

Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding.
CoRR, March, 2025

A nearly optimal adaptive saturation function tuning method for quasi-sliding mode control based on integral reinforcement learning.
Neurocomputing, 2025

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

PD-VLA: Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Quart-Online: Latency-Free Multimodal Large Language Model for Quadruped Robot Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

VLAS: Vision-Language-Action Model with Speech Instructions for Customized Robot Manipulation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning.
CoRR, 2024

RL2AC: Reinforcement Learning-based Rapid Online Adaptive Control for Legged Robot Robust Locomotion.
Proceedings of the Robotics: Science and Systems XX, 2024

GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

PiTe: Pixel-Temporal Alignment for Large Video-Language Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations.
Neurocomputing, 2023

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.
CoRR, 2023

RSG: Fast Learning Adaptive Skills for Quadruped Robots by Skill Graph.
CoRR, 2023

A Composite Control Strategy for Quadruped Robot by Integrating Reinforcement Learning and Model-Based Control.
IROS, 2023

2021
A Nearly Optimal Chattering Reduction Method of Sliding Mode Control With an Application to a Two-wheeled Mobile Robot.
CoRR, 2021


  Loading...