Han Zhao

Orcid: 0000-0003-3948-0649

Affiliations:
  • Westlake University, Hangzhou, China
  • Zhejiang University, Hangzhou, China


According to our database1, Han Zhao authored at least 28 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process.
CoRR, November, 2025

VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation.
CoRR, October, 2025

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model.
CoRR, October, 2025

Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey.
CoRR, October, 2025

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model.
CoRR, September, 2025

ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
CoRR, August, 2025

CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.
CoRR, June, 2025

RationalVLA: A Rational Vision-Language-Action Model with Dual System.
CoRR, June, 2025

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning.
CoRR, May, 2025

Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions.
CoRR, May, 2025

ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning.
CoRR, May, 2025

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation.
CoRR, May, 2025

Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding.
CoRR, March, 2025

A nearly optimal adaptive saturation function tuning method for quasi-sliding mode control based on integral reinforcement learning.
Neurocomputing, 2025

MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Quart-Online: Latency-Free Multimodal Large Language Model for Quadruped Robot Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

VLAS: Vision-Language-Action Model with Speech Instructions for Customized Robot Manipulation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning.
CoRR, 2024

RL2AC: Reinforcement Learning-based Rapid Online Adaptive Control for Legged Robot Robust Locomotion.
Proceedings of the Robotics: Science and Systems XX, 2024

GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

PiTe: Pixel-Temporal Alignment for Large Video-Language Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations.
Neurocomputing, 2023

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots.
CoRR, 2023

RSG: Fast Learning Adaptive Skills for Quadruped Robots by Skill Graph.
CoRR, 2023

A Composite Control Strategy for Quadruped Robot by Integrating Reinforcement Learning and Model-Based Control.
IROS, 2023

2021
A Nearly Optimal Chattering Reduction Method of Sliding Mode Control With an Application to a Two-wheeled Mobile Robot.
CoRR, 2021


  Loading...