Zhizheng Zhang

Affiliations:
  • GalBot, Beijing, China


According to our database1, Zhizheng Zhang authored at least 22 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
NavGSim: High-Fidelity Gaussian Splatting Simulator for Large-Scale Navigation.
CoRR, March, 2026

Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning.
CoRR, March, 2026

SPAN-Nav: Generalized Spatial Awareness for Versatile Vision-Language Navigation.
CoRR, March, 2026

SimRecon: SimReady Compositional Scene Reconstruction from Real Videos.
CoRR, March, 2026

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion.
CoRR, February, 2026

Any3D-VLA: Enhancing VLA Robustness via Diverse Point Clouds.
CoRR, February, 2026

2025
StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision.
CoRR, December, 2025

RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation.
CoRR, November, 2025

UrbanVLA: A Vision-Language-Action Model for Urban Micromobility.
CoRR, October, 2025

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking.
CoRR, October, 2025

MM-Nav: Multi-View VLA Model for Robust Visual Navigation via Multi-Expert Learning.
CoRR, October, 2025

CL3R: 3D Reconstruction and Contrastive Learning for Enhanced Robotic Manipulation Representations.
CoRR, July, 2025

DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge.
CoRR, July, 2025

TrackVLA: Embodied Visual Tracking in the Wild.
CoRR, May, 2025

GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data.
CoRR, May, 2025

FetchBot: Object Fetching in Cluttered Shelves via Zero-Shot Sim2Real.
CoRR, February, 2025

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation.
CoRR, February, 2025

DexVLG: Dexterous Vision-Language-Grasp Model at Scale.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.
CoRR, 2024

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation.
Proceedings of the Robotics: Science and Systems XX, 2024

Open6DOR: Benchmarking Open-instruction 6-DoF Object Rearrangement and A VLM-based Approach.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024


  Loading...