Gengze Zhou

Orcid: 0000-0003-0279-9277

According to our database1, Gengze Zhou authored at least 15 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments.
CoRR, May, 2026

LightMover: Generative Light Movement with Color and Intensity Controls.
CoRR, March, 2026

LiveWorld: Simulating Out-of-Sight Dynamics in Generative Video World Models.
CoRR, March, 2026

VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation Agents.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
VLNVerse: A Benchmark for Vision-Language Navigation with Versatile, Embodied, Realistic Simulation and Evaluation.
CoRR, December, 2025

MMGR: Multi-Modal Generative Reasoning.
CoRR, December, 2025

Rethinking Training Dynamics in Scale-wise Autoregressive Generation.
CoRR, December, 2025

Learning Goal-Oriented Language-Guided Navigation with Self-Improving Demonstrations at Scale.
CoRR, September, 2025

Embodied Navigation Foundation Model.
CoRR, September, 2025

Ground-Level Viewpoint Vision-and-Language Navigation in Continuous Environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation.
Proceedings of the Robotics: Science and Systems XX, 2024

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

WebVLN: Vision-and-Language Navigation on Websites.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024


  Loading...