Zhi Gao

Orcid: 0000-0002-4424-4352

Affiliations:

Beijing Institute of Technology, China

According to our database¹, Zhi Gao authored at least 40 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Curvature Learning for Generalization of Hyperbolic Neural Networks.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., December, 2025

Modality Alignment across Trees on Heterogeneous Hyperbolic Manifolds.

[BibT_eX]

[DOI]

CoRR, October, 2025

GUI Knowledge Bench: Revealing the Knowledge Gap Behind VLM Failures in GUI Tasks.

[BibT_eX]

[DOI]

CoRR, October, 2025

Multi-Step Reasoning for Embodied Question Answering via Tool Augmentation.

[BibT_eX]

[DOI]

CoRR, October, 2025

KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints.

[BibT_eX]

[DOI]

CoRR, October, 2025

Beyond the Seen: Bounded Distribution Estimation for Open-Vocabulary Learning.

[BibT_eX]

[DOI]

CoRR, October, 2025

Adaptive Model Ensemble for Continual Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

Geometry-aware Distance Measure for Diverse Hierarchical Structures in Hyperbolic Spaces.

[BibT_eX]

[DOI]

CoRR, June, 2025

A Set-to-Set Distance Measure in Hyperbolic Space.

[BibT_eX]

[DOI]

CoRR, June, 2025

Hyperbolic Dual Feature Augmentation for Open-Environment.

[BibT_eX]

[DOI]

CoRR, June, 2025

When Large Multimodal Models Confront Evolving Knowledge:Challenges and Pathways.

[BibT_eX]

[DOI]

CoRR, May, 2025

Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL.

[BibT_eX]

[DOI]

CoRR, May, 2025

Memory-Centric Embodied Question Answer.

[BibT_eX]

[DOI]

CoRR, May, 2025

Iterative Trajectory Exploration for Multimodal Agents.

[BibT_eX]

[DOI]

CoRR, April, 2025

TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials.

[BibT_eX]

[DOI]

CoRR, April, 2025

Building LLM Agents by Incorporating Insights from Computer Systems.

[BibT_eX]

[DOI]

CoRR, April, 2025

Large-scale Riemannian meta-optimization via subspace adaptation.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2025

Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

[inline-graphic not available: see fulltext]VideoAgent: A Memory-Augmented Multimodal Agent for Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Geometry-adaptive Meta-learning in Riemannian Manifolds.

[BibT_eX]

[DOI]

Zhi Gao

Proceedings of the ACM Turing Award Celebration Conference 2024, 2024

2023

Learning to Optimize on Riemannian Manifolds.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Curvature-Adaptive Meta-Learning for Fast Adaptation to Manifold Data.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2023

Exploring Data Geometry for Continual Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Infinite-dimensional feature aggregation via a factorized bilinear model.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

Hyperbolic Feature Augmentation via Distribution Estimation and Infinite Sampling on Manifolds.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Efficient Riemannian Meta-Optimization by Implicit Differentiation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Curvature Generation in Curved Spaces for Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Hyperbolic-to-Hyperbolic Graph Convolutional Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning a Gradient-free Riemannian Optimizer on Tangent Spaces.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

A Robust Distance Measure for Similarity-Based Classification on the SPD Manifold.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2020

Learning to Optimize on SPD Manifolds.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Revisiting Bilinear Pooling: A Coding Perspective.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Learning a robust representation via a deep network on symmetric positive definite manifolds.

[BibT_eX]

[DOI]

Pattern Recognit., 2019

Deep convolutional network with locality and sparsity constraints for texture classification.

[BibT_eX]

[DOI]

Pattern Recognit., 2019

2018

Set-to-Set Distance Metric Learning on SPD Manifolds.

[BibT_eX]

[DOI]

Zhi Gao

Yuwei Wu

Yunde Jia

Proceedings of the Pattern Recognition and Computer Vision - First Chinese Conference, 2018

2017

Learning a Robust Representation via a Deep Network on Symmetric Positive Definite Manifolds.

[BibT_eX]

[DOI]

CoRR, 2017

Zhi Gao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...