Zilun Zhang

Orcid: 0009-0008-5961-5970

According to our database1, Zilun Zhang authored at least 16 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model.
CoRR, April, 2025

GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing.
CoRR, March, 2025

SRMF: A Data Augmentation and Multimodal Fusion Approach for Long-Tail UHR Satellite Image Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2025

The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
RS5M and GeoRSCLIP: A Large-Scale Vision- Language Dataset and a Large Vision-Language Model for Remote Sensing.
IEEE Trans. Geosci. Remote. Sens., 2024

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration.
CoRR, 2024

Enhancing Ultra High Resolution Remote Sensing Imagery Analysis with ImageRAG.
CoRR, 2024

Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression.
CoRR, 2024

Methods for Monitoring the Photovoltaic Panel: A Review.
Proceedings of the 12th International Conference on Agro-Geoinformatics, 2024

Top-k Collective Spatial Keyword Approximate Query.
Proceedings of the Web Information Systems and Applications, 2024

2023
A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent Attention.
Remote. Sens., October, 2023

RS5M: A Large Scale Vision-Language Dataset for Remote Sensing Vision-Language Foundation Model.
CoRR, 2023

2022
Introducing Vision Transformer for Alzheimer's Disease classification task with 3D input.
CoRR, 2022

Injecting Image Details into CLIP's Feature Space.
CoRR, 2022

2021
Will Multi-modal Data Improves Few-shot Learning?
CoRR, 2021

2020
DPGN: Distribution Propagation Graph Network for Few-Shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


  Loading...