Shan Zhang

ORCID: 0000-0002-5531-3296

Affiliations:
  • Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia


According to our database, Shan Zhang authored at least 17 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Hierarchical Process Reward Models are Symbolic Vision Learners.
CoRR, December, 2025

Artemis: Structured Visual Reasoning for Perception Policy Learning.
CoRR, December, 2025

Agentic Learner with Grow-and-Refine Multimodal Semantic Memory.
CoRR, November, 2025

MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams.
CoRR, March, 2025

Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs.
CoRR, January, 2025

Primitive Vision: Improving Diagram Understanding in MLLMs.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Open-World Objectness Modeling Unifies Novel Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection.
CoRR, 2024

Semantic Transfer from Head to Tail: Enlarging Tail Margin for Long-Tailed Visual Recognition.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

VRP-SAM: SAM with Visual Reference Prompt.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
σ-Adaptive Decoupled Prototype for Few-Shot Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining.
CoRR, 2022

Time-rEversed DiffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Kernelized Few-shot Object Detection with Efficient Integral Aggregation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2020
Few-Shot Object Detection by Second-Order Pooling.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

