Qihang Fan

Orcid: 0000-0002-6115-5503

According to our database1, Qihang Fan authored at least 26 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification.
CoRR, May, 2026

Advancing Vision Transformer with Enhanced Spatial Priors.
CoRR, April, 2026

FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling.
CoRR, March, 2026

Random Wins All: Rethinking Grouping Strategies for Vision Tokens.
CoRR, March, 2026

A novel cupping mark recognition method base on multi-scale feature fusion network.
Biomed. Signal Process. Control., 2026

2025
Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models.
CoRR, December, 2025

Thinking With Bounding Boxes: Enhancing Spatio-Temporal Video Grounding via Reinforcement Fine-Tuning.
CoRR, November, 2025

Vidi2: Large Multimodal Models for Video Understanding and Creation.
CoRR, November, 2025

Breaking Complexity Barriers: High-Resolution Image Restoration with Rank Enhanced Linear Attention.
CoRR, May, 2025

DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Vision Transformer with Sparse Scan Prior.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Rectifying Magnitude Neglect in Linear Attention.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Breaking the Low-Rank Dilemma of Linear Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Network Group Partition and Core Placement Optimization for Neuromorphic Multi-Core and Multi-Chip Systems.
IEEE Trans. Emerg. Top. Comput. Intell., December, 2024

Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer.
CoRR, 2024

Vision Transformer with Sparse Scan Prior.
CoRR, 2024

ViTAR: Vision Transformer with Any Resolution.
CoRR, 2024

ViPro-BEV: Few-Shot Visual Prompting for Bird's-Eye-View Perception.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

RMT: Retentive Networks Meet Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DeVAn: Dense Video Annotation for Video-Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Video-CSR: Complex Video Digest Creation for Visual-Language Models.
CoRR, 2023

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling.
CoRR, 2023

Rethinking Local Perception in Lightweight Vision Transformer.
CoRR, 2023

Lightweight Vision Transformer with Bidirectional Interaction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


  Loading...