Guolei Sun

ORCID: 0000-0001-8667-9656

According to our database, Guolei Sun authored at least 57 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

RGB-D Indiscernible Object Counting in Underwater Scenes.

Int. J. Comput. Vis., May, 2026

Breaking Modality Heterogeneity in Low-Bit Quantization for Large Vision-Language Models.

CoRR, May, 2026

Video Understanding: From Geometry and Semantics to Unified Models.

CoRR, March, 2026

EgoSound: Benchmarking Sound Understanding in Egocentric Videos.

CoRR, February, 2026

DINO-Mix: Distilling Foundational Knowledge with Cross-Domain CutMix for Semi-supervised Class-imbalanced Medical Image Segmentation.

Xinyu Liu

Guolei Sun

CoRR, February, 2026

Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization.

CoRR, February, 2026

2025

Evaluating SAM2 for Video Semantic Segmentation.

Syed Ariff Syed Hesham

CoRR, December, 2025

EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark.

CoRR, October, 2025

A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects.

Guohuan Xie

Syed Ariff Syed Hesham

CoRR, June, 2025

CamSAM2: Segment Anything Accurately in Camouflaged Videos.

CoRR, March, 2025

When SAM2 meets video camouflaged object segmentation: a comprehensive evaluation and adaptation.

Vis. Intell., 2025

Towards Open-Vocabulary Video Semantic Segmentation.

IEEE Trans. Multim., 2025

HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking.

Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

XTrack: Multimodal Training Boosts RGB-X Video Object Trackers.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ObjectRelator: Enabling Cross-View Object Relation Understanding Across Ego-Centric and Exo-Centric Perspectives.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Exploiting Temporal State Space Sharing for Video Semantic Segmentation.

Syed Ariff Syed Hesham

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation.

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Learning Local and Global Temporal Contexts for Video Semantic Segmentation.

IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Rethinking Global Context in Crowd Counting.

Mach. Intell. Res., August, 2024

Vision Transformers with Hierarchical Attention.

Mach. Intell. Res., August, 2024

Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning.

Wenguan Wang

Guolei Sun

Luc Van Gool

IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Advanced Attention Mechanisms for Dense Prediction.

Guolei Sun

PhD thesis, 2024

When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation.

CoRR, 2024

Towards a Generalist and Blind RGB-X Tracker.

CoRR, 2024

Self-Explainable Affordance Learning with Embodied Caption.

CoRR, 2024

Global and Compact Video Context Embedding for Video Semantic Segmentation.

IEEE Access, 2024

Video Foundation Model for Medical 3D Segmentation.

Proceedings of the Supervised and Semi-supervised Multi-structure Segmentation and Landmark Detection in Dental Data, 2024

Rethinking Few-shot 3D Point Cloud Semantic Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Object Segmentation by Mining Cross-Modal Semantics.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Indiscernible Object Counting in Underwater Scenes.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation.

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022

Towards Partial Supervision for Generic Object Counting in Natural Scenes.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation.

Proceedings of the Computer Vision - ECCV 2022, 2022

Coarse-to-Fine Feature Mining for Video Semantic Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Boosting Few-shot Semantic Segmentation with Transformers.

CoRR, 2021

Transformer in Convolutional Neural Networks.

CoRR, 2021

Boosting Crowd Counting with Transformers.

CoRR, 2021

SwinIR: Image Restoration Using Swin Transformer.

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Task Switching Network for Multi-task Learning.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CompositeTasking: Understanding Images by Spatial Composition of Tasks.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

LID 2020: The Learning from Imperfect Data Challenge Results.

CoRR, 2020

Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation.

Proceedings of the Computer Vision - ECCV 2020, 2020

Fixing Localization Errors to Improve Image Classification.

Proceedings of the Computer Vision - ECCV 2020, 2020

Camouflaged Object Detection.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Fine-Grained Recognition: Accounting for Subtle Differences between Similar Classes.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

A Novel Framework for Node/Edge Attributed Graph Embedding.

Guolei Sun

Xiangliang Zhang

Proceedings of the Advances in Knowledge Discovery and Data Mining, 2019

iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Object Counting and Instance Segmentation With Image-Level Supervision.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Multi-label Learning with Highly Incomplete Data via Collaborative Embedding.

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

WalkRanker: A Unified Pairwise Ranking Model With Multiple Relations for Item Recommendation.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Graph Embedding with Rich Information through Bipartite Heterogeneous Network.

Guolei Sun

Xiangliang Zhang

CoRR, 2017
