Guolei Sun

Orcid: 0000-0001-8667-9656

According to our database1, Guolei Sun authored at least 57 papers between 2017 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
RGB-D Indiscernible Object Counting in Underwater Scenes.
Int. J. Comput. Vis., May, 2026

Breaking Modality Heterogeneity in Low-Bit Quantization for Large Vision-Language Models.
CoRR, May, 2026

Video Understanding: From Geometry and Semantics to Unified Models.
CoRR, March, 2026

EgoSound: Benchmarking Sound Understanding in Egocentric Videos.
CoRR, February, 2026

DINO-Mix: Distilling Foundational Knowledge with Cross-Domain CutMix for Semi-supervised Class-imbalanced Medical Image Segmentation.
CoRR, February, 2026

Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization.
CoRR, February, 2026

2025
Evaluating SAM2 for Video Semantic Segmentation.
CoRR, December, 2025

EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark.
CoRR, October, 2025

A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects.
CoRR, June, 2025

CamSAM2: Segment Anything Accurately in Camouflaged Videos.
CoRR, March, 2025

When SAM2 meets video camouflaged object segmentation: a comprehensive evaluation and adaptation.
Vis. Intell., 2025

Towards Open-Vocabulary Video Semantic Segmentation.
IEEE Trans. Multim., 2025

HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

XTrack: Multimodal Training Boosts RGB-X Video Object Trackers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MedVSR: Medical Video Super-Resolution with Cross State-Space Propagation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ObjectRelator: Enabling Cross-View Object Relation Understanding Across Ego-Centric and Exo-Centric Perspectives.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Exploiting Temporal State Space Sharing for Video Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Learning Local and Global Temporal Contexts for Video Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

Rethinking Global Context in Crowd Counting.
Mach. Intell. Res., August, 2024

Vision Transformers with Hierarchical Attention.
Mach. Intell. Res., August, 2024

Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Advanced Attention Mechanisms for Dense Prediction.
PhD thesis, 2024

When SAM2 Meets Video Camouflaged Object Segmentation: A Comprehensive Evaluation and Adaptation.
CoRR, 2024

Towards a Generalist and Blind RGB-X Tracker.
CoRR, 2024

Self-Explainable Affordance Learning with Embodied Caption.
CoRR, 2024

Global and Compact Video Context Embedding for Video Semantic Segmentation.
IEEE Access, 2024

Video Foundation Model for Medical 3D Segmentation.
Proceedings of the Supervised and Semi-supervised Multi-structure Segmentation and Landmark Detection in Dental Data, 2024

Rethinking Few-shot 3D Point Cloud Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Edge Guided GANs With Multi-Scale Contrastive Learning for Semantic Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Object Segmentation by Mining Cross-Modal Semantics.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Indiscernible Object Counting in Underwater Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Towards Partial Supervision for Generic Object Counting in Natural Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Mining Relations Among Cross-Frame Affinities for Video Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Coarse-to-Fine Feature Mining for Video Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Boosting Few-shot Semantic Segmentation with Transformers.
CoRR, 2021

Transformer in Convolutional Neural Networks.
CoRR, 2021

Boosting Crowd Counting with Transformers.
CoRR, 2021

SwinIR: Image Restoration Using Swin Transformer.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Task Switching Network for Multi-task Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CompositeTasking: Understanding Images by Spatial Composition of Tasks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
LID 2020: The Learning from Imperfect Data Challenge Results.
CoRR, 2020

Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fixing Localization Errors to Improve Image Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Camouflaged Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Fine-Grained Recognition: Accounting for Subtle Differences between Similar Classes.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Novel Framework for Node/Edge Attributed Graph Embedding.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2019

iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Object Counting and Instance Segmentation With Image-Level Supervision.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Multi-label Learning with Highly Incomplete Data via Collaborative Embedding.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

WalkRanker: A Unified Pairwise Ranking Model With Multiple Relations for Item Recommendation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Graph Embedding with Rich Information through Bipartite Heterogeneous Network.
CoRR, 2017


  Loading...