Xiangtai Li

ORCID: 0000-0002-0550-8247

According to our database, Xiangtai Li authored at least 76 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Multi-Task Learning With Multi-Query Transformer for Dense Prediction.
IEEE Trans. Circuits Syst. Video Technol., February, 2024

SFNet: Faster and Accurate Semantic Segmentation Via Semantic Flow.
Int. J. Comput. Vis., February, 2024

Toward Robust Referring Image Segmentation.
IEEE Trans. Image Process., 2024

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries.
CoRR, 2024

GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning.
CoRR, 2024

Explore In-Context Segmentation via Latent Diffusion Models.
CoRR, 2024

Point Cloud Mamba: Point Cloud Learning via State Space Model.
CoRR, 2024

Generalizable Entity Grounding via Assistance of Large Language Model.
CoRR, 2024

OMG-Seg: Is One Model Good Enough For All Segmentation?
CoRR, 2024

RAP-SAM: Towards Real-Time All-Purpose Segment Anything.
CoRR, 2024

Towards Language-Driven Video Inpainting via Multimodal Large Language Models.
CoRR, 2024

ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification.
CoRR, 2024

Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively.
CoRR, 2024

An Open and Comprehensive Pipeline for Unified Object Grounding and Detection.
CoRR, 2024

BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model.
CoRR, 2024

A Generalist FaceX via Learning Unified Facial Representation.
CoRR, 2024

2023
Exploring Self-Supervised Learning for Multi-Modal Remote Sensing Pre-Training via Asymmetric Attention Fusion.
Remote. Sens., December, 2023

Convolution-Enhanced Evolving Attention Networks.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

TransVOD: End-to-End Video Object Detection With Spatial-Temporal Transformers.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Improving Video Instance Segmentation via Temporal Pyramid Routing.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation.
CoRR, 2023

Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection.
CoRR, 2023

EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM.
CoRR, 2023

Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning.
CoRR, 2023

Effective Adapter for Face Recognition in the Wild.
CoRR, 2023

Rethinking Evaluation Metrics of Open-Vocabulary Segmentation.
CoRR, 2023

OV-VG: A Benchmark for Open-Vocabulary Visual Grounding.
CoRR, 2023

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction.
CoRR, 2023

DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection.
CoRR, 2023

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation.
CoRR, 2023

Neural Collapse Terminus: A Unified Solution for Class Incremental Learning and Its Variants.
CoRR, 2023

Pair then Relation: Pair-Net for Panoptic Scene Graph Generation.
CoRR, 2023

Towards Open Vocabulary Learning: A Survey.
CoRR, 2023

Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review.
CoRR, 2023

Transformer-Based Visual Segmentation: A Survey.
CoRR, 2023

Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation.
CoRR, 2023

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation.
CoRR, 2023

Rethinking Mobile Block for Efficient Neural Models.
CoRR, 2023

PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation.
CoRR, 2023

4D Panoptic Scene Graph Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Explore In-Context Learning for 3D Point Cloud Understanding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Mobile Block for Efficient Attention-based Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Panoptic Video Scene Graph Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Towards Robust Referring Image Segmentation.
CoRR, 2022

SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow.
CoRR, 2022

EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm.
CoRR, 2022

Multi-Task Learning with Multi-query Transformer for Dense Prediction.
CoRR, 2022

Do We Really Need a Learnable Classifier at the End of Deep Neural Network?
CoRR, 2022

Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Query Learning of Both Thing and Stuff for Panoptic Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Global Aggregation Then Local Distribution for Scene Parsing.
IEEE Trans. Image Process., 2021

Towards Efficient Scene Understanding via Squeeze Reasoning.
IEEE Trans. Image Process., 2021

Improving Video Instance Segmentation via Temporal Pyramid Routing.
CoRR, 2021

BoundarySqueeze: Image Segmentation as Boundary Squeezing.
CoRR, 2021

End-to-End Video Object Detection with Spatial-Temporal Transformers.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Fast and Accurate Scene Parsing via Bi-Direction Alignment Networks.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Dynamic Dual Sampling Module For Fine-Grained Semantic Segmentation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Enhanced Boundary Learning for Glass-like Object Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Involution: Inverting the Inherence of Convolution for Visual Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Semantic Flow for Fast and Accurate Scene Parsing.
CoRR, 2020

Semantic Flow for Fast and Accurate Scene Parsing.
Proceedings of the Computer Vision - ECCV 2020, 2020

Improving Semantic Segmentation via Decoupled Body and Edge Supervision.
Proceedings of the Computer Vision - ECCV 2020, 2020

Gated Fully Fusion for Semantic Segmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
GFF: Gated Fully Fusion for Semantic Segmentation.
CoRR, 2019

Flow2Seg: Motion-Aided Semantic Segmentation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Image Processing, 2019

Dual Graph Convolutional Network for Semantic Segmentation.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Global Aggregation then Local Distribution in Fully Convolutional Networks.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

