Xiangtai Li
Orcid: 0000-0002-0550-8247
According to our database1,
Xiangtai Li
authored at least 159 papers
between 2019 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025
VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models.
CoRR, August, 2025
Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning.
CoRR, August, 2025
Bridge Feature Matching and Cross-Modal Alignment with Mutual-filtering for Zero-shot Anomaly Detection.
CoRR, July, 2025
CoRR, July, 2025
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning.
CoRR, July, 2025
CoRR, June, 2025
CoRR, June, 2025
CoRR, June, 2025
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding.
CoRR, June, 2025
CoRR, June, 2025
IEEE Trans. Circuits Syst. Video Technol., May, 2025
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models.
CoRR, May, 2025
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.
CoRR, May, 2025
DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers.
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook.
CoRR, May, 2025
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation.
Int. J. Comput. Vis., April, 2025
NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results.
CoRR, April, 2025
CoRR, April, 2025
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild.
CoRR, April, 2025
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer.
CoRR, April, 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer.
CoRR, March, 2025
CoRR, February, 2025
A Masked Reference Token Supervision-Based Iterative Visual-Language Framework for Robust Visual Grounding.
IEEE Trans. Circuits Syst. Video Technol., January, 2025
CoRR, January, 2025
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos.
CoRR, January, 2025
Comput. Vis. Image Underst., 2025
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Proceedings of the International Conference on 3D Vision, 2025
2024
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
Int. J. Comput. Vis., September, 2024
Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review.
Remote. Sens., July, 2024
IEEE Trans. Circuits Syst. Video Technol., February, 2024
Int. J. Comput. Vis., February, 2024
IEEE Trans. Pattern Anal. Mach. Intell., 2024
ModelNet-O: A large-scale synthetic dataset for occlusion-aware point cloud classification.
Comput. Vis. Image Underst., 2024
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark.
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024
Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024
2023
Exploring Self-Supervised Learning for Multi-Modal Remote Sensing Pre-Training via Asymmetric Attention Fusion.
Remote. Sens., December, 2023
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023
CoRR, 2023
CoRR, 2023
Neural Collapse Terminus: A Unified Solution for Class Incremental Learning and Its Variants.
CoRR, 2023
Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review.
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow.
CoRR, 2022
CoRR, 2022
Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
IEEE Trans. Image Process., 2021
IEEE Trans. Image Process., 2021
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Image Processing, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019