Shuo Li

Orcid: 0000-0003-2002-3894

Affiliations:
  • Xidian University, School of Artificial Intelligence, MoE Key Laboratory of Intelligent Perception and Image Understanding / International Research Center for Intelligent Perception and Computation, Xi'an, China


According to our database, Shuo Li authored at least 54 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Delving Aleatoric Uncertainty in Medical Image Segmentation via Vision Foundation Models.
CoRR, April, 2026

PromptVAD: Abnormal Prompt via Vision-Language Model.
IEEE Trans. Neural Networks Learn. Syst., March, 2026

ERFC: Energy-Aware Reinforcement Feedback Calibration for Zero-Shot Captioning.
IEEE Trans. Circuits Syst. Video Technol., February, 2026

Adaptive Visual Prompting for Effective Satellite Video Tracking.
IEEE Trans. Multim., 2026

Adaptive Multi-Modal Visual Tracking With Dynamic Semantic Prompts.
IEEE Trans. Multim., 2026

Language-guided modulation-update for semi-supervised semantic segmentation.
Pattern Recognit., 2026

Vision-by-prompt: Context-aware dual prompts for composed video retrieval.
Pattern Recognit., 2026

VCGPrompt: Visual Concept Graph-Aware Prompt Learning for Vision-Language Models.
Pattern Recognit., 2026

Concept-Aware Learning for Weakly Supervised Video Anomaly Detection.
Pattern Recognit., 2026

Text augmentation for vision: Modality-preference aware few-shot learning.
Knowl. Based Syst., 2026

Multi-level vision language interaction learning for cross-modal retrieval.
Inf. Fusion, 2026

Enhancing few-shot segmentation via mask combination learning.
Neurocomputing, 2026

Semantic Feature Purification for Adversarially-Aware RGB-T Tracking.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

HTTrack: Learning to Perceive Targets via Historical Trajectories in Satellite Video Tracking.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Chain-of-Situation Aware Progressive Inference Learning.
IEEE Trans. Neural Networks Learn. Syst., October, 2025

Visual-Language Scene-Relation-Aware Zero-Shot Captioner.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2025

Prompt-Based Concept Learning for Few-Shot Class-Incremental Learning.
IEEE Trans. Circuits Syst. Video Technol., May, 2025

Revealing Bias Formation in Deep Neural Networks Through the Geometric Mechanisms of Human Visual Decoupling.
CoRR, February, 2025

Anomaly-Led Prompting Learning Caption Generating Model and Benchmark.
IEEE Trans. Multim., 2025

Local-Global Spectral Feature-Aware Learning for Hyperspectral Imagery Classification.
IEEE Trans. Geosci. Remote. Sens., 2025

Change Knowledge-Guided Vision-Language Remote Sensing Change Detection.
IEEE Trans. Geosci. Remote. Sens., 2025

Fine-Grained Visual-Language Alignment for Remote Sensing Image-Text Retrieval.
IEEE Trans. Geosci. Remote. Sens., 2025

Remote Sensing Video Tracking: Current Status, Challenges, and Future.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2025

VLPA-CLIP: Video Language Prompting and Adapting CLIP for efficient video action recognition.
Pattern Recognit., 2025

Knowledge-Driven Compositional Action Recognition.
Pattern Recognit., 2025

Text generation and multi-modal knowledge transfer for few-shot object detection.
Pattern Recognit., 2025

LLM Knowledge-Driven Target Prototype Learning for Few-Shot Segmentation.
Knowl. Based Syst., 2025

Preserving text space integrity for robust compositional zero-shot learning via mixture of pretrained experts.
Neurocomputing, 2025

Tradeoffs Between Richness and Bias of Augmented Data in Long-Tail Recognition.
Entropy, 2025

Exploring Beyond Logits: Hierarchical Dynamic Labeling Based on Embeddings for Semi-Supervised Classification.
IEEE Access, 2025

Imagining Vision From Language for Few-Shot Class-Incremental Learning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

FA³T: Feature-Aware Adversarial Attacks for Multi-modal Tracking.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Hierarchical Variational Test-Time Prompt Generation for Zero-Shot Generalization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Knowledge-Guided Part Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Logits DeConfusion with CLIP for Few-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Few-Shot Classification of Fungi Species Using Contrastive Representation Learning and Multimodal Fusion.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum, 2025

2024
Visual and Language Collaborative Learning for RGBT Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., December, 2024

Multi-Grained Gradual Inference Model for Multimedia Event Extraction.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Self-Supervised Self-Organizing Clustering Network: A Novel Unsupervised Representation Learning Method.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Mask-Guided Correlation Learning for Few-Shot Segmentation in Remote Sensing Imagery.
IEEE Trans. Geosci. Remote. Sens., 2024

Satellite Video Object Tracking Based on Location Prompts.
IEEE Trans. Circuits Syst. Video Technol., 2024

A Patch-Level Region-Aware Module with a Multi-Label Framework for Remote Sensing Image Captioning.
Remote. Sens., 2024

Multi-modal visual tracking based on textual generation.
Inf. Fusion, 2024

Self-restrained contrastive enhanced network for graph structure learning.
Expert Syst. Appl., 2024

ViLT-CLIP: Video and Language Tuning CLIP with Multimodal Prompt Learning and Scenario-Guided Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Knowledge transfer evolutionary search for lightweight neural architecture with dynamic inference.
Pattern Recognit., November, 2023

Knowledge transduction for cross-domain few-shot learning.
Pattern Recognit., September, 2023

MinEnt: Minimum entropy for self-supervised representation learning.
Pattern Recognit., June, 2023

Learning Salient Feature for Salient Object Detection Without Labels.
IEEE Trans. Cybern., 2023

Task context transformer and GCN for few-shot learning of cross-domain.
Neurocomputing, 2023

2022
MFNet: A Novel GNN-Based Multi-Level Feature Network With Superpixel Priors.
IEEE Trans. Image Process., 2022

Augmentative contrastive learning for one-shot object detection.
Neurocomputing, 2022

Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space.
Proceedings of the Computer Vision - ECCV 2022, 2022

Self-Training Multi-Sequence Learning with Transformer for Weakly Supervised Video Anomaly Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

