Wei Suo

Orcid: 0000-0002-4896-0006

According to our database1, Wei Suo authored at least 29 papers between 2019 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Hallucination-aware intermediate representation edit in large vision-language models.
CoRR, March, 2026

Understanding and Mitigating Hallucinations in Multimodal Chain-of-Thought Models.
CoRR, March, 2026

More video-relevant paragraph captioning via Perturbed Attention Self-Distillation.
Pattern Recognit., 2026

Look and check: A multi-label classification pipeline via multi-agent cooperation.
Neurocomputing, 2026

2025
Distributed Online Convex Optimization Over Time-Varying Unbalanced Digraphs With Multiple Coupled Constraints.
IEEE Trans. Syst. Man Cybern. Syst., December, 2025

CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning.
IEEE Trans. Neural Networks Learn. Syst., August, 2025

Distributed online constrained nonconvex optimization in dynamic environments over directed graphs.
Signal Process., 2025

CHA: Conditional Hyper-Adapter method for detecting human-object interaction.
Pattern Recognit., 2025

Mitigating Information Loss under High Pruning Rates for Efficient Large Vision Language Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Short-LVLM: Compressing and Accelerating Large Vision-Language Models by Pruning Redundant Layers.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
An Adaptive Correlation Filtering Method for Text-Based Person Search.
Int. J. Comput. Vis., October, 2024

Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models.
CoRR, 2024

Visual Prompt Selection for In-Context Learning Segmentation.
CoRR, 2024

A Plug-and-Play Method for Rare Human-Object Interactions Detection by Bridging Domain Gap.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

C3L: Content Correlated Vision-Language Instruction Tuning Data Generation via Contrastive Learning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Rethinking and Improving Visual Prompt Selection for In-Context Learning Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Distributed online convex optimization with multiple coupled constraints: A double accelerated push-pull algorithm.
J. Frankl. Inst., December, 2023

A Proposal-Free One-Stage Framework for Referring Expression Comprehension and Generation via Dense Cross-Attention.
IEEE Trans. Multim., 2023

Rethinking and Improving Feature Pyramids for One-Stage Referring Expression Comprehension.
IEEE Trans. Image Process., 2023

S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning.
CoRR, 2023

AHT: A Novel Aggregation Hyper-transformer for Few-Shot Object Detection.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

S<sup>3</sup>C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Improving Image Captioning via Enhancing Dual-Side Context Awareness.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Dual-Level Decoupled Transformer for Video Captioning.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

A Simple and Robust Correlation Filtering Method for Text-Based Person Search.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2019
3D Convolutional Long-Short Term Memory Network for Spatiotemporal Modeling of fMRI Data.
Proceedings of the Multimodal Brain Image Analysis and Mathematical Foundations of Computational Anatomy, 2019


  Loading...