Xiaolin Wei

Orcid: 0000-0003-0641-5330

According to our database¹, Xiaolin Wei authored at least 80 papers between 2014 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

Pose-Aware 3D Talking Face Synthesis Using Geometry-Guided Audio-Vertices Attention.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., March, 2025

Low-Dose CT Image Super-Resolution With Noise Suppression Based on Prior Degradation Estimator and Self-Guidance Mechanism.

[BibT_eX]

[DOI]

IEEE Trans. Medical Imaging, February, 2025

RGB-T Tracking With Template-Bridged Search Interaction and Target-Preserved Template Updating.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

An End-to-End Framework for Aerial Object Detection with State Space Models and Multiscale Attention Gating in Hazy Scenes.

[BibT_eX]

[DOI]

Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

2024

Exploration and Exploitation of Unlabeled Data for Open-Set Semi-supervised Learning.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., December, 2024

Anti-Drift Gas Detection Algorithm Based on Neural Network.

[BibT_eX]

[DOI]

IEEE Trans. Instrum. Meas., 2024

SeisFusion: Constrained Diffusion Model With Input Guidance for 3-D Seismic Data Interpolation and Reconstruction.

[BibT_eX]

[DOI]

IEEE Trans. Geosci. Remote. Sens., 2024

Low-Dose CT Image Super-resolution Network with Noise Inhibition Based on Feedback Feature Distillation Mechanism.

[BibT_eX]

[DOI]

J. Imaging Inform. Medicine, 2024

3D colored object reconstruction from a single view image through diffusion.

[BibT_eX]

[DOI]

Expert Syst. Appl., 2024

SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction.

[BibT_eX]

[DOI]

CoRR, 2024

Improve Deep Learning Autofocus with Depth Information Supervision and Current Focal Distance Cues.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

Enhancing Autofocus Performance through Predictive Motion-Targeting and Self-Attention in a Deep Reinforcement Learning Framework.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

Animating General Image with Large Visual Motion Model.

[BibT_eX]

[DOI]

Dengsheng Chen

Xiaoming Wei

Xiaolin Wei

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Large Scale Visual Food Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Weakly Supervised Semantic Segmentation Via Progressive Patch Learning.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices.

[BibT_eX]

[DOI]

CoRR, 2023

3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

Pose-Controllable 3D Facial Animation Synthesis using Hierarchical Audio-Vertex Attention.

[BibT_eX]

[DOI]

CoRR, 2023

3D Colored Shape Reconstruction from a Single RGB Image through Diffusion.

[BibT_eX]

[DOI]

CoRR, 2023

Orthogonal Temporal Interpolation for Zero-Shot Video Recognition.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

NTIRE 2023 Image Shadow Removal Challenge Report.

[BibT_eX]

[DOI]

Florin-Alexandru Vasluianu

Fredrik K. Gustafsson

Santosh Kumar Vipparthi

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bridging Search Region Interaction with Template for RGB-T Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Pyramid Ensemble Structure for High Resolution Image Shadow Removal.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Uncertainty-Aware Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Dimension-aware attention for efficient mobile networks.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

Contrastive attention network with dense field estimation for face completion.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

Multiple Object Tracking Challenge Technical Report for Team MT_IoT.

[BibT_eX]

[DOI]

CoRR, 2022

HAM: Hierarchical Attention Model with High Performance for 3D Visual Grounding.

[BibT_eX]

[DOI]

CoRR, 2022

Progressive Denoising Model for Fine-Grained Text-to-Image Generation.

[BibT_eX]

[DOI]

CoRR, 2022

Meta-Ensemble Parameter Learning.

[BibT_eX]

[DOI]

CoRR, 2022

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications.

[BibT_eX]

[DOI]

CoRR, 2022

Efficient Modeling of Future Context for Image Captioning.

[BibT_eX]

[DOI]

CoRR, 2022

MT-Net Submission to the Waymo 3D Detection Leaderboard.

[BibT_eX]

[DOI]

CoRR, 2022

PromptDet: Expand Your Detector Vocabulary with Uncurated Images.

[BibT_eX]

[DOI]

CoRR, 2022

InsCon: Instance Consistency Feature Representation via Self-Supervised Learning.

[BibT_eX]

[DOI]

CoRR, 2022

Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SegViT: Semantic Segmentation with Plain Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Synthesizing Counterfactual Samples for Effective Image-Text Matching.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

SoccerNet 2022 Challenges Results.

[BibT_eX]

[DOI]

Christophe De Vleeschouwer

Alexandre Alahi

Bernard Ghanem

Marc Van Droogenbroeck

Miguel Santos Marques

Proceedings of the MMSports@MM 2022: Proceedings of the 5th International ACM Workshop on Multimedia Content Analysis in Sports, 2022

Concept Propagation via Attentional Knowledge Graph Reasoning for Video-Text Retrieval.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Accurate Post-Training Quantization for Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking the Optimization of Average Precision: Only Penalizing Negative Instances before Positive Ones Is Enough.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Coupled adversarial learning for semi-supervised heterogeneous face recognition.

[BibT_eX]

[DOI]

Pattern Recognit., 2021

Twins: Revisiting Spatial Attention Design in Vision Transformers.

[BibT_eX]

[DOI]

CoRR, 2021

Do We Really Need Explicit Position Encodings for Vision Transformers?

[BibT_eX]

[DOI]

CoRR, 2021

Twins: Revisiting the Design of Spatial Attention in Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Two-stage Visual Cues Enhancement Network for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Structure Guided Lane Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Light Weight Facial Landmark Detection With Weakly Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Scene Text Detection with Scribble Line.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Heterogeneous Network Based Semi-supervised Learning for Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Transformer Meets Part Model: Adaptive Part Division for Person Re-Identification.

[BibT_eX]

[DOI]

Shenqi Lai

Zhenhua Chai

Xiaolin Wei

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Learn to Cluster Faces via Pairwise Classification.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Rethinking BiSeNet for Real-Time Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Scene Text Detection with Scribble Lines.

[BibT_eX]

[DOI]

CoRR, 2020

Beyond Single Instance Multi-view Unsupervised Representation Learning.

[BibT_eX]

[DOI]

Xiangxiang Chu

Xiaohang Zhan

Xiaolin Wei

CoRR, 2020

ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradients Accumulation.

[BibT_eX]

[DOI]

CoRR, 2020

FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Query Twice: Dual Mixture Attention Meta Learning for Video Summarization.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Robust Lexicon-Free Confidence Prediction for Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Free-Form Image Inpainting via Contrastive Attention Network.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

An Improved Convolutional Block Attention Module for Chinese Character Recognition.

[BibT_eX]

[DOI]

Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

A Method for Scene Text Style Transfer.

[BibT_eX]

[DOI]

Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

ALEC: An Accurate, Light and Efficient Network for CAPTCHA Recognition.

[BibT_eX]

[DOI]

Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

An Accurate Segmentation-Based Scene Text Detector with Context Attention and Repulsive Text Border.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2014

Mosaic method based on feature points detection and tracking for unmanned aerial vehicle videos.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Natural Computation, 2014

Xiaolin Wei

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...