Xiaolin Wei

Orcid: 0000-0002-3983-047X

According to our database1, Xiaolin Wei authored at least 70 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction.
CoRR, 2024

2023
Large Scale Visual Food Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Weakly Supervised Semantic Segmentation Via Progressive Patch Learning.
IEEE Trans. Multim., 2023

MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices.
CoRR, 2023

Exploration and Exploitation of Unlabeled Data for Open-Set Semi-Supervised Learning.
CoRR, 2023

3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation.
CoRR, 2023

Pose-Controllable 3D Facial Animation Synthesis using Hierarchical Audio-Vertex Attention.
CoRR, 2023

3D Colored Shape Reconstruction from a Single RGB Image through Diffusion.
CoRR, 2023

Orthogonal Temporal Interpolation for Zero-Shot Video Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

NTIRE 2023 Image Shadow Removal Challenge Report.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bridging Search Region Interaction with Template for RGB-T Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Pyramid Ensemble Structure for High Resolution Image Shadow Removal.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Uncertainty-Aware Image Captioning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Ingredient-Guided Region Discovery and Relationship Modeling for Food Category-Ingredient Prediction.
IEEE Trans. Image Process., 2022

Dimension-aware attention for efficient mobile networks.
Pattern Recognit., 2022

Contrastive attention network with dense field estimation for face completion.
Pattern Recognit., 2022

Multiple Object Tracking Challenge Technical Report for Team MT_IoT.
CoRR, 2022

HAM: Hierarchical Attention Model with High Performance for 3D Visual Grounding.
CoRR, 2022

SoccerNet 2022 Challenges Results.
CoRR, 2022

Progressive Denoising Model for Fine-Grained Text-to-Image Generation.
CoRR, 2022

Meta-Ensemble Parameter Learning.
CoRR, 2022

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.
CoRR, 2022

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications.
CoRR, 2022

Efficient Modeling of Future Context for Image Captioning.
CoRR, 2022

MT-Net Submission to the Waymo 3D Detection Leaderboard.
CoRR, 2022

PromptDet: Expand Your Detector Vocabulary with Uncurated Images.
CoRR, 2022

InsCon: Instance Consistency Feature Representation via Self-Supervised Learning.
CoRR, 2022

Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SegViT: Semantic Segmentation with Plain Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Synthesizing Counterfactual Samples for Effective Image-Text Matching.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022


Concept Propagation via Attentional Knowledge Graph Reasoning for Video-Text Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Towards Accurate Post-Training Quantization for Vision Transformer.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images.
Proceedings of the Computer Vision - ECCV 2022, 2022

Rethinking the Optimization of Average Precision: Only Penalizing Negative Instances before Positive Ones Is Enough.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Coupled adversarial learning for semi-supervised heterogeneous face recognition.
Pattern Recognit., 2021

Twins: Revisiting Spatial Attention Design in Vision Transformers.
CoRR, 2021

Do We Really Need Explicit Position Encodings for Vision Transformers?
CoRR, 2021

Twins: Revisiting the Design of Spatial Attention in Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Two-stage Visual Cues Enhancement Network for Referring Image Segmentation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Structure Guided Lane Detection.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Light Weight Facial Landmark Detection With Weakly Supervised Learning.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators.
Proceedings of the 9th International Conference on Learning Representations, 2021

Scene Text Detection with Scribble Line.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Heterogeneous Network Based Semi-supervised Learning for Scene Text Recognition.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Transformer Meets Part Model: Adaptive Part Division for Person Re-Identification.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Learn to Cluster Faces via Pairwise Classification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Embedded Discriminative Attention Mechanism for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Rethinking BiSeNet for Real-Time Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Scene Text Detection with Scribble Lines.
CoRR, 2020

Beyond Single Instance Multi-view Unsupervised Representation Learning.
CoRR, 2020

ROME: Robustifying Memory-Efficient NAS via Topology Disentanglement and Gradients Accumulation.
CoRR, 2020

FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition.
CoRR, 2020

Query Twice: Dual Mixture Attention Meta Learning for Video Summarization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Robust Lexicon-Free Confidence Prediction for Text Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Free-Form Image Inpainting via Contrastive Attention Network.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

An Improved Convolutional Block Attention Module for Chinese Character Recognition.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

A Method for Scene Text Style Transfer.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

ALEC: An Accurate, Light and Efficient Network for CAPTCHA Recognition.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

An Accurate Segmentation-Based Scene Text Detector with Context Attention and Repulsive Text Border.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2014
Mosaic method based on feature points detection and tracking for unmanned aerial vehicle videos.
Proceedings of the 10th International Conference on Natural Computation, 2014


  Loading...