Xiang Li

Orcid: 0000-0002-4996-7365

Affiliations:
  • Nankai University, College of Computer Science, Tianjin, China
  • Momenta, China
  • Nanjing University of Science and Technology, School of Computer Science and Engineering, PCA Lab, Nanjing, China (PhD 2020)
  • Nanjing University of Science and Technology, Jiangsu Key Lab of Image and Video Understanding for Social Security, Nanjing, China


According to our database, Xiang Li authored at least 147 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Unifying Heterogeneous Multi-Modal Remote Sensing Detection Via Language-Pivoted Pretraining.
CoRR, March, 2026

CrystaL: Spontaneous Emergence of Visual Latents in MLLMs.
CoRR, February, 2026

DTSI: Towards faster convergence of query-based detectors for rotated dense aerial images.
Pattern Recognit., 2026

S2I-DiT: Unlocking the semantic-to-image transferability by fine-tuning large diffusion transformer models.
Pattern Recognit., 2026

Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

SpatioTemporal Difference Network for Video Depth Super-Resolution.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

DenoDet V2: Phase-Amplitude Cross Denoising for SAR Object Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Tri-Perspective View Decomposition for Geometry Aware Depth Completion and Super-Resolution.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2025

AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning.
CoRR, November, 2025

Multi-Order Matching Network for Alignment-Free Depth Super-Resolution.
CoRR, November, 2025

UniChange: Unifying Change Detection with Multimodal Large Language Model.
CoRR, November, 2025

PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection.
Int. J. Comput. Vis., September, 2025

RigNet++: Semantic Assisted Repetitive Image Guided Network for Depth Completion.
Int. J. Comput. Vis., September, 2025

Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization.
CoRR, September, 2025

RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning.
CoRR, September, 2025

NAIPv2: Debiased Pairwise Learning for Efficient Paper Quality Estimation.
CoRR, September, 2025

Revisiting Data Challenges of Computational Pathology: A Pack-based Multiple Instance Learning Framework.
CoRR, September, 2025

Visual Instruction Pretraining for Domain-Specific Foundation Models.
CoRR, September, 2025

Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think.
CoRR, July, 2025

YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-Time Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2025

Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology.
CoRR, June, 2025

MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs.
CoRR, June, 2025

See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction.
CoRR, May, 2025

AuxDet: Auxiliary Metadata Matters for Omni-Domain Infrared Small Target Detection.
CoRR, May, 2025

M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection.
CoRR, May, 2025

DenoDet: Attention as Deformable Multisubspace Feature Denoising for Target Detection in SAR Images.
IEEE Trans. Aerosp. Electron. Syst., April, 2025

A Vision for Auto Research with LLM Agents.
CoRR, April, 2025

Fine-Grained Visual Text Prompting.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

LSKNet: A Foundation Lightweight Backbone for Remote Sensing.
Int. J. Comput. Vis., March, 2025

MSOD: A Large-Scale Multiscene Dataset and a Novel Diagonal-Geometry Loss for SAR Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2025

MoCoLSK: Modality-Conditioned High-Resolution Downscaling for Land Surface Temperature.
IEEE Trans. Geosci. Remote. Sens., 2025

Non-Aligned Supervision for Real Image Dehazing.
IEEE Trans. Circuits Syst. Video Technol., 2025

SlimHead: Rethinking the Efficiency Bottleneck in Dense Object Detection.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Category-Aware 3D Object Composition with Disentangled Texture and Shape Multi-view Diffusion.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Deep Height Decoupling for Precise Vision-Based 3D Occupancy Prediction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

Rethinking Point Cloud Data Augmentation: Topologically Consistent Deformation.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Advancing Textual Prompt Learning with Anchored Attributes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

A Simple Detector with Frame Dynamics is a Strong Tracker.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Wolf in Sheep's Clothing: Understanding and Detecting Mobile Cloaking in Blackhat SEO.
Proceedings of the Applied Cryptography and Network Security, 2025

From Words to Worth: Newborn Article Impact Prediction with LLM.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Multi-clue Consistency Learning to Bridge Gaps Between General and Oriented Object in Semi-supervised Detection.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Zone Evaluation: Revealing Spatial Bias in Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Learning Complementary Correlations for Depth Super-Resolution With Incomplete Data in Real World.
IEEE Trans. Neural Networks Learn. Syst., April, 2024

Learnable differencing center for nighttime depth perception.
Vis. Intell., 2024

Towards more reliable evaluation in pedestrian detection by rethinking "ignore regions".
Vis. Intell., 2024

Pick of the Bunch: Detecting Infrared Small Targets Beyond Hit-Miss Trade-Offs via Selective Rank-Aware Attention.
IEEE Trans. Geosci. Remote. Sens., 2024

Dual teachers for self-knowledge distillation.
Pattern Recognit., 2024

Agent-based Video Trimming.
CoRR, 2024

ATPrompt: Textual Prompt Learning with Embedded Attributes.
CoRR, 2024

GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding.
CoRR, 2024

GrokLST: Towards High-Resolution Benchmark and Toolkit for Land Surface Temperature Downscaling.
CoRR, 2024

HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes.
CoRR, 2024

Revisiting Prompt Pretraining of Vision-Language Models.
CoRR, 2024

FBD-SV-2024: Flying Bird Object Detection Dataset in Surveillance Video.
CoRR, 2024

Add-SD: Rational Generation without Manual Reference.
CoRR, 2024

DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images.
CoRR, 2024

A Literature Review of Literature Reviews in Pattern Analysis and Machine Intelligence.
CoRR, 2024

Novel Object Synthesis via Adaptive Text-Image Harmony.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Cascade Prompt Learning for Vision-Language Model Adaptation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Distilling Knowledge from Large-Scale Image Models for Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

CrossKD: Cross-Head Knowledge Distillation for Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PromptKD: Unsupervised Prompt Distillation for Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

AltNeRF: Learning Robust Neural Radiance Field via Alternating Depth-Pose Optimization.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Boundary-restricted metric learning.
Mach. Learn., December, 2023

Denseformer: A dense transformer framework for person re-identification.
IET Comput. Vis., August, 2023

Generalized Focal Loss: Towards Efficient Representation Learning for Dense Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

APF-GAN: Exploring asymmetric pre-training and fine-tuning strategy for conditional generative adversarial network.
Comput. Vis. Media, February, 2023

One-Stage Cascade Refinement Networks for Infrared Small Target Detection.
IEEE Trans. Geosci. Remote. Sens., 2023

Two-phase self-supervised pretraining for object re-identification.
Knowl. Based Syst., 2023

RigNet++: Efficient Repetitive Image Guided Network for Depth Completion.
CoRR, 2023

Learnable Differencing Center for Nighttime Depth Perception.
CoRR, 2023

CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection.
CoRR, 2023

Variable Radiance Field for Real-Life Category-Specifc Reconstruction from Single Image.
CoRR, 2023

Fine-Grained Visual Prompting.
CoRR, 2023

Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?
CoRR, 2023

A Survey of Historical Learning: Learning Models with Learning History.
CoRR, 2023

Non-aligned supervision for Real Image Dehazing.
CoRR, 2023

Towards Spatial Equilibrium Object Detection.
CoRR, 2023

Fine-Grained Visual Prompting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Security Compressed Sensing Image Encryption Algorithm Based on Elliptic Curve.
Proceedings of the Data Science, 2023

Distortion and Uncertainty Aware Loss for Panoramic Depth Completion.
Proceedings of the International Conference on Machine Learning, 2023

ADNet: Lane Shape Prediction via Anchor Decomposition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Creative Birds: Self-Supervised Single-View 3D Style Transfer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Large Selective Kernel Network for Remote Sensing Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Structure Flow-Guided Network for Real Depth Super-resolution.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Recurrent Structure Attention Guidance for Depth Super-resolution.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

DesNet: Decomposed Scale-Consistent Network for Unsupervised Depth Completion.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Curriculum Temperature for Knowledge Distillation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
CBi-GNN: Cross-Scale Bilateral Graph Neural Network for 3D Object Detection.
IEEE Trans. Intell. Transp. Syst., 2022

PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality.
CoRR, 2022

RecursiveMix: Mixed Learning with History.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PPT: Anomaly Detection Dataset of Printed Products with Templates.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

RigNet: Repetitive Image Guided Network for Depth Completion.
Proceedings of the Computer Vision - ECCV 2022, 2022

Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion.
Proceedings of the Computer Vision - ECCV 2022, 2022

PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Dynamic MLP for Fine-Grained Image Classification by Leveraging Geographical and Temporal Information.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Solution for SnakeCLEF 2022 by Tackling Long-tailed Categorization.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Spatial Group-Wise Enhance: Enhancing Semantic Feature Learning in CNN.
Proceedings of the Computer Vision - ACCV 2022, 2022

Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-Guided Feature Imitation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Analysis and optimisation of server load balancing-based multi-factor integrated algorithm.
Int. J. Wirel. Mob. Comput., 2021

Student Helping Teacher: Teacher Evolution via Self-Knowledge Distillation.
CoRR, 2021

RigNet: Repetitive Image Guided Network for Depth Completion.
CoRR, 2021

Regularizing Nighttime Weirdness: Efficient Self-supervised Monocular Depth Estimation in the Dark.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

E/D Mode Logic Cells and Series-to-Parallel Interface with Less Transistors and Better Structure Consistence in GaAs Process.
Proceedings of the 14th IEEE International Conference on ASIC, 2021

Hierarchical Attentive Upsampling on Input Signals for Remote Heart Rate Estimation.
Proceedings of the Pattern Recognition - 6th Asian Conference, 2021

2020
Toward Making Unsupervised Graph Hashing Discriminative.
IEEE Trans. Multim., 2020

Line-CNN: End-to-End Traffic Line Detection With Line Proposal Unit.
IEEE Trans. Intell. Transp. Syst., 2020

Joint Task-Recursive Learning for RGB-D Scene Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Optimization and Reconstruction of EPMA Image Based on SAMP Algorithm.
Proceedings of the ICMLC 2020: 2020 12th International Conference on Machine Learning and Computing, 2020

Optimization of EPMA Image Reconstruction Based on Generalized Orthogonal Matching Pursuit Algorithm.
Proceedings of the CNIOT 2020: 2020 International Conference on Computing, 2020

Understanding the Disharmony between Weight Normalization Family and Weight Decay.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Spatial Group-wise Enhance: Improving Semantic Feature Learning in Convolutional Networks.
CoRR, 2019

Research on Optimization of CAPTCHA Recognition Algorithm Based on SVM.
Proceedings of the 2019 11th International Conference on Machine Learning and Computing, 2019

Shape Robust Text Detection With Progressive Scale Expansion Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Selective Kernel Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Understanding the Disharmony Between Dropout and Batch Normalization by Variance Shift.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Inter-Class Angular Loss for Convolutional Neural Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Triple Attention Mixed Link Network for Single Image Super Resolution.
CoRR, 2018

Shape Robust Text Detection with Progressive Scale Expansion Network.
CoRR, 2018

Densely Connected Bidirectional LSTM with Applications to Sentence Classification.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

Mixed Link Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Adversarial Metric Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Unsupervised Multi-Domain Image Translation with Domain-Specific Encoders/Decoders.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

SESR: Single Image Super Resolution with Recursive Squeeze and Excitation Networks.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Teach to Hash: A Deep Supervised Hashing Framework with Data Selection.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Discrete Locally-Linear Preserving Hashing.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal.
CoRR, 2017

Unsupervised Multi-Domain Image Translation with Domain-Specific Encoders/Decoders.
CoRR, 2017

A Point and Line Features Based Method for Disturbed Surface Motion Estimation.
Proceedings of the Neural Information Processing - 24th International Conference, 2017

Research on Optimization and Application of Slope One Algorithm in Personalized Recommendation System.
Proceedings of the 9th International Conference on Machine Learning and Computing, 2017

2016
LightRNN: Memory and Computation-Efficient Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015
The Hierarchical Model to Ali Mobile Recommendation Competition.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

Deep Convolutional Neural Network and Multi-view Stacking Ensemble in Ali Mobile Recommendation Algorithm Competition: The Solution to the Winning of Ali Mobile Recommendation Algorithm.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

2013
LPT Optimization Algorithm in the Nuclear Environment Image Monitoring.
J. Softw., 2013

