Ming Li

Orcid: 0000-0002-7852-0159

Affiliations:
  • Guangdong Laboratory of Artificial Intelligence and Digital Economy, Shenzhen, China
  • National University of Singapore, Institute of Data Science, Singapore (PhD 2024)
  • Worcester Polytechnic Institute, Worcester, MA, USA (2019 - 2021)
  • University of North Carolina at Chapel Hill, Chapel Hill, NC, USA (2018 - 2019)


According to our database1, Ming Li authored at least 32 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
VSE-MOT: Multi-Object Tracking in Low-Quality Video Scenes Guided by Visual Semantic Enhancement.
CoRR, September, 2025

FedVLA: Federated Vision-Language-Action Learning with Dual Gating Mixture-of-Experts for Robotic Manipulation.
CoRR, August, 2025

L3A: Label-Augmented Analytic Adaptation for Multi-Label Class Incremental Learning.
CoRR, June, 2025

Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking.
CoRR, May, 2025

PVChat: Personalized Video Chat with One-Shot Learning.
CoRR, March, 2025

Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking.
CoRR, March, 2025

Semantic Shift Estimation via Dual-Projection and Classifier Reconstruction for Exemplar-Free Class-Incremental Learning.
CoRR, March, 2025

X-SG<sup>2</sup>S: Safe and Generalizable Gaussian Splatting with X-dimensional Watermarks.
CoRR, February, 2025

Correction: Instant3D: Instant Text-to-3D Generation.
Int. J. Comput. Vis., January, 2025

Uncertainty Quantification via Hölder Divergence for Multi-View Representation Learning.
IEEE Trans. Multim., 2025

Uncertainty Quantification for Incomplete Multi-View Data Using Divergence Measures.
IEEE Trans. Image Process., 2025

ColonNeRF: High-fidelity neural reconstruction of long colonoscopy.
Neurocomputing, 2025

Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

L3A: Label-Augmented Analytic Adaptation for Multi-Label Class Incremental Learning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

LV-VTON: Long-Video Virtual Try-On via Enhanced Visual Autoregressive Modeling.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

OmniStyle: Attention-Optimized Global and Local Image Stylization with Diffusion Model Inversion.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

DEP-SLAM: A Dynamic Environment Perception SLAM System with Large Language Models.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

EventGPT: Event Stream Understanding with Multimodal Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Instant3D: Instant Text-to-3D Generation.
Int. J. Comput. Vis., October, 2024

Semi-Supervised Disease Classification Based on Limited Medical Image Data.
IEEE J. Biomed. Health Informatics, March, 2024

DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition.
IEEE Trans. Multim., 2024

EFTViT: Efficient Federated Training of Vision Transformers with Masked Images on Resource-Constrained Edge Devices.
CoRR, 2024

Uncertainty Quantification via Hölder Divergence for Multi-View Representation Learning.
CoRR, 2024

2023
FakePoI: A Large-Scale Fake Person of Interest Video Detection Benchmark and a Strong Baseline.
IEEE Trans. Circuits Syst. Video Technol., November, 2023

Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification.
IEEE Trans. Multim., 2023

ColonNeRF: Neural Radiance Fields for High-Fidelity Long-Sequence Colonoscopy Reconstruction.
CoRR, 2023

Self-supervised Geometric Features Discovery via Interpretable Attentio for Vehicle Re-Identification and Beyond (Complete Version).
CoRR, 2023

STPrivacy: Spatio-Temporal Tubelet Sparsification and Anonymization for Privacy-preserving Action Recognition.
CoRR, 2023

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2021
Self-supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
TreeRNN: Topology-Preserving Deep GraphEmbedding and Learning.
CoRR, 2020

TreeRNN: Topology-Preserving Deep Graph Embedding and Learning.
Proceedings of the 25th International Conference on Pattern Recognition, 2020


  Loading...