Mingrui Lao

Orcid: 0000-0001-8413-7220

According to our database1, Mingrui Lao authored at least 41 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
PAL: Prompting analytic learning with missing modality for multi-modal class-incremental learning.
Pattern Recognit., 2026

FSKD: A few-shot knowledge distillation framework for object tracking.
Pattern Recognit., 2026

Closed-loop correction reprogramming for fine-grained visual prompting.
Neural Networks, 2026

Overcoming semantic manifold deviation for robust multimodal violence detection with incomplete modality.
Expert Syst. Appl., 2026

OTKD: A general knowledge distillation pipeline for object tracking.
Expert Syst. Appl., 2026

Rep Deep & Machine Learning: Exemplar-Free Continual Video Action Recognition via Slow-Fast Collaborative Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Remote Sensing Image Change Captioning: A Comprehensive Review.
Int. J. Multim. Inf. Retr., September, 2025

PAUL: Uncertainty-Guided Partition and Augmentation for Robust Cross-View Geo-Localization under Noisy Correspondence.
CoRR, August, 2025

Deep Learning For Point Cloud Denoising: A Survey.
CoRR, August, 2025

FCAT: Federated causal adversarial training.
Knowl. Based Syst., 2025

A survey of security threats in federated learning.
Complex Intell. Syst., 2025

Boosting Discriminability for Robust Multimodal Entity Linking with Visual Modality Missing.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Learning from Peers: Collaborative Ensemble Adversarial Training.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

TrustCLIP: Learning from Noisy Labels via Semantic Label Verification and Trust-aligned Gradient Projection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

EventLip: Enhancing Event-Based Lip Reading via Frequency-Aware Spatiotemporal Hypergraph Modeling.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Choose Your Expert: Uncertainty-Guided Expert Selection for Continual Deepfake Detection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Event-Based Binary Neural Networks for Efficient and Accurate Lip Reading.
Proceedings of 2025 2nd International Conference on Machine Learning and Intelligent Computing (MLIC 2025), 2025

Boosting Adversarial Robustness Through Structure-Guided Adversarial Distillation.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Generalization-Preserved Learning: Closing the Backdoor to Catastrophic Forgetting in Continual Deepfake Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Multi-Modal Entities Matter: Benchmarking Multi-Modal Entity Alignment.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
RDAT: an efficient regularized decoupled adversarial training mechanism.
Int. J. Multim. Inf. Retr., June, 2024

EIOA: A computing expectation-based influence evaluation method in weighted hypergraphs.
Inf. Process. Manag., 2024

Language Without Borders: A Dataset and Benchmark for Code-Switching Lip Reading.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

PD-Refiner: An Underlying Surface Inheritance Refiner with Adaptive Edge-Aware Supervision for Point Cloud Denoising.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MMAL: Multi-Modal Analytic Learning for Exemplar-Free Audio-Visual Class Incremental Tasks.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Maximizing Feature Distribution Variance for Robust Neural Networks.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Balanced Confidence Calibration for Graph Neural Networks.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Binary-Temporal Convolutional Neural Network for Multi-Class Auditory Spatial Attention Detection.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Boosting Adversarial Robustness Distillation Via Hybrid Decomposed Knowledge.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Few-shot and meta-learning methods for image understanding: a survey.
Int. J. Multim. Inf. Retr., December, 2023

Dual selective knowledge transfer for few-shot classification.
Appl. Intell., November, 2023

Lifelong Fine-Grained Image Retrieval.
IEEE Trans. Multim., 2023

FedVQA: Personalized Federated Visual Question Answering over Heterogeneous Scenes.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multi-Domain Lifelong Visual Question Answering via Self-Critical Distillation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

COCA: COllaborative CAusal Regularization for Audio-Visual Question Answering.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
VQA-BC: Robust Visual Question Answering Via Bidirectional Chaining.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Multi-stage hybrid embedding fusion network for visual question answering.
Neurocomputing, 2021

From Superficial to Deep: Language Bias driven Curriculum Learning for Visual Question Answering.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Language Prior Based Focal Loss for Visual Question Answering.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2018
Multimodal Local Perception Bilinear Pooling for Visual Question Answering.
IEEE Access, 2018

Cross-Modal Multistep Fusion Network With Co-Attention for Visual Question Answering.
IEEE Access, 2018


  Loading...