Li Liu

Orcid: 0000-0002-4497-0135

Affiliations:
  • Shenzhen Research Institute of Big Data, China
  • University Grenoble-Alpes, GIPSA-lab, Grenoble, France (PhD 2018)


According to our database1, Li Liu authored at least 91 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
COST: Contrastive one-stage transformer for vision-language small object tracking.
Inf. Fusion, 2026

2025
BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning.
Int. J. Comput. Vis., August, 2025

Failure Cases Are Better Learned But Boundary Says Sorry: Facilitating Smooth Perception Change for Accuracy-Robustness Trade-Off in Adversarial Training.
CoRR, August, 2025

UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation.
CoRR, June, 2025

AudioGenie: A Training-Free Multi-Agent Framework for Diverse Multimodality-to-Multiaudio Generation.
CoRR, May, 2025

FauForensics: Boosting Audio-Visual Deepfake Detection with Facial Action Units.
CoRR, May, 2025

BackdoorDM: A Comprehensive Benchmark for Backdoor Learning in Diffusion Model.
CoRR, February, 2025

Activation Gradient based Poisoned Sample Detection Against Backdoor Attacks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Learning Class Unique Features in Fine-Grained Visual Classification.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Reliable Imputed-Sample Assisted Vertical Federated Learning.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Fine-portraitist: Visualizing the Speaker's Face Portrait during Speech Listening.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MotionComposer: Enhancing Rhythmic Music Generation with Adaptive Retrieval Reference.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Fusing Pruned and Backdoored Models: Optimal Transport-based Data-free Backdoor Mitigation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
FHVAC: Feature-Level Hybrid Video Adaptive Configuration for Machine-Centric Live Streaming.
IEEE Trans. Parallel Distributed Syst., May, 2024

Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Less confidence, less forgetting: Learning with a humbler teacher in exemplar-free Class-Incremental learning.
Neural Networks, 2024

New Paradigm of Adversarial Training: Breaking Inherent Trade-Off between Accuracy and Robustness via Dummy Classes.
CoRR, 2024

Towards Underwater Camouflaged Object Tracking: An Experimental Evaluation of SAM and SAM 2.
CoRR, 2024

Seeing Your Speech Style: A Novel Zero-Shot Identity-Disentanglement Face-based Voice Conversion.
CoRR, 2024

Prior-free Balanced Replay: Uncertainty-guided Reservoir Sampling for Long-Tailed Continual Learning.
CoRR, 2024

Segment Anything for Videos: A Systematic Survey.
CoRR, 2024

A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights.
CoRR, 2024

TIMA: Text-Image Mutual Awareness for Balancing Zero-Shot Adversarial Robustness and Generalization Ability.
CoRR, 2024

Awesome Multi-modal Object Tracking.
CoRR, 2024

WPDA: Frequency-based Backdoor Attack with Wavelet Packet Decomposition.
CoRR, 2024

Content-Aware Efficient Learner for Audio-Visual Emotion Recognition.
Proceedings of the Social Robotics - 16th International Conference, 2024

Cued Speech-Integrated Audio-Visual Variational Autoencoder for Speech Enhancement.
Proceedings of the Social Robotics - 16th International Conference, 2024

WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Bridge to Non-Barrier Communication: Gloss-Prompted Fine-Grained Cued Speech Gesture Generation with Diffusion Model.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Leveraging Noisy Labels of Nearest Neighbors for Label Correction and Sample Selection.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
WebUAV-3M: A Benchmark for Unveiling the Power of Million-Scale Deep UAV Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Generating and Weighting Semantically Consistent Sample Pairs for Ultrasound Contrastive Learning.
IEEE Trans. Medical Imaging, May, 2023

Defenses in Adversarial Machine Learning: A Survey.
CoRR, 2023

Realistic Speech-to-Face Generation with Speech-Conditioned Latent Diffusion Model with Face Prior.
CoRR, 2023

A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation.
CoRR, 2023

Robust Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers.
CoRR, 2023

X-IQE: eXplainable Image Quality Evaluation for Text-to-Image Generation with Visual Large Language Models.
CoRR, 2023

A Comprehensive Survey on Segment Anything Model for Vision and Beyond.
CoRR, 2023

Adversarial Machine Learning: A Systematic Survey of Backdoor Attack, Weight Attack and Adversarial Example.
CoRR, 2023

FedAds: A Benchmark for Privacy-Preserving CVR Estimation with Vertical Federated Learning.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MetaLR: Meta-tuning of Learning Rates for Transfer Learning in Medical Imaging.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Global Balanced Experts for Federated Long-Tailed Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Memory-Augmented Contrastive Learning for Talking Head Generation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Two-Stream Joint-Training for Speaker Independent Acoustic-to-Articulatory Inversion.
Proceedings of the IEEE International Conference on Acoustics, 2023

Spatio-Temporal Structure Consistency for Semi-Supervised Medical Image Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

TAOTF: A Two-Stage Approximately Orthogonal Training Framework in Deep Neural Networks.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

2022
Rethinking Two Consensuses of the Transferability in Deep Learning.
CoRR, 2022

MetaLR: Layer-wise Learning Rate based on Meta-Learning for Adaptively Fine-tuning Medical Pre-trained Models.
CoRR, 2022

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking.
CoRR, 2022

Pre-activation Distributions Expose Backdoor Neurons.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Objective Hand Complexity Comparison between Two Mandarin Chinese Cued Speech Systems.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

MVNet: Memory Assistance and Vocal Reinforcement Network for Speech Enhancement.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Residual-Guided Personalized Speech Synthesis based on Face Image.
Proceedings of the IEEE International Conference on Acoustics, 2022

Acoustic-to-Articulatory Inversion Based on Speech Decomposition and Auxiliary Feature.
Proceedings of the IEEE International Conference on Acoustics, 2022

Data-Free Backdoor Removal Based on Channel Lipschitzness.
Proceedings of the Computer Vision - ECCV 2022, 2022


Boosting Black-Box Attack with Partially Transferred Conditional Adversarial Distribution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

HiCo: Hierarchical Contrastive Learning for Ultrasound Video Model Pretraining.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Re-Synchronization Using the Hand Preceding Model for Multi-Modal Fusion in Automatic Continuous Cued Speech Recognition.
IEEE Trans. Multim., 2021

A hybrid framework for brain tissue segmentation in magnetic resonance images.
Int. J. Imaging Syst. Technol., 2021

Research on distributed logistics scheduling method for workshop production based on hybrid particle swarm optimisation.
Int. J. Manuf. Technol. Manag., 2021

USCL: Pretraining Deep Ultrasound Image Diagnosis Model Through Video Contrastive Representation Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Multi-Modal Active Learning For Automatic Liver Fibrosis Diagnosis Based On Ultrasound Shear Wave Elastography.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Cross-Modal Knowledge Distillation Method for Automatic Cued Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

An Attention Self-Supervised Contrastive Learning Based Three-Stage Model for Hand Shape Feature Representation in Cued Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

The Ninth Visual Object Tracking VOT2021 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Self-Supervised Depth Estimation Via Implicit Cues from Videos.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Effective Sample Pair Generation for Ultrasound Video Contrastive Representation Learning.
CoRR, 2020

Towards Class-Specific Unit.
CoRR, 2020

Attention-based Residual Speech Portrait Model for Speech to Face Generation.
CoRR, 2020

Self-Supervised Joint Learning Framework of Depth Estimation via Implicit Cues.
CoRR, 2020

A New Re-synchronization Method based Multi-modal Fusion for Automatic Continuous Cued Speech Recognition.
CoRR, 2020

Semi-Supervised Active Learning for COVID-19 Lung Ultrasound Multi-symptom Classification.
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
Hierarchical Clustering Based Band Selection Algorithm for Hyperspectral Face Recognition.
IEEE Access, 2019

A Light-Weight Context-Aware Self-Attention Model for Skin Lesion Segmentation.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Automatic Detection of the Temporal Segmentation of Hand Movements in British English Cued Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A novel resynchronization procedure for hand-lips fusion applied to continuous French Cued Speech recognition.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Visual Recognition of Continuous Cued Speech Using a Tandem CNN-HMM Approach.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Automatic Temporal Segmentation of Hand Movements for Hand Positions Recognition in French Cued Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Inner lips feature extraction based on CLNF with hybrid dynamic template for Cued Speech.
EURASIP J. Image Video Process., 2017

Automatic dynamic template tracking of inner lips based on CLNF.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Inner Lips Parameter Estimation based on Adaptive Ellipse Model.
Proceedings of the 14th International Conference on Auditory-Visual Speech Processing, 2017

2016
Cancer Feature Selection and Classification Using a Binary Quantum-Behaved Particle Swarm Optimization and Support Vector Machine.
Comput. Math. Methods Medicine, 2016

Extraction automatique de contour de lèvre à partir du modèle CLNF (Automatic lip contour extraction using CLNF model).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 1 : JEP, 2016


  Loading...