Kun Li

Orcid: 0000-0001-5083-2145

Affiliations:
  • Hefei University of Technology, School of Computer Science and Information Engineering, China


According to our database1, Kun Li authored at least 46 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Unified Static and Dynamic Network: Efficient Temporal Filtering for Video Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2025

Task-Generalized Adaptive Cross-Domain Learning for Multimodal Image Fusion.
CoRR, August, 2025

Motion Matters: Motion-guided Modulation Network for Skeleton-based Micro-Action Recognition.
CoRR, July, 2025

Online Micro-gesture Recognition Using Data Augmentation and Spatial-Temporal Attention.
CoRR, July, 2025

MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion.
CoRR, July, 2025

Temporal Boundary Awareness Network for Repetitive Action Counting.
ACM Trans. Multim. Comput. Commun. Appl., April, 2025

The Tenth NTIRE 2025 Image Denoising Challenge Report.
CoRR, April, 2025

A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli.
CoRR, March, 2025

BVINet: Unlocking Blind Video Inpainting with Zero Annotations.
CoRR, February, 2025

Prompt-Aware Controllable Shadow Removal.
CoRR, January, 2025

Exploiting EfficientSAM and Temporal Coherence for Audio-Visual Segmentation.
IEEE Trans. Multim., 2025

Repetitive Action Counting With Hybrid Temporal Relation Modeling.
IEEE Trans. Multim., 2025

Leveraging vision-language prompts for real-world image restoration and enhancement.
Comput. Vis. Image Underst., 2025

Improving long-tailed pest classification using diffusion model-based data augmentation.
Comput. Electron. Agric., 2025

Exploiting Ensemble Learning for Cross-View Isolated Sign Language Recognition.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

Temporal-Frequency State Space Duality: An Efficient Paradigm for Speech Emotion Recognition.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Patch-level Sounding Object Tracking for Audio-Visual Question Answering.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Depth Matters: Spatial Proximity-Based Gaze Cone Generation for Gaze Following in Wild.
ACM Trans. Multim. Comput. Commun. Appl., November, 2024

Dual-Path TokenLearner for Remote Photoplethysmography-Based Physiological Measurement With Facial Videos.
IEEE Trans. Comput. Soc. Syst., June, 2024

Benchmarking Micro-Action Recognition: Dataset, Methods, and Applications.
IEEE Trans. Circuits Syst. Video Technol., 2024

MMAD: Multi-label Micro-Action Detection in Videos.
CoRR, 2024

Micro-gesture Online Recognition using Learnable Query Points.
CoRR, 2024

Low-light wheat image enhancement using an explicit inter-channel sparse transformer.
Comput. Electron. Agric., 2024

Joint Spatial-Temporal Modeling and Contrastive Learning for Self-supervised Heart Rate Measurement.
Proceedings of the 3rd Vision-based Remote Physiological Signal Sensing Challenge & Workshop (RePSS 2024) co-located with the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

Repetitive Action Counting with Feature Interaction Enhancement and Adaptive Gate Fusion.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

Cluster-Phys: Facial Clues Clustering Towards Efficient Remote Physiological Measurement.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Maskable Retentive Network for Video Moment Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MAC 2024: Micro-Action Analysis Grand Challenge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Micro-gesture Online Recognition using Learnable Query Points.
Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

Prototype Learning for Micro-gesture Classification.
Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

Frequency Decoupling for Motion Magnification Via Multi-Level Isomorphic Architecture.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Transformer-Based Visual Grounding with Cross-Modality Interaction.
ACM Trans. Multim. Comput. Commun. Appl., November, 2023

ViGT: proposal-free video grounding with a learnable token in the transformer.
Sci. China Inf. Sci., October, 2023

Spatiotemporal contrastive modeling for video moment retrieval.
World Wide Web (WWW), July, 2023

Dual-Path Temporal Map Optimization for Make-up Temporal Video Grounding.
CoRR, 2023

Dual-path TokenLearner for Remote Photoplethysmography-based Physiological Measurement with Facial Videos.
CoRR, 2023

Exploiting Diverse Feature for Multimodal Sentiment Analysis.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Data Augmentation for Human Behavior Analysis in Multi-Person Conversations.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multi-modality Fusion for Emotion Recognition in Videos.
Proceedings of IJCAI-2023 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2023) co-located with 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification.
Proceedings of IJCAI-2023 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2023) co-located with 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2021
Proposal-Free Video Grounding with Contextual Pyramid Network.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
AOPNet: Anchor Offset Prediction Network for Temporal Action Proposal Generation.
Proceedings of the IEEE International Conference on Signal Processing, 2020

2019
DADNet: Dilated-Attention-Deformable ConvNet for Crowd Counting.
Proceedings of the 27th ACM International Conference on Multimedia, 2019


  Loading...