Proceedings of the 3rd Vision-based Remote Physiological Signal Sensing Challenge & Workshop (RePSS 2024) co-located with the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

Repetitive Action Counting with Feature Interaction Enhancement and Adaptive Gate Fusion.

Jiazhen Zhang

Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024

Cluster-Phys: Facial Clues Clustering Towards Efficient Remote Physiological Measurement.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Maskable Retentive Network for Video Moment Retrieval.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MAC 2024: Micro-Action Analysis Grand Challenge.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Micro-gesture Online Recognition using Learnable Query Points.

Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

Prototype Learning for Micro-gesture Classification.

Frequency Decoupling for Motion Magnification Via Multi-Level Isomorphic Architecture.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Transformer-Based Visual Grounding with Cross-Modality Interaction.

ACM Trans. Multim. Comput. Commun. Appl., November, 2023

ViGT: proposal-free video grounding with a learnable token in the transformer.

Kun Li

Dan Guo

Meng Wang

Sci. China Inf. Sci., October, 2023

Spatiotemporal contrastive modeling for video moment retrieval.

World Wide Web (WWW), July, 2023

Dual-Path Temporal Map Optimization for Make-up Temporal Video Grounding.

CoRR, 2023

Dual-path TokenLearner for Remote Photoplethysmography-based Physiological Measurement with Facial Videos.

CoRR, 2023

Exploiting Diverse Feature for Multimodal Sentiment Analysis.

Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023

Data Augmentation for Human Behavior Analysis in Multi-Person Conversations.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Multi-modality Fusion for Emotion Recognition in Videos.

Proceedings of IJCAI-2023 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2023) co-located with 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

Joint Skeletal and Semantic Embedding Loss for Micro-gesture Classification.

2021

Proposal-Free Video Grounding with Contextual Pyramid Network.
