Lingsi Zhu

Orcid: 0009-0002-6860-2772

According to our database, Lingsi Zhu authored at least 13 papers between 2025 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
Anchoring Emotions in Text: Robust Multimodal Fusion for Mimicry Intensity Estimation.
CoRR, March, 2026

Hierarchical Granularity Alignment and State Space Modeling for Robust Multimodal AU Detection in the Wild.
CoRR, March, 2026

Solution to the 10th ABAW Expression Recognition Challenge: A Robust Multimodal Framework with Safe Cross-Attention and Modality Dropout.
CoRR, March, 2026

2025
Solution for 8th Competition on Affective & Behavior Analysis in-the-wild.
CoRR, March, 2025

Technical Approach for the EMI Challenge in the 8th Affective Behavior Analysis in-the-Wild Competition.
CoRR, March, 2025

DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance.
CoRR, March, 2025

Heterogeneous Encoder Fusion with KAN Decoder for Group Engagement Modeling via 8× Sliding Pipelines.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

HierMEQA: A Relationship-Aware Hierarchical Framework for Consistent Micro-Expression Visual Question Answering.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Hierarchical Multi-Feature Extraction and Aggregation for Micro-Action Recognition.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

LVLM-HBA: Large Vision-Language Model with Cross-Modal Alignment for Human Behavior Analysis.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Robust Stage-Wise LVLM Adaptation: Multi-Phase Prompt Lora Fine-tuning for Compound Expression Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Towards Robust Multimodal AU Detection: STN-Enhanced Visual Encoding and Audio-Visual Spatial-Temporal Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Dual-Stage Cross-Modal Network with Dynamic Feature Fusion for Emotional Mimicry Intensity Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

