Lingsi Zhu

Orcid: 0009-0002-6860-2772

According to our database¹, Lingsi Zhu authored at least 13 papers between 2025 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Anchoring Emotions in Text: Robust Multimodal Fusion for Mimicry Intensity Estimation.

[BibT_eX]

[DOI]

CoRR, March, 2026

Hierarchical Granularity Alignment and State Space Modeling for Robust Multimodal AU Detection in the Wild.

[BibT_eX]

[DOI]

CoRR, March, 2026

Solution to the 10th ABAW Expression Recognition Challenge: A Robust Multimodal Framework with Safe Cross-Attention and Modality Dropout.

[BibT_eX]

[DOI]

CoRR, March, 2026

2025

Solution for 8th Competition on Affective & Behavior Analysis in-the-wild.

[BibT_eX]

[DOI]

CoRR, March, 2025

Technical Approach for the EMI Challenge in the 8th Affective Behavior Analysis in-the-Wild Competition.

[BibT_eX]

[DOI]

CoRR, March, 2025

DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance.

[BibT_eX]

[DOI]

CoRR, March, 2025

Heterogeneous Encoder Fusion with KAN Decoder for Group Engagement Modeling via 8× Sliding Pipelines.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

HierMEQA: A Relationship-Aware Hierarchical Framework for Consistent Micro-Expression Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Hierarchical Multi-Feature Extraction and Aggregation for Micro-Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

LVLM-HBA: Large Vision-Language Model with Cross-Modal Alignment for Human Behavior Analysis.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Robust Stage-Wise LVLM Adaptation: Multi-Phase Prompt Lora Fine-tuning for Compound Expression Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Towards Robust Multimodal AU Detection: STN-Enhanced Visual Encoding and Audio-Visual Spatial-Temporal Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Dual-Stage Cross-Modal Network with Dynamic Feature Fusion for Emotional Mimicry Intensity Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Lingsi Zhu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...