Kaishen Yuan

Orcid: 0009-0008-2353-2436

According to our database1, Kaishen Yuan authored at least 20 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Z<sup>2</sup>-Sampling: Zero-Cost Zigzag Trajectories for Semantic Alignment in Diffusion Models.
CoRR, April, 2026

TurboEvolve: Towards Fast and Robust LLM-Driven Program Evolution.
CoRR, April, 2026

AULLM++: Structural Reasoning with Large Language Models for Micro-Expression Recognition.
CoRR, March, 2026

ΔVLA: Prior-Guided Vision-Language-Action Models via World Knowledge Variation.
CoRR, March, 2026

Multi-Granularity Facial Emotional Representation With Unlabeled Data and Textual Supervision.
IEEE Trans. Image Process., 2026

2025
POLARIS: Projection-Orthogonal Least Squares for Robust and Adaptive Inversion in Diffusion Models.
CoRR, December, 2025

CoEmoGen: Towards Semantically-Coherent and Scalable Emotional Image Content Generation.
CoRR, August, 2025

MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis.
CoRR, June, 2025

ANT: Adaptive Neural Temporal-Aware Text-to-Motion Model.
CoRR, June, 2025

FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning.
CoRR, May, 2025

scMMAE: masked cross-attention network for single-cell multimodal omics fusion to enhance unimodal omics.
Briefings Bioinform., January, 2025

Multi-Scale Promoted Self-Adjusting Correlation Learning for Facial Action Unit Detection.
IEEE Trans. Affect. Comput., 2025

FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

ANT: Adaptive Neural Temporal-Aware Text-to-Motion Model.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

AU-TTT: Vision Test-Time Training model for Facial Action Unit Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

AU-LLM: Micro-Expression Action Unit Detection via Enhanced LLM-Based Feature Fusion.
Proceedings of the Biometric Recognition - 19th Chinese Conference, 2025

2024
EMO-LLaMA: Enhancing Facial Emotion Understanding with Instruction Tuning.
CoRR, 2024

AUFormer: Vision Transformers Are Parameter-Efficient Facial Action Unit Detectors.
Proceedings of the Computer Vision - ECCV 2024, 2024

GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024


  Loading...