Shuyuan Tu

Orcid: 0000-0002-4299-3114

According to our database, Shuyuan Tu authored at least 17 papers between 2022 and 2026.

Bibliography

2026
Baton: Explicit Semantic Blueprints for Joint Video-Audio Generation.
CoRR, May, 2026

Preference Score Distillation: Leveraging 2D Rewards to Align Text-to-3D Generation with Human Preference.
CoRR, March, 2026

ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation.
CoRR, February, 2026

Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding.
CoRR, January, 2026

2025
FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction.
CoRR, December, 2025

Stable Offline Hand-Eye Calibration for any Robot with Just One Mark.
CoRR, November, 2025

PersonaAnimator: Personalized Motion Transfer from Unconstrained Videos.
CoRR, August, 2025

StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation.
CoRR, August, 2025

StableAnimator++: Overcoming Pose Misalignment and Face Distortion for Human Image Animation.
CoRR, July, 2025

MotionFollower: Editing Video Motion via Score-Guided Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

StableAnimator: High-Quality Identity-Preserving Human Image Animation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion.
CoRR, 2024

SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion Recognition.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

ASKDetector: An AST-Semantic and Key Features Fusion based Code Comment Mismatch Detector.
Proceedings of the 32nd IEEE/ACM International Conference on Program Comprehension, 2024

MotionEditor: Editing Video Motion via Content-Aware Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Implicit Temporal Modeling with Learnable Alignment for Video Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Multiple Biological Granularities Network for Person Re-Identification.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

