Zeyi Sun

Orcid: 0009-0008-4264-4281

Affiliations:

Shanghai Jiao Tong University, Shanghai, China

According to our database, Zeyi Sun authored at least 15 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number of five.
Erdős number of four.

Bibliography

2026

RAR: Retrieving and Ranking Augmented MLLMs for Visual Recognition.

IEEE Trans. Image Process., 2026

2025

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning.

CoRR, August, 2025

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience.

CoRR, August, 2025

RelightVid: Temporal-Consistent Diffusion Model for Video Relighting.

CoRR, January, 2025

Bootstrap3D: Improving Multi-View Diffusion Model with Synthetic Data.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

X-Prompt: Generalizable Auto-Regressive Visual Learning with In-Context Prompting.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Visual-RFT: Visual Reinforcement Fine-Tuning.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024

X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models.

CoRR, 2024

V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results.

CoRR, 2024

Bootstrap3D: Improving 3D Content Creation with Synthetic Data.

CoRR, 2024

Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials.

CoRR, 2024

Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Alpha-CLIP: A CLIP Model Focusing on Wherever you Want.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases.

CoRR, 2023
