Zuwei Long

ORCID: 0009-0008-3827-2243

According to our database, Zuwei Long authored at least 10 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion.

CoRR, March, 2026

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision.

CoRR, January, 2026

2025

DeepTalk: Towards Seamless and Smart Speech Interaction with Adaptive Modality-Specific MoE.

CoRR, June, 2025

VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model.

CoRR, May, 2025

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her.

CoRR, January, 2025

VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Towards Universal Perception through Language-Guided Open-World Object Detection.

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

2024

T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs.

CoRR, 2024

VITA: Towards Open-Source Interactive Omni Multimodal LLM.

CoRR, 2024
