Zuwei Long

Orcid: 0009-0008-3827-2243

According to our database, Zuwei Long authored at least 10 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion.
CoRR, March, 2026

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision.
CoRR, January, 2026

2025
DeepTalk: Towards Seamless and Smart Speech Interaction with Adaptive Modality-Specific MoE.
CoRR, June, 2025

VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model.
CoRR, May, 2025

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her.
CoRR, January, 2025

VITA-Audio: Fast Interleaved Audio-Text Token Generation for Efficient Large Speech-Language Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Towards Universal Perception through Language-Guided Open-World Object Detection.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

2024
T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs.
CoRR, 2024

VITA: Towards Open-Source Interactive Omni Multimodal LLM.
CoRR, 2024

