Detao Bai
Bibliography
2025
CoRR, June, 2025
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization.
CoRR, May, 2025
HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding.
CoRR, January, 2025
Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis.
CoRR, January, 2025