Xihan Wei
According to our database, Xihan Wei authored at least 20 papers between 2019 and 2025.
Bibliography
2025
CoRR, June, 2025
CoRR, June, 2025
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization.
CoRR, May, 2025
ActionArt: Advancing Multimodal Large Models for Fine-Grained Human-Centric Video Understanding.
CoRR, April, 2025
CoRR, March, 2025
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning.
CoRR, March, 2025
HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding.
CoRR, January, 2025
Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis.
CoRR, January, 2025
Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness.
CoRR, January, 2025
LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding.
CoRR, January, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
2022
Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
2019