Enxin Song

According to our database1, Enxin Song authored at least 10 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MovieChat+: Question-Aware Sparse Memory for Long Video Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2026

2025
VideoNSA: Native Sparse Attention Scales Video Understanding.
CoRR, October, 2025

AuroraLong: Bringing RNNs Back to Efficient Open-Ended Video Understanding.
CoRR, July, 2025

AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Bringing RNNs Back to Efficient Open-Ended Video Understanding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

2024
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Devil in the Number: Towards Robust Multi-modality Data Filter.
CoRR, 2023

MovieChat: From Dense Token to Sparse Memory for Long Video Understanding.
CoRR, 2023


  Loading...