Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models.

Shicheng Li

Proceedings of the Computer Vision - ECCV 2024, 2024

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TempCompass: Do Video LLMs Really Understand Videos?

Proceedings of the Findings of the Association for Computational Linguistics, 2024

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond.

CoRR, 2023

M³IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning.

CoRR, 2023

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Delving into the Openness of CLIP.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Rethinking the Openness of CLIP.

CoRR, 2022

2021

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark.

CoRR, 2021

Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification.

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Dynamic Knowledge Distillation for Pre-trained Language Models.

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Learning Relation Alignment for Calibrated Cross-modal Retrieval.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Accelerating Pre-trained Language Models via Calibrated Cascade.
