Yifan Du

Affiliations:

Renmin University of China, Gaoling School of Artificial Intelligence, China

According to our database¹, Yifan Du authored at least 18 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

A Survey of Large Language Models.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., December, 2026

Improving Vision-language Models with Perception-centric Process Reward Models.

[BibT_eX]

[DOI]

CoRR, April, 2026

Towards Long-horizon Agentic Multimodal Search.

[BibT_eX]

[DOI]

CoRR, April, 2026

Beyond the Last Frame: Process-aware Evaluation for Generative Video Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

VIPER: Process-aware Evaluation for Generative Video Reasoning.

[BibT_eX]

[DOI]

CoRR, December, 2025

Revisiting the Necessity of Lengthy Chain-of-Thought in Vision-centric Reasoning Generalization.

[BibT_eX]

[DOI]

CoRR, November, 2025

AVC-DPO: Aligned Video Captioning via Direct Preference Optimization.

[BibT_eX]

[DOI]

CoRR, July, 2025

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM.

[BibT_eX]

[DOI]

CoRR, January, 2025

Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Exploring the Design Space of Visual Context Representation in Video MLLMs.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024

Towards Event-oriented Long Video Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs.

[BibT_eX]

[DOI]

CoRR, 2024

2023

A Survey of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Evaluating Object Hallucination in Large Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Learning to Imagine: Visually-Augmented Natural Language Generation.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Zero-shot Visual Question Answering with Language Model Feedback.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

A Survey of Vision-Language Pre-Trained Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Yifan Du

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...