Yifan Du

Affiliations:
  • Renmin University of China, Gaoling School of Artificial Intelligence, China


According to our database, Yifan Du authored at least 17 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
A Survey of Large Language Models.
Frontiers Comput. Sci., December, 2026

Improving Vision-language Models with Perception-centric Process Reward Models.
CoRR, April, 2026

Towards Long-horizon Agentic Multimodal Search.
CoRR, April, 2026

2025
VIPER: Process-aware Evaluation for Generative Video Reasoning.
CoRR, December, 2025

Revisiting the Necessity of Lengthy Chain-of-Thought in Vision-centric Reasoning Generalization.
CoRR, November, 2025

AVC-DPO: Aligned Video Captioning via Direct Preference Optimization.
CoRR, July, 2025

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM.
CoRR, January, 2025

Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Exploring the Design Space of Visual Context Representation in Video MLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
Towards Event-oriented Long Video Understanding.
CoRR, 2024

Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs.
CoRR, 2024

2023
A Survey of Large Language Models.
CoRR, 2023

Evaluating Object Hallucination in Large Vision-Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Learning to Imagine: Visually-Augmented Natural Language Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Zero-shot Visual Question Answering with Language Model Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
A Survey of Vision-Language Pre-Trained Models.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

