Jingqi Tong

According to our database¹, Jingqi Tong authored at least 20 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Model.

[BibT_eX]

[DOI]

CoRR, April, 2026

AI Can Learn Scientific Taste.

[BibT_eX]

[DOI]

CoRR, March, 2026

SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents.

[BibT_eX]

[DOI]

CoRR, February, 2026

MOVA: Towards Scalable and Synchronized Video-Audio Generation.

[BibT_eX]

[DOI]

CoRR, February, 2026

OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment.

[BibT_eX]

[DOI]

CoRR, January, 2026

LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

VideoPro: Adaptive Program Reasoning for Long Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Beyond Scaling: Measuring and Predicting the Upper Bound of Knowledge Retention in Language Model Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Models.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm.

[BibT_eX]

[DOI]

CoRR, November, 2025

Adaptive Fast-and-Slow Visual Program Reasoning for Long-Form VideoQA.

[BibT_eX]

[DOI]

CoRR, September, 2025

LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models.

[BibT_eX]

[DOI]

CoRR, August, 2025

SpeechRole: A Large-Scale Dataset and Benchmark for Evaluating Speech Role-Playing Agents.

[BibT_eX]

[DOI]

CoRR, August, 2025

Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training.

[BibT_eX]

[DOI]

CoRR, February, 2025

Understanding Parametric and Contextual Knowledge Reconciliation within Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024

Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Jingqi Tong

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...