Yuansheng Ni

According to our database, Yuansheng Ni authored at least 14 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding.

CoRR, March, 2026

2025

VisCoder2: Building Multi-Language Visualization Coding Agents.

CoRR, October, 2025

PhyX: Does Your Model Have the "Wits" for Physical Reasoning?

CoRR, May, 2025

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.

CoRR, 2024

A Comprehensive Study of Knowledge Editing for Large Language Models.

CoRR, 2024

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

GenAI Arena: An Open Evaluation Platform for Generative Models.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2024
