Yunzhong He

Orcid: 0000-0002-5429-5372

According to our database¹, Yunzhong He authored at least 19 papers between 2016 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR.

Utkarsh Tyagi

Xingang Guo

MohammadHossein Rezaei

CoRR, May, 2026

Reward Hacking in Rubric-Based Reinforcement Learning.

Anas Mahmoud

MohammadHossein Rezaei

CoRR, May, 2026

SWE Atlas: Benchmarking Coding Agents Beyond Issue Resolution.

Johannes Baptist Mols

MohammadHossein Rezaei

Bing Liu

Brad Kenstler

Yunzhong He

CoRR, May, 2026

Pre-training Language Model for Friend Recommendation: A Case Study of Large Social Graph.

Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

Agentic Rubrics as Contextual Verifiers for SWE Agents.

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction.

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning.

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning.

CoRR, November, 2025

Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning.

Ernesto Gabriel Hernández Montoya

Rakshith Sharma Srinivasa

CoRR, October, 2025

Online Rubrics Elicitation from Pairwise Comparisons.

MohammadHossein Rezaei

CoRR, October, 2025

TutorBench: A Benchmark To Assess Tutoring Capabilities Of Large Language Models.

Rakshith S. Srinivasa

Guillermo Mangialardi

Charmaine Ng

Ed-Yeremai Hernandez-Cardona

CoRR, October, 2025

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training.

Junkai Zhang

Zihao Wang

Lin Gui

Swarnashree Mysore Sathyendra

CoRR, September, 2025

2023

Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions.

Hui Yang

Sifu Yue

Yunzhong He

CoRR, 2023

HierCat: Hierarchical Query Categorization from Weakly Supervised Data at Facebook Marketplace.

Companion Proceedings of the ACM Web Conference 2023, 2023

Que2Engage: Embedding-based Retrieval for Relevant and Engaging Products at Facebook Marketplace.

Companion Proceedings of the ACM Web Conference 2023, 2023

2021

Que2Search: Fast and Accurate Query and Document Understanding for Search at Facebook.

Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

2020

A Social Search Model for Large Scale Social Networks.

CoRR, 2020

2017

Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics.

Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

2016

Jointly Learning Grounded Task Structures from Language Instruction and Visual Demonstration.

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
