Yunzhong He

Orcid: 0000-0002-5429-5372

According to our database, Yunzhong He authored at least 19 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR.
CoRR, May, 2026

Reward Hacking in Rubric-Based Reinforcement Learning.
CoRR, May, 2026

SWE Atlas: Benchmarking Coding Agents Beyond Issue Resolution.
CoRR, May, 2026

Pre-training Language Model for Friend Recommendation: A Case Study of Large Social Graph.
Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

Agentic Rubrics as Contextual Verifiers for SWE Agents.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning.
CoRR, November, 2025

Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning.
CoRR, October, 2025

Online Rubrics Elicitation from Pairwise Comparisons.
CoRR, October, 2025

TutorBench: A Benchmark To Assess Tutoring Capabilities Of Large Language Models.
CoRR, October, 2025

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training.
CoRR, September, 2025

2023
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions.
CoRR, 2023

HierCat: Hierarchical Query Categorization from Weakly Supervised Data at Facebook Marketplace.
Companion Proceedings of the ACM Web Conference 2023, 2023

Que2Engage: Embedding-based Retrieval for Relevant and Engaging Products at Facebook Marketplace.
Companion Proceedings of the ACM Web Conference 2023, 2023

2021
Que2Search: Fast and Accurate Query and Document Understanding for Search at Facebook.
Proceedings of KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

2020
A Social Search Model for Large Scale Social Networks.
CoRR, 2020

2017
Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

2016
Jointly Learning Grounded Task Structures from Language Instruction and Visual Demonstration.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

