Wenhao Yu

Orcid: 0000-0002-4075-5980

Affiliations:
  • Tencent AI, Seattle Lab., WA, USA
  • University of Notre Dame, Department of Computer Science and Engineering, IN, USA (PhD 2023)


According to our database, Wenhao Yu authored at least 104 papers between 2019 and 2026.

Bibliography

2026
Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis.
CoRR, May, 2026

Measure Twice, Click Once: Co-evolving Proposer and Visual Critic via Reinforcement Learning for GUI Grounding.
CoRR, April, 2026

The Single-Multi Evolution Loop for Self-Improving Model Collaboration Systems.
CoRR, February, 2026

Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning.
CoRR, January, 2026

VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models.
Trans. Mach. Learn. Res., 2026

WebRollback: Enhancing Web Agents with Explicit Rollback Mechanisms.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

ReCode: Updating Code API Knowledge with Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning.
CoRR, December, 2025

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing.
CoRR, December, 2025

Guided Self-Evolving LLMs with Minimal Human Supervision.
CoRR, December, 2025

Don't Throw Away Your Pretrained Model.
CoRR, October, 2025

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution.
CoRR, October, 2025

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning.
CoRR, October, 2025

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation.
CoRR, September, 2025

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning.
CoRR, September, 2025

Self-Rewarding Vision-Language Model via Reasoning Decomposition.
CoRR, August, 2025

R-Zero: Self-Evolving Reasoning LLM from Zero Data.
CoRR, August, 2025

MobileGUI-RL: Advancing Mobile GUI Agent through Reinforcement Learning in Online Environment.
CoRR, July, 2025

ReCode: Updating Code API Knowledge with Reinforcement Learning.
CoRR, June, 2025

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality.
CoRR, June, 2025

VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models.
CoRR, May, 2025

WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model.
CoRR, April, 2025

Enhancing Web Agents with Explicit Rollback Mechanisms.
CoRR, April, 2025

Towards Trustworthy GUI Agents: A Survey.
CoRR, March, 2025

Do Retrieval-Augmented Language Models Adapt to Varying User Needs?
CoRR, February, 2025

Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification.
CoRR, February, 2025

OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas.
CoRR, January, 2025

Leopard: A Vision Language Model for Text-Rich Multi-Image Tasks.
Trans. Mach. Learn. Res., 2025

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Self-Regularization with Sparse Autoencoders for Controllable LLM-based Classification.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DSBench: How Far Are Data Science Agents from Becoming Data Science Experts?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Retrieval-augmented GUI Agents with Generative Guidelines.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

WebEvolver: Enhancing Web Agent Self-Improvement with Co-evolving World Model.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Knowledge-augmented Methods for Natural Language Processing.
Springer Briefs in Computer Science, Springer, ISBN: 978-981-97-0749-2, 2024

Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots.
CoRR, 2024

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?
CoRR, 2024

DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems.
CoRR, 2024

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions.
CoRR, 2024

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning.
CoRR, 2024

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions.
CoRR, 2024

Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training.
CoRR, 2024

StarCoder 2 and The Stack v2: The Next Generation.
CoRR, 2024

Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Dense X Retrieval: What Retrieval Granularity Should We Use?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Deep Multimodal Complementarity Learning.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

StarCoder: may the source be with you!
Trans. Mach. Learn. Res., 2023

Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions.
Trans. Assoc. Comput. Linguistics, 2023

Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models.
CoRR, 2023

Improving Language Models via Plug-and-Play Retrieval Feedback.
CoRR, 2023

GraphPatcher: Mitigating Degree Bias for Graph Neural Networks via Test-time Augmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Second Workshop on Knowledge-Augmented Methods for Natural Language Processing.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Multi-task Self-supervised Graph Neural Networks Enable Stronger Task Generalization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generate rather than Retrieve: Large Language Models are Strong Context Generators.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Pre-training Language Models for Comparative Reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

IfQA: A Dataset for Open-domain Question Answering under Counterfactual Presuppositions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Large Language Models are Built-in Autoregressive Search Engines.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Survey of Deep Learning for Mathematical Reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
A Survey of Knowledge-enhanced Text Generation.
ACM Comput. Surv., January, 2022

Empowering Language Models with Knowledge Graph Reasoning for Question Answering.
CoRR, 2022

Enhancing Automated Software Traceability by Transfer Learning from Open-World Data.
CoRR, 2022

Retrieval-augmented Generation across Heterogeneous Knowledge.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2022

Learning from Counterfactual Links for Link Prediction.
Proceedings of the International Conference on Machine Learning, 2022

A Unified Encoder-Decoder Framework with Entity Memory.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Grape: Knowledge Graph Enhanced Passage Reader for Open-domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Empowering Language Models with Knowledge Graph Reasoning for Open-Domain Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Retrieval Augmentation for Commonsense Reasoning: A Unified Approach.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Task Compass: Scaling Multi-task Pre-training with Task Prefix.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Knowledge-Augmented Methods for Natural Language Processing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Diversifying Content Generation for Commonsense Reasoning with Mixture of Knowledge Graph Experts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Dict-BERT: Enhancing Language Model Pre-training with Dictionary.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Counterfactual Graph Learning for Link Prediction.
CoRR, 2021

Validating Label Consistency in NER Data Annotation.
CoRR, 2021

Few-Shot Graph Learning for Molecular Property Prediction.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Technical Question Answering across Tasks and Domains.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021

Enhancing Taxonomy Completion with Concept Generation via Fusing Relational Representations.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Validating Label Consistency in NER Data Annotation.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Sentence-Permuted Paragraph Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Injecting Entity Types into Entity-Guided Text Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Knowledge-Enriched Natural Language Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: EMNLP 2021, 2021

Action Sequence Augmentation for Early Graph-based Anomaly Detection.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Early Anomaly Detection by Learning and Forecasting Behavior.
CoRR, 2020

Identifying Referential Intention with Heterogeneous Contexts.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Experimental Evidence Extraction System in Data Science with Hybrid Table Features and Ensemble Learning.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Tri-Train: Automatic Pre-Fine Tuning between Pre-Training and Fine-Tuning for SciNER.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

A Technical Question Answering System with Transfer Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

GraSeq: Graph and Sequence Fusion Learning for Molecular Property Prediction.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Crossing Variational Autoencoders for Answer Retrieval.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Tablepedia: Automating PDF Table Reading in an Experimental Evidence Exploration and Analytic System.
Proceedings of the World Wide Web Conference, 2019

Faceted Hierarchy: A New Graph Type to Organize Scientific Concepts and a Construction Method.
Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing, 2019

