Weihua Du

Orcid: 0000-0002-8856-0277

According to our database, Weihua Du authored at least 22 papers between 2018 and 2026.

Bibliography

2026
Reinforcing Human Behavior Simulation via Verbal Feedback.
CoRR, May, 2026

Why Search When You Can Transfer? Amortized Agentic Workflow Design from Structural Priors.
CoRR, April, 2026

The Consensus Trap: Rescuing Multi-Agent LLMs from Adversarial Majorities via Token-Level Collaboration.
CoRR, April, 2026

AdaExplore: Failure-Driven Adaptation and Diversity-Preserving Search for Efficient Kernel Generation.
CoRR, April, 2026

Mind the Sim2Real Gap in User Simulation for Agentic Tasks.
CoRR, March, 2026

GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning.
CoRR, February, 2026

2025
Training Proactive and Personalized LLM Agents.
CoRR, November, 2025

Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management.
CoRR, October, 2025

Generalizable End-to-End Tool-Use RL with Synthetic CodeGym.
CoRR, September, 2025

Optimizing Temperature for Language Models with Multi-Sample Inference.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Agentic-R1: Distilled Dual-Strategy Reasoning.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Building Cooperative Embodied Agents Modularly with Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Improving Span-Based Aspect Sentiment Triplet Extraction with Abundant Syntax Knowledge.
Neural Process. Lett., October, 2023

T-Eval: Evaluating the Tool Utilization Capability Step by Step.
CoRR, 2023

Iteratively Learn Diverse Strategies with State Distance Information.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Automatic Truss Design with Reinforcement Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

2020
Mobile Phone Usage Habits of Chinese Graduate Students and the Training of MTI Interpreters.
Proceedings of the Human Interaction, Emerging Technologies and Future Applications III, 2020

Distance Education and MOOCs for Language Learning in China.
Proceedings of the Learning Technologies and Systems, 2020

2018
Website and Literature Teaching: Teaching Experiment of Literary Texts at the Beginning of German Studies in China.
Proceedings of the Emerging Technologies for Education - Third International Symposium, 2018

