Wenya Xie

Orcid: 0009-0001-5085-7876

According to our database1, Wenya Xie authored at least 20 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
OralMLLM-Bench: Evaluating Cognitive Capabilities of Multimodal Large Language Models in Dental Practice.
CoRR, May, 2026

HeartAgent: An Autonomous Agent System for Explainable Differential Diagnosis in Cardiology.
CoRR, March, 2026

Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers.
CoRR, January, 2026

How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Enabling Doctor-Centric Medical AI with LLMs through Workflow-Aligned Tasks and Benchmarks.
CoRR, October, 2025

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning.
CoRR, June, 2025

How Memory Management Impacts LLM Agents: An Empirical Study of Experience-Following Behavior.
CoRR, May, 2025

Automating expert-level medical reasoning evaluation of large language models.
npj Digit. Medicine, 2025

Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Word Salad Chopper: Reasoning Models Waste A Ton Of Decoding Budget On Useless Repetitions, Self-Knowingly.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Knowledge Boundary of Large Language Models: A Survey.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Unlocking LLMs' Self-Improvement Capacity with Autonomous Learning for Domain Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Omnidirectional-Sensor-System-Based Texture Noise Correction in Large-Scale 3D Reconstruction.
Sensors, 2024

LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them.
CoRR, 2024

LLMs Could Autonomously Learn Without External Supervision.
CoRR, 2024

2023
Coarse-to-Fine Hybrid 3D Mapping System With Co-Calibrated Omnidirectional Camera and Non-Repetitive LiDAR.
IEEE Robotics Autom. Lett., March, 2023

MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V.
CoRR, 2023

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs.
CoRR, 2023


  Loading...