Muxi Diao

Orcid: 0009-0000-4423-4157

According to our database1, Muxi Diao authored at least 21 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
DetailVerifyBench: A Benchmark for Dense Hallucination Localization in Long Image Captions.
CoRR, April, 2026

Hepato-LLaVA: An Expert MLLM with Sparse Topo-Pack Attention for Hepatocellular Pathology Analysis on Whole Slide Images.
CoRR, February, 2026

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting.
CoRR, January, 2026

Toward Generalizable Forgery Detection and Reasoning.
IEEE Trans. Image Process., 2026

MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
RO-Bench: Large-scale robustness evaluation of MLLMs with text-driven counterfactual videos.
CoRR, October, 2025

AutoRed: A Free-form Adversarial Prompt Generation Framework for Automated Red Teaming.
CoRR, October, 2025

OJBench: A Competition Level Code Benchmark For Large Language Models.
CoRR, June, 2025

DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving.
CoRR, May, 2025

Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation.
CoRR, May, 2025

CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation.
CoRR, May, 2025

CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
From Simple to Professional: A Combinatorial Controllable Image Captioning Agent.
CoRR, 2024

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data.
CoRR, 2024

We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
CoRR, 2024

Multi-Perspective Consistency Enhances Confidence Estimation in Large Language Models.
CoRR, 2024

DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning.
CoRR, 2024

How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024


  Loading...