Ming Zhang
Affiliations: Fudan University, Shanghai, China
According to our database, Ming Zhang authored at least 37 papers between 2023 and 2026.
Bibliography
2026
JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees.
CoRR, March, 2026
CoRR, February, 2026
DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training.
CoRR, February, 2026
Can Deep Research Agents Retrieve and Organize? Evaluating the Synthesis Gap with Expert Taxonomies.
CoRR, January, 2026
Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control.
CoRR, January, 2026
OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment.
CoRR, January, 2026
Sci. China Inf. Sci., 2026
MetaAct-RL: Training Language Models for Reasoning Through Meta-Action-Based Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
What Makes a Good Speech Tokenizer for LLM-Centric Speech Generation? A Systematic Study.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, December, 2025
CoRR, November, 2025
From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling.
CoRR, October, 2025
LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models.
CoRR, August, 2025
CoRR, August, 2025
SpeechRole: A Large-Scale Dataset and Benchmark for Evaluating Speech Role-Playing Agents.
CoRR, August, 2025
CoRR, June, 2025
EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving.
CoRR, June, 2025
CoRR, May, 2025
Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training.
CoRR, February, 2025
Sci. China Inf. Sci., 2025
LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Governance in Motion: Co-evolution of Constitutions and AI models for Scalable Safety.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
CoRR, 2024
Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning.
CoRR, 2024
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities.
CoRR, 2024
Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023