Mingjie Zhan

Orcid: 0009-0008-9420-7702

According to our database¹, Mingjie Zhan authored at least 40 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Edit-Based Refinement for Parallel Masked Diffusion Language Models.

[BibT_eX]

[DOI]

CoRR, May, 2026

Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qwen3-Embedding Model.

[BibT_eX]

[DOI]

CoRR, April, 2026

FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation.

[BibT_eX]

[DOI]

CoRR, February, 2026

Integrating Large Language Models Into Recommendation via Mutual Augmentation and Adaptive Aggregation.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., January, 2026

SlidesGen-Bench: Evaluating Slides Generation via Computational and Quantitative Metrics.

[BibT_eX]

[DOI]

CoRR, January, 2026

Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

From Solver to Tutor: Evaluating the Pedagogical Intelligence of LLMs with KMP-Bench.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

RAA: Achieving Interactive Remove/Add Anything via Fully Synthetic Data.

[BibT_eX]

[DOI]

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., September, 2025

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing.

[BibT_eX]

[DOI]

CoRR, September, 2025

WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

MOOM: Maintenance, Organization and Optimization of Memory in Ultra-Long Role-Playing Dialogues.

[BibT_eX]

[DOI]

CoRR, September, 2025

Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark.

[BibT_eX]

[DOI]

CoRR, June, 2025

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch.

[BibT_eX]

[DOI]

CoRR, May, 2025

Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up.

[BibT_eX]

[DOI]

CoRR, March, 2025

Step-Controlled DPO: Leveraging Stepwise Errors for Enhancing Mathematical Reasoning of Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Alignment with Fill-In-the-Middle for Enhancing Code Generation.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

SpiritSight Agent: Advanced GUI Agent with One Look.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Probability-Consistent Preference Optimization for Enhanced LLM Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning.

[BibT_eX]

[DOI]

CoRR, 2024

Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset.

[BibT_eX]

[DOI]

CoRR, 2024

Integrating Large Language Models into Recommendation via Mutual Augmentation and Adaptive Aggregation.

[BibT_eX]

[DOI]

CoRR, 2024

Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Empowering Character-level Text Infilling by Eliminating Sub-Tokens.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation.

[BibT_eX]

[DOI]

CoRR, 2023

TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise.

[BibT_eX]

[DOI]

CoRR, 2023

Self-Supervised Sentence Compression for Meeting Summarization.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Versatile and Efficient Visual Knowledge Injection into Pre-trained Language Models with Cross-Modal Adapters.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Locality and Isotropy in Dialogue Modeling.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

VCSUM: A Versatile Chinese Meeting Summarization Dataset.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2021

GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding.

[BibT_eX]

[DOI]

CoRR, 2021

2020

DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Mingjie Zhan

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...