Mingjie Zhan

Orcid: 0009-0008-9420-7702

According to our database¹, Mingjie Zhan authored at least 32 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., September, 2025

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing.

[BibT_eX]

[DOI]

CoRR, September, 2025

WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, September, 2025

MOOM: Maintenance, Organization and Optimization of Memory in Ultra-Long Role-Playing Dialogues.

[BibT_eX]

[DOI]

CoRR, September, 2025

Alignment with Fill-In-the-Middle for Enhancing Code Generation.

[BibT_eX]

[DOI]

CoRR, August, 2025

Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark.

[BibT_eX]

[DOI]

CoRR, June, 2025

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch.

[BibT_eX]

[DOI]

CoRR, May, 2025

Navi-plus: Managing Ambiguous GUI Navigation Tasks with Follow-up.

[BibT_eX]

[DOI]

CoRR, March, 2025

Step-Controlled DPO: Leveraging Stepwise Errors for Enhancing Mathematical Reasoning of Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SpiritSight Agent: Advanced GUI Agent with One Look.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Probability-Consistent Preference Optimization for Enhanced LLM Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning.

[BibT_eX]

[DOI]

CoRR, 2024

Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset.

[BibT_eX]

[DOI]

CoRR, 2024

Integrating Large Language Models into Recommendation via Mutual Augmentation and Adaptive Aggregation.

[BibT_eX]

[DOI]

CoRR, 2024

Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Empowering Character-level Text Infilling by Eliminating Sub-Tokens.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation.

[BibT_eX]

[DOI]

CoRR, 2023

TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise.

[BibT_eX]

[DOI]

CoRR, 2023

Self-Supervised Sentence Compression for Meeting Summarization.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Versatile and Efficient Visual Knowledge Injection into Pre-trained Language Models with Cross-Modal Adapters.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Locality and Isotropy in Dialogue Modeling.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Reconstruct Before Summarize: An Efficient Two-Step Framework for Condensing and Summarizing Meeting Transcripts.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

VCSUM: A Versatile Chinese Meeting Summarization Dataset.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2021

GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding.

[BibT_eX]

[DOI]

CoRR, 2021

2020

DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Mingjie Zhan

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...