Minjie Hong

Orcid: 0009-0000-0368-2527

According to our database1, Minjie Hong authored at least 15 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
DocRetriever: A Plug-and-Play Framework for Multimodal Document Retrieval with Comprehensive Benchmark.
CoRR, May, 2026

Diffusion Model as a Generalist Segmentation Learner.
CoRR, April, 2026

DUET: Joint Exploration of User Item Profiles in Recommendation System.
CoRR, April, 2026

DiagramGPT-Llama3: Enabling Editable, High-Fidelity Diagram Generation with Vision Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Thinking with Programming Vision: Towards a Unified View for Thinking with Images.
CoRR, December, 2025

Generative Reasoning Recommendation via LLMs.
CoRR, October, 2025

DSI-Bench: A Benchmark for Dynamic Spatial Intelligence.
CoRR, October, 2025

APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization.
CoRR, June, 2025

Observe-R1: Unlocking Reasoning Abilities of MLLMs with Dynamic Progressive Reinforcement Learning.
CoRR, May, 2025

EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration.
Proceedings of the ACM on Web Conference 2025, 2025

Vela: Scalable Embeddings with Voice Large Language Models for Multimodal Retrieval.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

AudioVSR: Enhancing Video Speech Recognition with Audio Data.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
The Shifting Role of Government in Government Data Openness.
Proceedings of the Big Data - BigData 2023, 2023


  Loading...