Haogeng Liu

According to our database, Haogeng Liu authored at least 11 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Think 360°: Evaluating the Width-centric Reasoning Capability of MLLMs Beyond Depth.
CoRR, March, 2026

2025
Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning.
CoRR, May, 2025

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding.
CoRR, 2024

Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

DeVAn: Dense Video Annotation for Video-Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Video-CSR: Complex Video Digest Creation for Visual-Language Models.
CoRR, 2023

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling.
CoRR, 2023

Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion.
CoRR, 2023

UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion.
CoRR, 2023

