Haogeng Liu

According to our database, Haogeng Liu authored at least 11 papers between 2023 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Think 360°: Evaluating the Width-centric Reasoning Capability of MLLMs Beyond Depth.

CoRR, March, 2026

2025

Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning.

CoRR, May, 2025

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024

InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding.

CoRR, 2024

Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

DeVAn: Dense Video Annotation for Video-Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Video-CSR: Complex Video Digest Creation for Visual-Language Models.

CoRR, 2023

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling.

CoRR, 2023

Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion.

CoRR, 2023

UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion.

CoRR, 2023
