Haolong Yan

Orcid: 0009-0007-9745-3135

According to our database1, Haolong Yan authored at least 9 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding.
CoRR, July, 2025

M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?
CoRR, March, 2025

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model.
CoRR, February, 2025

Bi-directional dual contrastive adapting method for alleviating hallucination in visual question answering.
Expert Syst. Appl., 2025

2024
TOP:A New Target-Audience Oriented Content Paraphrase Task.
CoRR, 2024

ELEMO: Elements Focused Emotion Recognition for Sticker Images.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Visual Enhanced Entity-Level Interaction Network for Multimodal Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Leveraging Generative Large Language Models with Visual Instruction and Demonstration Retrieval for Multimodal Sarcasm Detection.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2022
Entity-level Interaction via Heterogeneous Graph for Multimodal Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022


  Loading...