We stand with Ukraine

We stand with Ukraine

Haokun Wen

Orcid: 0000-0003-0633-3722

According to our database¹, Haokun Wen authored at least 24 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2025

Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, September, 2025

Spatial Understanding from Videos: Structured Prompts Meet Simulation Data.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, June, 2025

FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, March, 2025

A Comprehensive Survey on Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, February, 2025

Interactive Garment Recommendation with User in the Loop.

[BibT_eX]

[DOI]

Federico Becattini

,

,

,

,

,

,

Alberto Del Bimbo

ACM Trans. Multim. Comput. Commun. Appl., January, 2025

FiRE: Enhancing MLLMs with Fine-Grained Context Learning for Complex Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Pseudo Triplet Guided Few-shot Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the International Joint Conference on Neural Networks, 2025

ENCODER: Entity Mining and Modification Relation Binding for Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Pseudo-triplet Guided Few-shot Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

2023

Egocentric Early Action Prediction via Multimodal Transformer-Based Dual Action Prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., September, 2023

Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Finetuning Language Models for Multimodal Question Answering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Target-Guided Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

CLIP-Based Composed Image Retrieval with Comprehensive Fusion and Data Augmentation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the AI 2023: Advances in Artificial Intelligence, 2023

2022

Partially Supervised Compatibility Modeling.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Hsing Yeh

,

,

IEEE Trans. Image Process., 2022

Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Hsing Yeh

,

Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

2021

Attribute-wise Explainable Fashion Compatibility Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2021

Comprehensive Linguistic-Visual Composition Network for Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations.

[BibT_eX]

[DOI]

,

,

,

Chung-Hsing Yeh

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020

Generative Attribute Manipulation Scheme for Flexible Fashion Search.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Loading...