Haokun Wen

Orcid: 0000-0003-0633-3722

According to our database, Haokun Wen authored at least 24 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Dual Knowledge-Enhanced Two-Stage Reasoner for Multimodal Dialog Systems.
CoRR, September, 2025

Spatial Understanding from Videos: Structured Prompts Meet Simulation Data.
CoRR, June, 2025

FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval.
CoRR, March, 2025

A Comprehensive Survey on Composed Image Retrieval.
CoRR, February, 2025

Interactive Garment Recommendation with User in the Loop.
ACM Trans. Multim. Comput. Commun. Appl., January, 2025

FiRE: Enhancing MLLMs with Fine-Grained Context Learning for Complex Image Retrieval.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

Pseudo Triplet Guided Few-shot Composed Image Retrieval.
Proceedings of the International Joint Conference on Neural Networks, 2025

ENCODER: Entity Mining and Modification Relation Binding for Composed Image Retrieval.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Pseudo-triplet Guided Few-shot Composed Image Retrieval.
CoRR, 2024

Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024

2023
Egocentric Early Action Prediction via Multimodal Transformer-Based Dual Action Prediction.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Self-Training Boosted Multi-Faceted Matching Network for Composed Image Retrieval.
CoRR, 2023

Finetuning Language Models for Multimodal Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Target-Guided Composed Image Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CLIP-Based Composed Image Retrieval with Comprehensive Fusion and Data Augmentation.
Proceedings of the AI 2023: Advances in Artificial Intelligence, 2023

2022
Partially Supervised Compatibility Modeling.
IEEE Trans. Image Process., 2022

Personalized Fashion Compatibility Modeling via Metapath-guided Heterogeneous Graph Learning.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

2021
Attribute-wise Explainable Fashion Compatibility Modeling.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Comprehensive Linguistic-Visual Composition Network for Image Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Generative Attribute Manipulation Scheme for Flexible Fashion Search.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

