Haoran Wang

Orcid: 0000-0002-6098-4772

Affiliations:
  • Baidu Research, Beijing, China
  • Tianjin University, School of Electrical and Information Engineering, China (former)


According to our database1, Haoran Wang authored at least 23 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
FAST: Flexibly Controllable Arbitrary Style Transfer via Latent Diffusion Models.
ACM Trans. Multim. Comput. Commun. Appl., September, 2025

SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

2024
Multi-task hierarchical convolutional network for visual-semantic cross-modal retrieval.
Pattern Recognit., 2024

Hierarchical matching and reasoning for multi-query image retrieval.
Neural Networks, 2024

VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model.
CoRR, 2024

SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior.
CoRR, 2024

HiCAST: Highly Customized Arbitrary Style Transfer with Adapter Enhanced Diffusion Models.
CoRR, 2024

Eliminate Before Align: A Remote Sensing Image-Text Retrieval Framework with Keyword Explicit Reasoning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Neural Field Classifiers via Target Encoding and Classification Loss.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model.
Proceedings of the Computer Vision - ACCV 2024, 2024

2023
S3IM: Stochastic Structural SIMilarity and Its Unreasonable Effectiveness for Neural Fields.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
SMAN: Stacked Multimodal Attention Network for Cross-Modal Image-Text Retrieval.
IEEE Trans. Cybern., 2022

Boosting Video-Text Retrieval with Explicit High-Level Semantics.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Temporal Saliency Query Network for Efficient Video Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Step-Wise Hierarchical Alignment Network for Image-Text Matching.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020
Stacked squeeze-and-excitation recurrent residual network for visual-semantic matching.
Pattern Recognit., 2020

Multi-Modal Memory Enhancement Attention Network for Image-Text Matching.
IEEE Access, 2020

Consensus-Aware Visual-Semantic Embedding for Image-Text Matching.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Image-attribute reciprocally guided attention network for pedestrian attribute recognition.
Pattern Recognit. Lett., 2019

Small and Dense Commodity Object Detection with Multi-Scale Receptive Field Attention.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Saliency-Guided Attention Network for Image-Sentence Matching.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Cross-Modal Retrieval with Discriminative Dual-Path CNN.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018


  Loading...