Jingwen Chen

Orcid: 0000-0002-7917-6003

Affiliations:
  • HiDream.ai Inc., Beijing, China
  • Sun Yat-sen University, Guangzhou, China (PhD 2021)


According to our database, Jingwen Chen authored at least 17 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer.
CoRR, May, 2025

Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Improving Virtual Try-On with Garment-Focused Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Improving Text-Guided Object Inpainting with Semantic Pre-inpainting.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning.
ACM Trans. Multim. Comput. Commun. Appl., February, 2023

Boosting Vision-and-Language Navigation with Direction Guiding and Backtracing.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

3D Creation at Your Fingertips: From Text or Image to 3D Assets.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
3D-Producer: A Hybrid and User-Friendly 3D Reconstruction System.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

2021
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
VideoTRM: Pre-training for Video Captioning Challenge 2020.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019
See and chat: automatically generating viewer-level comments on images.
Multim. Tools Appl., 2019

Social Relation Recognition From Videos via Multi-Scale Spatial-Temporal Reasoning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Image Blind Denoising With Generative Adversarial Network Based Noise Modeling.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
