Jingwen Chen

Orcid: 0000-0002-7917-6003

Affiliations:

HiDream.ai Inc., Beijing, China
Sun Yat-sen University, Guangzhou, China (PhD 2021)

According to our database¹, Jingwen Chen authored at least 17 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2025

HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer.

CoRR, May, 2025

Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Improving Virtual Try-On with Garment-Focused Diffusion Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning.

Proceedings of the Computer Vision - ECCV 2024, 2024

Improving Text-Guided Object Inpainting with Semantic Pre-inpainting.

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning.

ACM Trans. Multim. Comput. Commun. Appl., February, 2023

Boosting Vision-and-Language Navigation with Direction Guiding and Backtracing.

ACM Trans. Multim. Comput. Commun. Appl., January, 2023

ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

3D Creation at Your Fingertips: From Text or Image to 3D Assets.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022

3D-Producer: A Hybrid and User-Friendly 3D Reconstruction System.

Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

2021

X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

VideoTRM: Pre-training for Video Captioning Challenge 2020.

Jingwen Chen, Hongyang Chao

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019

See and chat: automatically generating viewer-level comments on images.

Jingwen Chen, Ting Yao, Hongyang Chao

Multim. Tools Appl., 2019

Social Relation Recognition From Videos via Multi-Scale Spatial-Temporal Reasoning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Image Blind Denoising With Generative Adversarial Network Based Noise Modeling.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
