Quan Chen

Orcid: 0000-0002-4865-2396

Affiliations:

Kuaishou Technology, Beijing, China
Alibaba Group, Beijing, China

According to our database¹, Quan Chen authored at least 23 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Bibliography

2026

Training-free subject-enhanced attention guidance for compositional text-to-image generation.

Pattern Recognit., 2026

2025

From Principles to Applications: A Comprehensive Survey of Discrete Tokenizers in Generation, Comprehension, Recommendation, and Information Retrieval.

CoRR, February, 2025

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

MOTION: Multi-object Video Editing with Training-Free Attention Guidance.

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Text-Video Multi-Grained Integration for Video Moment Montage.

CoRR, 2024

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads.

CoRR, 2024

Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy.

CoRR, 2024

Video to Music Moment Retrieval.

CoRR, 2024

ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval.

CoRR, 2024

Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation.

CoRR, 2024

Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.

CoRR, 2024

Knowledge Condensation and Reasoning for Knowledge-based VQA.

CoRR, 2024

Spatiotemporal Fine-grained Video Description for Short Videos.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Cross-view Semantic Alignment for Livestreaming Product Recognition.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Cross-Domain Product Representation Learning for Rich-Content E-Commerce.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2021

Boosting Image Outpainting with Semantic Layout Prediction.

CoRR, 2021

2020

Progressive Feature Polishing Network for Salient Object Detection.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Progressive Feature Polishing Network for Salient Object Detection.

CoRR, 2019

2018

Semantic Human Matting.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018