Quan Chen

Orcid: 0000-0002-4865-2396

Affiliations:
  • Kuaishou Technology, Beijing, China
  • Alibaba Group, Beijing, China


According to our database1, Quan Chen authored at least 22 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Training-free subject-enhanced attention guidance for compositional text-to-image generation.
Pattern Recognit., 2026

2025
From Principles to Applications: A Comprehensive Survey of Discrete Tokenizers in Generation, Comprehension, Recommendation, and Information Retrieval.
CoRR, February, 2025

MOTION: Multi-object Video Editing with Training-Free Attention Guidance.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Text-Video Multi-Grained Integration for Video Moment Montage.
CoRR, 2024

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads.
CoRR, 2024

Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy.
CoRR, 2024

Video to Music Moment Retrieval.
CoRR, 2024

ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval.
CoRR, 2024

Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation.
CoRR, 2024

Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.
CoRR, 2024

Knowledge Condensation and Reasoning for Knowledge-based VQA.
CoRR, 2024

Spatiotemporal Fine-grained Video Description for Short Videos.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Cross-view Semantic Alignment for Livestreaming Product Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Cross-Domain Product Representation Learning for Rich-Content E-Commerce.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2021
Boosting Image Outpainting with Semantic Layout Prediction.
CoRR, 2021

2020
Progressive Feature Polishing Network for Salient Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Progressive Feature Polishing Network for Salient Object Detection.
CoRR, 2019

2018
Semantic Human Matting.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018


  Loading...