Quan Chen

Orcid: 0000-0002-4865-2396

Affiliations:
  • Kuaishou Technology, Beijing, China
  • Alibaba Group, Beijing, China


According to our database, Quan Chen authored at least 31 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2026
FBOS-RL: Feedback-Driven Bi-Objective Synergistic Reinforcement Learning.
CoRR, May, 2026

Generative Recommendation for Large-Scale Advertising.
CoRR, February, 2026

Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders.
CoRR, January, 2026

ASR-Enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval.
IEEE Trans. Multim., 2026

Training-free subject-enhanced attention guidance for compositional text-to-image generation.
Pattern Recognit., 2026

2025
EEA: Exploration-Exploitation Agent for Long Video Understanding.
CoRR, December, 2025

From Principles to Applications: A Comprehensive Survey of Discrete Tokenizers in Generation, Comprehension, Recommendation, and Information Retrieval.
CoRR, February, 2025

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

MOTION: Multi-object Video Editing with Training-Free Attention Guidance.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Music Grounding by Short Video.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

SweetTok: Semantic-Aware Spatial-Temporal Tokenizer for Compact Video Discretization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Improving Preference Alignment of LLM with Inference-Free Self-Refinement.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matching.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization.
CoRR, 2024

Text-Video Multi-Grained Integration for Video Moment Montage.
CoRR, 2024

Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads.
CoRR, 2024

Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy.
CoRR, 2024

Video to Music Moment Retrieval.
CoRR, 2024

Training-free Subject-Enhanced Attention Guidance for Compositional Text-to-image Generation.
CoRR, 2024

Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.
CoRR, 2024

Knowledge Condensation and Reasoning for Knowledge-based VQA.
CoRR, 2024

Spatiotemporal Fine-grained Video Description for Short Videos.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Cross-view Semantic Alignment for Livestreaming Product Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Cross-Domain Product Representation Learning for Rich-Content E-Commerce.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2021
Boosting Image Outpainting with Semantic Layout Prediction.
CoRR, 2021

2020
Progressive Feature Polishing Network for Salient Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Progressive Feature Polishing Network for Salient Object Detection.
CoRR, 2019

2018
Semantic Human Matting.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

