Yan Li

Orcid: 0000-0003-1882-3331

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China


According to our database1, Yan Li authored at least 22 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
EvoGM: Learning to Merge LLMs via Evolutionary Generative Optimization.
CoRR, May, 2026

ASR-Enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval.
IEEE Trans. Multim., 2026

2025
Kwai Keye-VL 1.5 Technical Report.
CoRR, September, 2025

Kwai Keye-VL Technical Report.
CoRR, July, 2025

<i>COEF-VQ: </i> Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

MUSE: Multi-Subject Unified Synthesis Via Explicit Layout Semantic Expansion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy.
CoRR, 2024

Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application.
CoRR, 2024

Knowledge Condensation and Reasoning for Knowledge-based VQA.
CoRR, 2024

Spatiotemporal Fine-grained Video Description for Short Videos.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Learning Multi-Dimensional Human Preference for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
KwaiYiiMath: Technical Report.
CoRR, 2023

Cross-view Semantic Alignment for Livestreaming Product Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Cross-Domain Product Representation Learning for Rich-Content E-Commerce.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2020
TEA: Temporal Excitation and Aggregation for Action Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Mixed Supervised Object Detection with Robust Objectness Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Transductive Zero-Shot Learning with Visual Structure Constraint.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Transductive Zero-Shot Learning via Visual Center Adaptation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Discriminative Learning of Latent Features for Zero-Shot Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Deep Semantic Structural Constraints for Zero-Shot Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018


  Loading...