Fan Yang

Orcid: 0009-0005-4570-5885

Affiliations:
  • KuaiShou Inc., Beijing, China


According to our database1, Fan Yang authored at least 31 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Kwai Keye-VL Technical Report.
CoRR, July, 2025

Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model.
CoRR, July, 2025

OneRec Technical Report.
CoRR, June, 2025

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning.
CoRR, May, 2025

Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation.
CoRR, May, 2025

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning.
CoRR, May, 2025

VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform.
CoRR, April, 2025

InstructEngine: Instruction-driven Text-to-Image Alignment.
CoRR, April, 2025

Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models.
CoRR, April, 2025

TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs.
CoRR, March, 2025

Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding.
CoRR, March, 2025

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation.
CoRR, February, 2025

iMOVE: Instance-Motion-Aware Video Understanding.
CoRR, February, 2025

MM-RLHF: The Next Step Forward in Multimodal LLM Alignment.
CoRR, February, 2025

TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types.
CoRR, February, 2025

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

iMOVE : Instance-Motion-Aware Video Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Kwai-STaR: Transform LLMs into State-Transition Reasoners.
CoRR, 2024

EVLM: An Efficient Vision-Language Model for Visual Understanding.
CoRR, 2024

Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model.
CoRR, 2024

Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

A Multimodal Transformer for Live Streaming Highlight Prediction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

2023
ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer.
CoRR, 2023

A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

2022
A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset.
CoRR, 2022

MLTR: Multi-Label Classification with Transformer.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

2021
Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Time Series Data Augmentation for Deep Learning: A Survey.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021


  Loading...