Fan Yang
Orcid: 0009-0005-4570-5885Affiliations:
- KuaiShou Inc., Beijing, China
According to our database1,
Fan Yang authored at least 52 papers
between 2019 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2026
CoRR, May, 2026
From Semantics to Pixels: Coarse-to-Fine Masked Autoencoders for Hierarchical Visual Understanding.
CoRR, March, 2026
ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL.
CoRR, February, 2026
CREM: Compression-Driven Representation Enhancement for Multimodal Retrieval and Comprehension.
CoRR, February, 2026
VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos.
CoRR, February, 2026
ALPBench: A Benchmark for Attribution-level Long-term Personal Behavior Understanding.
CoRR, February, 2026
Meta Lattice: Model Space Redesign for Cost-Effective Industry-Scale Ads Recommendations.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026
Compressing then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026
TIME: Temporal-Sensitive Multi-Dimensional Instruction Tuning and Robust Benchmarking for Video-LLMs.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
Meta Lattice: Model Space Redesign for Cost-Effective Industry-Scale Ads Recommendations.
CoRR, December, 2025
Compression then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding.
CoRR, November, 2025
CoRR, November, 2025
COMPEER: Controllable Empathetic Reinforcement Reasoning for Emotional Support Conversation.
CoRR, August, 2025
Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model.
CoRR, July, 2025
CoRR, May, 2025
Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation.
CoRR, May, 2025
CoRR, May, 2025
TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs.
CoRR, March, 2025
CoRR, March, 2025
External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation.
CoRR, February, 2025
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types.
CoRR, February, 2025
External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025
Who You Are Matters: Bridging Interests and Social Roles via LLM-Enhanced Logic Recommendation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025
StreamingCoT: A Dataset for Temporal Dynamics and Multimodal Chain-of-Thought Reasoning in Streaming VideoQA.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model.
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
2023
ContentCTR: Frame-level Live Streaming Click-Through Rate Prediction with Multimodal Transformer.
CoRR, 2023
A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023
2022
A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset.
CoRR, 2022
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
2019
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019