Yifan Yang
ORCID: 0000-0002-5481-2851
Affiliations:
- Microsoft Research Asia, Shanghai, China
- Peking University, Beijing, China (until 2021)
According to our database, Yifan Yang authored at least 55 papers between 2022 and 2026.
Links
Online presence:
- on orcid.org
Bibliography
2026
CoRR, April, 2026
AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation.
CoRR, April, 2026
CoRR, March, 2026
CoRR, March, 2026
CoRR, February, 2026
AMID: Model-Agnostic Dataset Distillation by Adversarial Mutual Information Minimization.
Proceedings of the ACM Web Conference 2026, 2026
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026
HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, October, 2025
VidGuard-R1: AI-Generated Video Detection and Explanation via Reasoning MLLMs and RL.
CoRR, October, 2025
CoRR, October, 2025
Efficient and Adaptive Diffusion Model Inference Through Lookup Table on Mobile Devices.
IEEE Trans. Mob. Comput., September, 2025
CoRR, September, 2025
A Disease-Centric Vision-Language Foundation Model for Precision Oncology in Kidney Cancer.
CoRR, August, 2025
CoRR, May, 2025
ViaRL: Adaptive Temporal Grounding via Visual Iterated Amplification Reinforcement Learning.
CoRR, May, 2025
CoRR, May, 2025
StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition.
CoRR, March, 2025
Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences.
CoRR, March, 2025
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs.
CoRR, March, 2025
Trans. Mach. Learn. Res., 2025
Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment.
Proceedings of the 23rd ACM Conference on Embedded Networked Sensor Systems, 2025
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025
DreamDistribution: Learning Prompt Distribution for Diverse In-distribution Generation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
REDUCIO! Generating 1K Video Within 16 Seconds Using Extremely Compressed Motion Latents.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
2024
Proc. ACM Netw., 2024
REDUCIO! Generating 1024⨉1024 Video within 16 Seconds using Extremely Compressed Motion Latents.
CoRR, 2024
Making Every Frame Matter: Continuous Video Understanding for Large Models via Adaptive State Modeling.
CoRR, 2024
Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning.
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
LoRASC: Expressive and Generalizable Low-rank Adaptation for Large Models via Slow Cascaded Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
2023
IEEE Trans. Multim., 2023
CoRR, 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Similarity Distribution Based Membership Inference Attack on Person Re-identification.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022