Zhaoning Zhang
This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.
Bibliography
2026
Alloc-MoE: Budget-Aware Expert Activation Allocation for Efficient Mixture-of-Experts Inference.
CoRR, April, 2026
Sparrow: Text-Anchored Window Attention with Visual-Semantic Glimpsing for Speculative Decoding in Video LLMs.
CoRR, February, 2026
2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025