Zhaoning Zhang

This is a disambiguation page; it lists papers by multiple people with the same or a similar name.

Bibliography

2026
Alloc-MoE: Budget-Aware Expert Activation Allocation for Efficient Mixture-of-Experts Inference.
CoRR, April, 2026

Sparrow: Text-Anchored Window Attention with Visual-Semantic Glimpsing for Speculative Decoding in Video LLMs.
CoRR, February, 2026

2025
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

