Zhaode Wang
Orcid: 0009-0009-0498-7491
According to our database1,
Zhaode Wang authored at least 8 papers
between 2022 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
CoRR, March, 2026
AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
FlowMM: Cross-Modal Information Flow Guided KV Cache Merging for Efficient Multimodal Context Inference.
CoRR, November, 2025
PureKV: Plug-and-Play KV Cache Optimization with Spatial-Temporal Sparse Attention for Vision-Language Large Models.
CoRR, October, 2025
MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection.
CoRR, June, 2025
MadaKV: Adaptive Modality-Perception KV Cache Eviction for Efficient Multimodal Long-Context Inference.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, Workshops, 2024
2022
Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022