Zhaode Wang

Orcid: 0009-0009-0498-7491

According to our database1, Zhaode Wang authored at least 8 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
CoRR, March, 2026

AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
FlowMM: Cross-Modal Information Flow Guided KV Cache Merging for Efficient Multimodal Context Inference.
CoRR, November, 2025

PureKV: Plug-and-Play KV Cache Optimization with Spatial-Temporal Sparse Attention for Vision-Language Large Models.
CoRR, October, 2025

MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection.
CoRR, June, 2025

MadaKV: Adaptive Modality-Perception KV Cache Eviction for Efficient Multimodal Long-Context Inference.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, Workshops, 2024

2022
Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022


  Loading...