Zhaode Wang

Orcid: 0009-0009-0498-7491

According to our database1, Zhaode Wang authored at least 11 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
RecGPT-Mobile: On-Device Large Language Models for User Intent Understanding in Taobao Feed Recommendation.
CoRR, May, 2026

RetentiveKV: State-Space Memory for Uncertainty-Aware Multimodal KV Cache Eviction.
CoRR, May, 2026

MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
CoRR, March, 2026

Efficient LLM Inference on ARM via Hardware-Aware Operator Co-design and Heuristic Mixed-Precision Search.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2026

AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
FlowMM: Cross-Modal Information Flow Guided KV Cache Merging for Efficient Multimodal Context Inference.
CoRR, November, 2025

PureKV: Plug-and-Play KV Cache Optimization with Spatial-Temporal Sparse Attention for Vision-Language Large Models.
CoRR, October, 2025

MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection.
CoRR, June, 2025

MadaKV: Adaptive Modality-Perception KV Cache Eviction for Efficient Multimodal Long-Context Inference.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices.
Proceedings of the 6th ACM International Conference on Multimedia in Asia, Workshops, 2024

2022
Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022


  Loading...