Zhaode Wang

ORCID: 0009-0009-0498-7491

According to our database, Zhaode Wang authored at least 11 papers between 2022 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

RecGPT-Mobile: On-Device Large Language Models for User Intent Understanding in Taobao Feed Recommendation.

CoRR, May, 2026

RetentiveKV: State-Space Memory for Uncertainty-Aware Multimodal KV Cache Eviction.

CoRR, May, 2026

MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?

CoRR, March, 2026

Efficient LLM Inference on ARM via Hardware-Aware Operator Co-design and Heuristic Mixed-Precision Search.

Proceedings of the IEEE International Symposium on Circuits and Systems, 2026

AccKV: Towards Efficient Audio-Video LLMs Inference via Adaptive-Focusing and Cross-Calibration KV Cache Optimization.

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

FlowMM: Cross-Modal Information Flow Guided KV Cache Merging for Efficient Multimodal Context Inference.

CoRR, November, 2025

PureKV: Plug-and-Play KV Cache Optimization with Spatial-Temporal Sparse Attention for Vision-Language Large Models.

CoRR, October, 2025

MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection.

CoRR, June, 2025

MadaKV: Adaptive Modality-Perception KV Cache Eviction for Efficient Multimodal Long-Context Inference.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices.

Proceedings of the 6th ACM International Conference on Multimedia in Asia, Workshops, 2024

2022

Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning.

Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022
