Zhongkai Yu
ORCID: 0000-0001-9893-3498
According to our database, Zhongkai Yu authored at least 13 papers between 2022 and 2026.
Bibliography
2026
AMMA: A Multi-Chiplet Memory-Centric Architecture for Low-Latency 1M Context Attention Serving.
CoRR, April, 2026
CoRR, February, 2026
ScaleSim: Serving Large-Scale Multi-Agent Simulation with Invocation Distance-Based Memory Management.
CoRR, January, 2026
ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip Design.
CoRR, January, 2026
2025
Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting.
CoRR, October, 2025
KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads.
CoRR, May, 2025
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., February, 2025
KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads.
Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation, 2025
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025
2024
Environmental Condition Aware Super-Resolution Acceleration Framework in Server-Client Hierarchies.
ACM Trans. Archit. Code Optim., December, 2024
Cambricon-LLM: A Chiplet-Based Hybrid Architecture for On-Device Inference of 70B LLM.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
2022
E²SR: an end-to-end video CODEC assisted system for super resolution acceleration.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022