Nachuan Wang
Orcid: 0009-0000-1431-0067
According to our database1,
Nachuan Wang authored at least 7 papers
between 2024 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
LiLo: Harnessing the on-Chip Accelerators in Intel CPUs for Compressed LLM Inference Acceleration.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026
2025
NetZIP: Algorithm/Hardware Co-design of In-network Lossless Compression for Distributed Large Model Training.
Proceedings of the 58th IEEE/ACM International Symposium on Microarchitecture, 2025
LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025
2024
Exploiting Intel Advanced Matrix Extensions (AMX) for Large Language Model Inference.
IEEE Comput. Archit. Lett., 2024