Eunyeong Cho
ORCID: 0009-0009-0626-0931
According to our database, Eunyeong Cho authored at least 5 papers between 2024 and 2026.
Bibliography
2026
SpecMoE: A Fast and Efficient Mixture-of-Experts Inference via Self-Assisted Speculative Decoding.
CoRR, April, 2026
PASCAL: A Phase-Aware Scheduling Algorithm for Serving Reasoning-based Large Language Models.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026
2025
Debunking the CUDA Myth Towards GPU-based AI Systems: Evaluation of the Performance and Programmability of Intel's Gaudi NPU for AI Model Serving.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025
2024
IEEE Comput. Archit. Lett., 2024