Jehyeon Bang
ORCID: 0009-0007-6672-0587
According to our database, Jehyeon Bang authored at least 6 papers between 2024 and 2026.
Bibliography
2026
SpecMoE: A Fast and Efficient Mixture-of-Experts Inference via Self-Assisted Speculative Decoding.
CoRR, April, 2026
PASCAL: A Phase-Aware Scheduling Algorithm for Serving Reasoning-based Large Language Models.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026
2025
Debunking the CUDA Myth Towards GPU-based AI Systems: Evaluation of the Performance and Programmability of Intel's Gaudi NPU for AI Model Serving.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025
2024
IEEE Comput. Archit. Lett., 2024
vTrain: A Simulation Framework for Evaluating Cost-Effective and Compute-Optimal Large Language Model Training.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024