Gyubin Choi
According to our database, Gyubin Choi authored at least 6 papers between 2023 and 2025.
Bibliography
2025
ADOR: A Design Exploration Framework for LLM Serving with Enhanced Latency and Throughput.
CoRR, March, 2025
ADOR: A Design Exploration Framework for LLM Serving with Enhanced Latency and Throughput.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
2024
A Latency Processing Unit: A Latency-Optimized and Highly Scalable Processor for Large Language Model Inference.
IEEE Micro, 2024
LPU: A Latency-Optimized and Highly Scalable Processor for Large Language Model Inference.
CoRR, 2024
2023
HyperAccel Latency Processing Unit (LPU™) Accelerating Hyperscale Models for Generative AI.
Proceedings of the 35th IEEE Hot Chips Symposium, 2023