Baeseong Park

According to our database¹, Baeseong Park authored at least 23 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

ICaRus: Identical Cache Reuse for Efficient Multi Model Inference.

[BibT_eX]

[DOI]

CoRR, March, 2026

SUN: Shared Use of Next-token Prediction for Efficient Multi-LLM Disaggregated Serving.

[BibT_eX]

[DOI]

CoRR, March, 2026

Affine-Scaled Attention: Towards Flexible and Stable Transformer Attention.

[BibT_eX]

[DOI]

CoRR, February, 2026

PrefillShare: A Shared Prefill Module for KV Reuse in Multi-LLM Disaggregated Serving.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs.

[BibT_eX]

[DOI]

CoRR, December, 2025

FIGLUT: An Energy-Efficient Accelerator Design for FP-INT GEMM Using Look-Up Tables.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

2024

DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation.

[BibT_eX]

[DOI]

CoRR, 2024

DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Sparsity-Aware Memory Interface Architecture using Stacked XORNet Compression for Accelerating Pruned-DNN Models.

[BibT_eX]

[DOI]

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer Arithmetic.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

TF-MVP: Novel Sparsity-Aware Transformer Accelerator with Mixed-Length Vector Pruning.

[BibT_eX]

[DOI]

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022

nuQmm: Quantized MatMul for Efficient Inference of Large-Scale Generative Language Models.

[BibT_eX]

[DOI]

CoRR, 2022

Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

Modulating Regularization Frequency for Efficient Compression-Aware Model Training.

[BibT_eX]

[DOI]

CoRR, 2021

Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity.

[BibT_eX]

[DOI]

CoRR, 2021

Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization.

[BibT_eX]

[DOI]

CoRR, 2021

2020

BiQGEMM: matrix multiplication with lookup table for binary-coding-based quantized DNNs.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2020

FleXOR: Trainable Fractional Quantization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Structured Compression by Weight Encryption for Unstructured Pruning and Quantization.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Structured Compression by Unstructured Pruning for Sparse Quantized Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Baeseong Park

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...