31.7 LUT-SSM: A 99.3TFLOPS/W LUT-Based State-Space Model Accelerator Using Energy-Efficient Element-Wise Layer Fusion and LUT-Friendly Weight-Only Quantization.

[BibT_eX]

[DOI]

Sunwoo Yoo

Dongyun Kam

Proceedings of the IEEE International Solid-State Circuits Conference, 2026

A 9-bit 1.2GS/s Gain-Programmable Stochastic Time-to-Digital Converter with Sub-ps Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2026

2025

CodeGEMM: A Codebook-Centric Approach to Efficient GEMM in Quantized LLMs.

[BibT_eX]

[DOI]

CoRR, December, 2025

AnyBCQ: Hardware Efficient Flexible Binary-Coded Quantization for Multi-Precision LLMs.

[BibT_eX]

[DOI]

CoRR, October, 2025

SelfJudge: Faster Speculative Decoding via Self-Supervised Judge Verification.

[BibT_eX]

[DOI]

CoRR, October, 2025

Faster Inference of LLMs using FP8 on the Intel Gaudi.

[BibT_eX]

[DOI]

Joonhyung Lee

Shmulik Markovich-Golan

CoRR, March, 2025

An Investigation of FP8 Across Accelerators for LLM Inference.

[BibT_eX]

[DOI]

CoRR, February, 2025

Debunking the CUDA Myth Towards GPU-based AI Systems.

[BibT_eX]

[DOI]

CoRR, January, 2025

Fully Parallel, One-Cycle Random Shuffling for Efficient Countermeasure Against Side Channel Attack and Its Complexity Verification.

[BibT_eX]

[DOI]

IEEE Trans. Emerg. Top. Comput., 2025

LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling Matrices.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

5.4 A 22nm FDSOI CMOS-Based Compact 3-Stack Doherty Power Amplifier with a Stacked OPA-Based Bias Scheme Achieving >16.5dBm Pavg for 5G FR2 Applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid-State Circuits Conference, 2025

Debunking the CUDA Myth Towards GPU-based AI Systems: Evaluation of the Performance and Programmability of Intel's Gaudi NPU for AI Model Serving.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

FIGLUT: An Energy-Efficient Accelerator Design for FP-INT GEMM Using Look-Up Tables.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

To FP8 and Back Again: Quantifying the Effects of Reducing Precision on LLM Training Stability.

[BibT_eX]

[DOI]

CoRR, 2024

No Token Left Behind: Reliable KV Cache Compression via Importance-Aware Mixed Precision Quantization.

[BibT_eX]

[DOI]

CoRR, 2024

DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation.

[BibT_eX]

[DOI]

CoRR, 2024

DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

32.2 A 24.25-to-29.5GHz Extremely Compact Doherty Power Amplifier with Differential-Breaking Phase Offset Achieving 23.7% PAEavg for 5G Base-Station Transceivers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid-State Circuits Conference, 2024

LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

V-LSTM: An Efficient LSTM Accelerator Using Fixed Nonzero-Ratio Viterbi-Based Pruning.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., October, 2023

Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies.

[BibT_eX]

[DOI]

Bioinform., September, 2023

Fully Parallel, One-Cycle Random Shuffling for Efficient Countermeasure in Post-Quantum Cryptography.

[BibT_eX]

[DOI]

IACR Cryptol. ePrint Arch., 2023

Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Information Geometry of the Retinal Representation Manifold.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Sparsity-Aware Memory Interface Architecture using Stacked XORNet Compression for Accelerating Pruned-DNN Models.

[BibT_eX]

[DOI]

Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer Arithmetic.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

TF-MVP: Novel Sparsity-Aware Transformer Accelerator with Mixed-Length Vector Pruning.

[BibT_eX]

[DOI]

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022

nuQmm: Quantized MatMul for Efficient Inference of Large-Scale Generative Language Models.

[BibT_eX]

[DOI]

CoRR, 2022

Maximum Likelihood Training of Implicit Nonlinear Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2022

Maximum Likelihood Training of Implicit Nonlinear Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Augmenting Magnetic Resonance Imaging with Tabular Features for Enhanced and Interpretable Medial Temporal Lobe Atrophy Prediction.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning in Clinical Neuroimaging - 5th International Workshop, 2022

Volume is All You Need: Improving Multi-task Multiple Instance Learning for WMH Segmentation and Severity Estimation.

[BibT_eX]

[DOI]

Proceedings of the Machine Learning in Clinical Neuroimaging - 5th International Workshop, 2022

Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

DFX: A Low-latency Multi-FPGA Appliance for Accelerating Transformer-based Text Generation.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE Hot Chips 34 Symposium, 2022

AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

Modulating Regularization Frequency for Efficient Compression-Aware Model Training.

[BibT_eX]

[DOI]

CoRR, 2021

Sequential Encryption of Sparse Neural Networks Toward Optimum Representation of Irregular Sparsity.

[BibT_eX]

[DOI]

CoRR, 2021

Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization.

[BibT_eX]

[DOI]

CoRR, 2021

A mechanistically interpretable model of the retinal neural code for natural scenes with multiscale adaptive dynamics.

[BibT_eX]

[DOI]

Proceedings of the 55th Asilomar Conference on Signals, Systems, and Computers, 2021

2020

BiQGEMM: matrix multiplication with lookup table for binary-coding-based quantized DNNs.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2020

FleXOR: Trainable Fractional Quantization.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Structured Compression by Weight Encryption for Unstructured Pruning and Quantization.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Review of On-Device Fully Neural End-to-End Automatic Speech Recognition Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 54th Asilomar Conference on Signals, Systems, and Computers, 2020

2019

Learning Low-Rank Approximation for CNNs.

[BibT_eX]

[DOI]

CoRR, 2019

Structured Compression by Unstructured Pruning for Sparse Quantized Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Network Pruning for Low-Rank Binary Indexing.

[BibT_eX]

[DOI]

CoRR, 2019

Double Viterbi: Weight Encoding for High Compression Ratio and Fast On-Chip Reconstruction for Deep Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

DeepTwist: Learning Model Compression via Occasional Weight Distortion.

[BibT_eX]

[DOI]

Dongsoo Lee

Parichay Kapoor

Byeongwook Kim

CoRR, 2018

Retraining-Based Iterative Weight Quantization for Deep Neural Networks.

[BibT_eX]

[DOI]

Dongsoo Lee

Byeongwook Kim

CoRR, 2018

A Scalable Multi- TeraOPS Deep Learning Processor Core for AI Trainina and Inference.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Symposium on VLSI Circuits, 2018

Across the Stack Opportunities for Deep Learning Acceleration.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Low Power Electronics and Design, 2018

Viterbi-based Pruning for Sparse Matrix with Fixed and High Index Compression Ratio.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Security of Stateful Order-Preserving Encryption.

[BibT_eX]

[DOI]

Proceedings of the Information Security and Cryptology - ICISC 2017 - 20th International Conference, Seoul, South Korea, November 29, 2017

Forward Secure Dynamic Searchable Symmetric Encryption with Efficient Updates.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, 2017

2016

Embedding Read-Only Memory in Spin-Transfer Torque MRAM-Based On-Chip Caches.

[BibT_eX]

[DOI]

Xuanyao Fong

Rangharajan Venkatesan

Dongsoo Lee

Anand Raghunathan

Kaushik Roy

IEEE Trans. Very Large Scale Integr. Syst., 2016

Optimal wavelength provisioning with fuzzy logic control for power saving in TWDM-PONs.

[BibT_eX]

[DOI]

Hark Yoo

Dongsoo Lee

Man-Soo Han

Proceedings of the International Conference on Information and Communication Technology Convergence, 2016

2014

Design and Optimization of Multiple-Mesh Clock Network.

[BibT_eX]

[DOI]

Jinwook Jung

Dongsoo Lee

Youngsoo Shin

Proceedings of the VLSI-SoC: Internet of Things Foundations, 2014

2013

Area Efficient ROM-Embedded SRAM Cache.

[BibT_eX]

[DOI]

Dongsoo Lee

Kaushik Roy

IEEE Trans. Very Large Scale Integr. Syst., 2013

Fast management of ONUs based on broadcast control channel for a 10-gigabit-capable passive optical network (XG-PON) system.

[BibT_eX]

[DOI]

J. Commun. Networks, 2013

2012

Soft-Error-Resilient FPGAs Using Built-In 2-D Hamming Product Code.

[BibT_eX]

[DOI]

Sang Phill Park

Dongsoo Lee

Kaushik Roy

IEEE Trans. Very Large Scale Integr. Syst., 2012

High-performance low-energy STT MRAM based on balanced write scheme.

[BibT_eX]

[DOI]

Dongsoo Lee

Sumeet Kumar Gupta

Kaushik Roy

Proceedings of the International Symposium on Low Power Electronics and Design, 2012

2011

Memory-based embedded digital ATE.

[BibT_eX]

[DOI]

Proceedings of the 29th IEEE VLSI Test Symposium, 2011

Column-selection-enabled 8T SRAM array with ~1R/1W multi-port operation for DVFS-enabled processors.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Symposium on Low Power Electronics and Design, 2011

Viterbi-Based Efficient Test Data Compression.

[BibT_eX]

[DOI]

Dongsoo Lee

Kaushik Roy

Proceedings of the 16th European Test Symposium, 2011

Dongsoo Lee

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...