Jaeha Kung

CoRR, 2024

A Full SW-HW Demonstration of GEMM Accelerators with RISC-V Instruction Extensions.

[BibT_eX]

[DOI]

Seonghun Jeong

Jooyeon Lee

Proceedings of the International Conference on Electronics, Information, and Communication, 2024

NDPipe: Exploiting Near-data Processing for Scalable Inference and Continuous Training in Photo Storage.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

FlexBlock: A Flexible DNN Training Accelerator With Multi-Mode Block Floating Point Support.

[BibT_eX]

[DOI]

IEEE Trans. Computers, September, 2023

Simplified Compressor and Encoder Designs for Low-Cost Approximate Radix-4 Booth Multiplier.

[BibT_eX]

[DOI]

Gunho Park

Youngjoo Lee

IEEE Trans. Circuits Syst. II Express Briefs, March, 2023

All-rounder: A flexible DNN accelerator with diverse data format support.

[BibT_eX]

[DOI]

CoRR, 2023

Improving Hardware Efficiency of a Sparse Training Accelerator by Restructuring a Reduction Network.

[BibT_eX]

[DOI]

Banseok Shin

Sehun Park

Proceedings of the 21st IEEE Interregional NEWCAS Conference, 2023

A 1V 136.6dB-DR 4kHz-BW $\Delta\Sigma$ Current-to-Digital Converter with a Truncation-Noise-Shaped Baseline-Servo-Loop in 0.18\mu\mathrm{m}$ CMOS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid- State Circuits Conference, 2023

DBPS: Dynamic Block Size and Precision Scaling for Efficient DNN Training Supported by RISC-V ISA Extensions.

[BibT_eX]

[DOI]

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022

Implication of Optimizing NPU Dataflows on Neural Architecture Search for Mobile Devices.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2022

AutoRelax: HW-SW Co-Optimization for Efficient SpGEMM Operations With Automated Relaxation in Deep Learning.

[BibT_eX]

[DOI]

Sehun Park

Jae-Joon Kim

IEEE Trans. Emerg. Top. Comput., 2022

A 46-nF/10-MΩ Range 114-aF/0.37-Ω Resolution Parasitic- and Temperature-Insensitive Reconfigurable RC-to-Digital Converter in 0.18-μm CMOS.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2022

SEMS: Scalable Embedding Memory System for Accelerating Embedding-Based DNNs.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2022

LightNorm: Area and Energy-Efficient Batch Normalization Hardware for On-Device DNN Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE 40th International Conference on Computer Design, 2022

DualPIM: A Dual-Precision and Low-Power CNN Inference Engine Using SRAM- and eDRAM-based Processing-in-Memory Arrays.

[BibT_eX]

[DOI]

Proceedings of the 4th IEEE International Conference on Artificial Intelligence Circuits and Systems, 2022

2021

High-throughput Near-Memory Processing on CNNs with 3D HBM-like Memory.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2021

Design and Analysis of Approximate Compressors for Balanced Error Accumulation in MAC Operator.

[BibT_eX]

[DOI]

Gunho Park

Youngjoo Lee

IEEE Trans. Circuits Syst. I Regul. Pap., 2021

Deep Partitioned Training From Near-Storage Computing to DNN Accelerators.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2021

Adaptive Input-to-Neuron Interlink Development in Training of Spike-Based Liquid State Machines.

[BibT_eX]

[DOI]

Sangwoo Hwang

Junghyup Lee

Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

ZeBRA: Precisely Destroying Neural Networks with Zero-Data Based Repeated Bit Flip Attack.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Balancing Computation Loads and Optimizing Input Vector Loading in LSTM Accelerators.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Towards Scalable Analytics with Inference-Enabled Solid-State Drives.

[BibT_eX]

[DOI]

Minsub Kim

Sungjin Lee

IEEE Comput. Archit. Lett., 2020

Defending Against Flush+Reload Attack With DRAM Cache by Bypassing Shared SRAM Cache.

[BibT_eX]

[DOI]

IEEE Access, 2020

2019

Noise Tolerance of an Energy-Scalable Deep Learning Model with Two Extreme Bit-Precisions.

[BibT_eX]

[DOI]

Sangwoo Jung

Proceedings of the 2019 International SoC Design Conference, 2019

WMixNet: An Energy-Scalable and Computationally Lightweight Deep Learning Accelerator.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/ACM International Symposium on Low Power Electronics and Design, 2019

Similarity-Based LSTM Architecture for Energy-Efficient Edge-Level Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/ACM International Symposium on Low Power Electronics and Design, 2019

Peregrine: A Flexible Hardware Accelerator for LSTM with Limited Synaptic Connection Patterns.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Design Automation Conference 2019, 2019

2018

Efficient Object Detection Using Embedded Binarized Neural Networks.

[BibT_eX]

[DOI]

David C. Zhang

Gooitzen S. van der Wal

Sek M. Chai

J. Signal Process. Syst., 2018

Adaptive Precision Cellular Nonlinear Network.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2018

Maximizing system performance by balancing computation loads in LSTM accelerators.

[BibT_eX]

[DOI]

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

The CAMEL approach to stacked sensor smart cameras.

[BibT_eX]

[DOI]

Burhan Ahmad Musassar

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

2017

Energy-efficient digital hardware platform for learning complex systems.

[BibT_eX]

[DOI]

PhD thesis, 2017

A Power-Aware Digital Multilayer Perceptron Accelerator with On-Chip Training Based on Approximate Computing.

[BibT_eX]

[DOI]

IEEE Trans. Emerg. Top. Comput., 2017

Energy-efficient neural image processing for Internet-of-Things edge devices.

[BibT_eX]

[DOI]

Proceedings of the IEEE 60th International Midwest Symposium on Circuits and Systems, 2017

A Programmable Hardware Accelerator for Simulating Dynamical Systems.

[BibT_eX]

[DOI]

Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

On-chip training of recurrent neural networks with limited numerical precision.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Adaptive weight compression for memory-efficient neural networks.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

2016

Dynamic Approximation with Feedback Control for Energy-Efficient Recurrent Neural Network Hardware.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Symposium on Low Power Electronics and Design, 2016

Neurocube: A Programmable Digital Neuromorphic Architecture with High-Density 3D Memory.

[BibT_eX]

[DOI]

Sek M. Chai

Sudhakar Yalamanchili

Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

ReRAM Crossbar based Recurrent Neural Network for human activity detection.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

2015

On the Impact of Energy-Accuracy Tradeoff in a Digital Cellular Neural Network for Image Processing.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2015

A power-aware digital feedforward neural network platform with backpropagation driven approximate synapses.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2015

2011

Compact thermal models: Assessment and pitfalls.

[BibT_eX]

[DOI]