Seon Wook Kim

Proceedings of the International Conference on Electronics, Information, and Communication, 2024

Supporting Multi-Channels to DRAM-based PIM Execution for Boosting the Performance.

[BibT_eX]

[DOI]

Junil Kim

Seok Young Kim

Proceedings of the International Conference on Electronics, Information, and Communication, 2024

2023

PISA-DMA: Processing-in-Memory Instruction Set Architecture Using DMA.

[BibT_eX]

[DOI]

IEEE Access, 2023

BL-PIM: Varying the Burst Length to Realize the All-Bank Performance and Minimize the Multi-Workload Interference for in-DRAM PIM.

[BibT_eX]

[DOI]

IEEE Access, 2023

2022

Low-overhead inverted LUT design for bounded DNN activation functions on floating-point vector ALUs.

[BibT_eX]

[DOI]

Microprocess. Microsystems, September, 2022

Silent-PIM: Realizing the Processing-in-Memory Computing With Standard Memory Requests.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2022

Achieving the Performance of All-Bank In-DRAM PIM With Standard Memory Interface: Memory-Computation Decoupling.

[BibT_eX]

[DOI]

IEEE Access, 2022

Extending the ONNX Runtime Framework for the Processing-in-Memory Execution.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Electronics, Information, and Communication, 2022

2021

Monolithic 3D stacked multiply-accumulate units.

[BibT_eX]

[DOI]

Integr., 2021

Tile-based Code Generation for Efficiently Accessing to Scratchpad Memory.

[BibT_eX]

[DOI]

Jaewook Lee

Yoonah Paik

Proceedings of the International Conference on Electronics, Information, and Communication, 2021

Applying Piecewise Linear Approximation for DNN Non-Linear Activation Functions to Bfloat16 MACs.

[BibT_eX]

[DOI]

Seok Young Kim

Chang Hyun Kim

Proceedings of the International Conference on Electronics, Information, and Communication, 2021

2020

Generating Representative Test Sequences from Real Workload for Minimizing DRAM Verification Overhead.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2020

2019

Fault Tolerance Technique Offlining Faulty Blocks by Heap Memory Management.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2019

Reducing DRAM Refresh Rate Using Retention Time Aware Universal Hashing Redundancy Repair.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2019

Design and Implementation of Display Stream Compression Decoder With Line Buffer Optimization.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2019

Design of Processing-"Inside"-Memory Optimized for DRAM Behaviors.

[BibT_eX]

[DOI]

IEEE Access, 2019

Epsim: A Scalable and Parallel Marssx86 Simulator With Exploiting Epoch-Based Execution.

[BibT_eX]

[DOI]

IEEE Access, 2019

Exploring the Relation between Monolithic 3D L1 GPU Cache Capacity and Warp Scheduling Efficiency.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/ACM International Symposium on Low Power Electronics and Design, 2019

2018

Recovering from Biased Distribution of Faulty Cells in Memory by Reorganizing Replacement Regions through Universal Hashing.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2018

Energy-Efficient DRAM Selective Refresh Technique with Page Residence in a Memory Hierarchy of Hardware-Managed TLB.

[BibT_eX]

[DOI]

IEICE Trans. Electron., 2018

2017

Content-Aware Bit Shuffling for Maximizing PCM Endurance.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2017

P-DRAMSim2: Exploiting thread-level parallelism in DRAMSim2.

[BibT_eX]

[DOI]

IEICE Electron. Express, 2017

A decoupled bit shifting technique using data encoding/decoding for DRAM redundancy repair.

[BibT_eX]

[DOI]

IEICE Electron. Express, 2017

2016

JavaScript Parallelizing Compiler for Exploiting Parallelism from Data-Parallel HTML5 Applications.

[BibT_eX]

[DOI]

Yeoul Na

ACM Trans. Archit. Code Optim., 2016

High-throughput low-area design of AES using constant binary matrix-vector multiplication.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2016

2015

Lowering Minimum Supply Voltage for Power-Efficient Cache Design by Exploiting Data Redundancy.

[BibT_eX]

[DOI]

Dongha Jung

Hokyoon Lee

ACM Trans. Design Autom. Electr. Syst., 2015

O2WebCL: an automatic OpenCL-to-WebCL translator for high performance web computing.

[BibT_eX]

[DOI]

J. Supercomput., 2015

D<sup>2</sup>ART: Direct Data Accessing from Passive RFID Tag for infra-less, contact-less, and battery-less pervasive computing.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2015

2014

Web-based image processing using JavaScript and WebCL.

[BibT_eX]

[DOI]

Myeongjin Cho

Proceedings of the IEEE International Conference on Consumer Electronics, 2014

Performance comparison of GCC and LLVM on the EISC processor.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Electronics, Information and Communications, 2014

Performance evaluation of GCC 4.7.1 on EISC.

[BibT_eX]

[DOI]

Miseon Han

Hokyoon Lee

Proceedings of the International Conference on Electronics, Information and Communications, 2014

2013

A Self-Calibrated DLL-Based Clock Generator for an Energy-Aware EISC Processor.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2013

DiSCo: Distributed Scalable Compilation Tool for Heavy Compilation Workload.

[BibT_eX]

[DOI]

Kyongjin Jo

Jong-Kook Kim

IEICE Trans. Inf. Syst., 2013

2012

Resource Efficient Implementation of Low Power MB-OFDM PHY Baseband Modem With Highly Parallel Architecture.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2012

AndroScope for detailed performance study of the android platform and its applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Consumer Electronics, 2012

2011

A Reconfigurable FIR Filter Architecture to Trade Off Filter Performance for Dynamic Power Consumption.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2011

A processor-based decoupled timing controller for flexible and low-cost 2D/3D plasma display panel design.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., 2011

Applying frame layout to hardware design in FPGA for seamless support of cross calls in CPU-FPGA coupling architecture.

[BibT_eX]

[DOI]

Giang Nguyen Huong

Yeoul Na

Microprocess. Microsystems, 2011

Runtime parallelization of legacy code on a transactional memory system.

[BibT_eX]

[DOI]

Matthew DeVuyst

Dean M. Tullsen

Proceedings of the High Performance Embedded Architectures and Compilers, 2011

2010

A Novel Architecture for Block Interleaving Algorithm in MB-OFDM Using Mixed Radix System.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2010

Hierarchical data structure-based timing controller design for plasma display panels.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Support of cross calls between a microprocessor and FPGA in CPU-FPGA coupling architecture.

[BibT_eX]

[DOI]

Giang Nguyen Huong

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Design issues and optimization in DisplayPort link layer implementation.

[BibT_eX]

[DOI]

Jaegeun Oh

Taejin Kim

Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2010

Design of ultra low power stream data receiver based on UHF passive RFID tag system.

[BibT_eX]

[DOI]

Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2010

Implementation of x86 Binary-to-C Translator by Using GNU Tools.

[BibT_eX]

[DOI]

Kirill Makankov

Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009

A low-power baseband modem architecture for a mobile RFID reader.

[BibT_eX]

[DOI]

J. Embed. Comput., 2009

2008

Virtual Memory and Buffer Storage.

[BibT_eX]

[DOI]

Jong-Kook Kim

Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

A Reconfigurable Processor Infrastructure for Accelerating Java Applications.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2008

Applying passive RFID system to wireless headphones for extreme low power consumption.

[BibT_eX]

[DOI]

Proceedings of the 45th Design Automation Conference, 2008

A DC-DC converter with a dual VCDL-based ADC and a self-calibrated DLL-based clock generator for an energy-aware EISC processor.

[BibT_eX]

[DOI]

Proceedings of the IEEE 2008 Custom Integrated Circuits Conference, 2008

2007

Performance Study of Anti-collision Algorithms for EPC-C1 Gen2 RFID Protocol.

[BibT_eX]

[DOI]

Joon Goo Lee

Proceedings of the Information Networking. Towards Ubiquitous Networking and Services, 2007

Compiler Construction for Lockstep Execution of Multithreaded Processors.

[BibT_eX]

[DOI]

Huong Giang Nguyen

Proceedings of the Seventh International Conference on Computer and Information Technology (CIT 2007), 2007

A Dataflow Analysis for Mode Set Optimization in DSP Instruction Sets.

[BibT_eX]

[DOI]

Jiho Chu

Proceedings of the Seventh International Conference on Computer and Information Technology (CIT 2007), 2007

2006

Exploiting reference idempotency to reduce speculative storage overflow.

[BibT_eX]

[DOI]

ACM Trans. Program. Lang. Syst., 2006

A New Energy x Delay-Aware Flip-Flop.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2006

Implementation of H.264/AVC decoder for mobile video applications.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Jaguar: a compiler infrastructure for Java reconfigurable computing.

[BibT_eX]

[DOI]

Proceedings of the ACM/SIGDA 14th International Symposium on Field Programmable Gate Arrays, 2006

A Multi-protocol Baseband Modem Processor for a Mobile RFID Reader.

[BibT_eX]

[DOI]

Proceedings of the Embedded and Ubiquitous Computing, International Conference, 2006

Code Generation and Optimization for Java-to-C Compilers.

[BibT_eX]

[DOI]

Proceedings of the Emerging Directions in Embedded and Ubiquitous Computing, 2006

Implementation of H.264/AVC decoder for mobile video applications.

[BibT_eX]

[DOI]

Proceedings of the 2006 Conference on Asia South Pacific Design Automation: ASP-DAC 2006, 2006

OpenMP Directive Extension for BlackFin 561 Dual Core Processor.

[BibT_eX]

[DOI]

Hee Seo

Proceedings of the Sixth International Conference on Computer and Information Technology (CIT 2006), 2006

2005

Study of OpenMP applications on the InfiniBand-based software distributed shared-memory system.

[BibT_eX]

[DOI]

Inho Park

Parallel Comput., 2005

The distributed virtual shared-memory system based on the InfiniBand architecture.

[BibT_eX]

[DOI]

Inho Park

J. Parallel Distributed Comput., 2005

Implementation of H.264/AVC baseline profile decoder for mobile video applications.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE International Conference on Electronics, 2005

2004

Charge-Sharing-Problem Reduced Split-Path Domino Logic.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on VLSI Design (VLSI Design 2004), 2004

A Distributed-Request-Based DiffServ CAC for Seamless Fast-Handoff in Mobile Internet.

[BibT_eX]

[DOI]

Proceedings of the Quality of Service in the Emerging Networking Panorama: Fifth International Workshop on Quality of Future Internet Services, 2004

Implementation of the Software Distributed Shared-Memory System on the InfiniBand.

[BibT_eX]

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2004

Implementation of a low power motion detection camera processor using a CMOS Image Sensor.

[BibT_eX]

Suh Ho Lee

Suki Kim

Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Characterization of OpenMP Applications on the InfiniBand-Based Distributed Virtual Shared Memory System.

[BibT_eX]

[DOI]

Inho Park

Kyung Park

Proceedings of the High Performance Computing, 2004

2003

OpenMP and Compilation Issue in Embedded Applications.

[BibT_eX]

[DOI]

Jaegeun Oh

Chulwoo Kim

Proceedings of the OpenMP Shared Memory Parallel Programming, 2003

Parallelizing Parallel Rollout Algorithm for Solving Markov Decision Processes.

[BibT_eX]

[DOI]

Hyeong Soo Chang

Proceedings of the OpenMP Shared Memory Parallel Programming, 2003

Dynamic Instrumentation of Large-Scale MPI and OpenMP Applications.

[BibT_eX]

[DOI]

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

2002

VGV: Supporting Performance Analysis of Object-Oriented Mixed MPI/OpenMP Parallel Applications.

[BibT_eX]

[DOI]

Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

2001

Parallel programming environment for OpenMP.

[BibT_eX]

[DOI]

Sci. Program., 2001

Portable Compilers for OpenMP.

[BibT_eX]

[DOI]

Proceedings of the OpenMP Shared Memory Parallel Programming, 2001

Reference idempotency analysis: a framework for optimizing speculative execution.

[BibT_eX]

[DOI]

Proceedings of the 2001 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPOPP'01), 2001

The Structure of a Compiler for Explicit and Implicit Parallelism.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2001

Multiplex: unifying conventional and speculative thread-level parallelism on a chip multiprocessor.

[BibT_eX]

[DOI]

Proceedings of the 15th international conference on Supercomputing, 2001

2000

Where Does the Speedup Go: Quantitative Modeling of Performance Losses in Shared-Memory Programs.

[BibT_eX]

[DOI]

Parallel Process. Lett., 2000

A Performance Advisor Tool for Shared-Memory Parallel Programming.

[BibT_eX]

[DOI]

Insung Park

Proceedings of the Languages and Compilers for Parallel Computing, 2000

Quantifying Differences between OpenMP and MPI Using a Large-Scale Application Suite.

[BibT_eX]

[DOI]

Brian Armstrong

Proceedings of the High Performance Computing, Third International Symposium, 2000

Compiler Techniques for Energy Saving in Instruction Caches of Speculative Parallel Microarchitectures.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Conference on Parallel Processing, 2000

1999

Compiling for Speculative Architectures.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 1999

1998

Compiler-Based Tools for Analyzing Parallel Programs.

[BibT_eX]

[DOI]

Parallel Comput., 1998

1995

An Extended Fuzzy Clustering Algorithm and its Application.

[BibT_eX]

[DOI]

Su Hwan Kim

Tae Won Rhee

J. Circuits Syst. Comput., 1995

1993

Full Adder-based Inner Product Step Processors for Residue and Quadratic Residue Number Systems.

[BibT_eX]