WhiteDwarf: 12.24 TFLOPS/W 40 nm Versatile Neural Inference Engine for Ultra-Compact Execution of CNNs and MLPs Through Triple Unstructured Sparsity Exploitation and Triple Model Compression.

[BibT_eX]

[DOI]

Yasuyuki Okoshi

Ángel López García-Arias

Proceedings of the IEEE Asian Solid-State Circuits Conference, 2024

OSA-HCIM: On-The-Fly Saliency-Aware Hybrid SRAM CIM with Dynamic Precision Configuration.

[BibT_eX]

[DOI]

Yung-Chin Chen

Shimpei Ando

Daichi Fujiki

Shinya Takamaeda-Yamazaki

Kentaro Yoshioka

Proceedings of the 29th Asia and South Pacific Design Automation Conference, 2024

2023

HALO-CAT: A Hidden Network Processor with Activation-Localized CIM Architecture and Layer-Penetrative Tiling.

[BibT_eX]

[DOI]

Yung-Chin Chen

Shimpei Ando

Daichi Fujiki

Shinya Takamaeda-Yamazaki

Kentaro Yoshioka

CoRR, 2023

MVC: Enabling Fully Coherent Multi-Data-Views through the Memory Hierarchy with Processing in Memory.

[BibT_eX]

[DOI]

Daichi Fujiki

Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

Vector-Processing for Mobile Devices: Benchmark and Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Workload Characterization, 2023

2022

In-Memory Acceleration for General Data Parallel Applications

[BibT_eX]

[DOI]

Daichi Fujiki

PhD thesis, 2022

Multi-Layer In-Memory Processing.

[BibT_eX]

[DOI]

Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

2021

In-/Near-Memory Computing

[BibT_eX]

[DOI]

Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers, ISBN: 978-3-031-01772-8, 2021

A 2.46M Reads/s Seed-Extension Accelerator for Next-Generation Sequencing Using a String-Independent PE Array.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2021

2020

SeedEx: A Genome Sequencing Accelerator for Optimal Alignments in Subminimal Space.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

A 2.46M reads/s Genome Sequencing Accelerator using a 625 Processing-Element Array.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE Custom Integrated Circuits Conference, 2020

2019

Near-memory data transformation for efficient sparse matrix multi-vector multiplication.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2019

Duality cache for data parallel acceleration.

[BibT_eX]

[DOI]

Daichi Fujiki

Scott A. Mahlke

Reetuparna Das

Proceedings of the 46th International Symposium on Computer Architecture, 2019

2018

AxNoC: Low-power Approximate Network-on-Chips using Critical-Path Isolation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth IEEE/ACM International Symposium on Networks-on-Chip, 2018

GenAx: A Genome Sequencing Accelerator.

[BibT_eX]

[DOI]

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

In-Memory Data Parallel Processor.

[BibT_eX]

[DOI]

Daichi Fujiki

Scott A. Mahlke

Reetuparna Das

Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017

High-Bandwidth Low-Latency Approximate Interconnection Networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

In-memory Data Flow Processor.

[BibT_eX]

[DOI]

Daichi Fujiki

Scott A. Mahlke

Reetuparna Das

Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017

2016

Randomizing Packet Memory Networks for Low-Latency Processor-Memory Communication.

[BibT_eX]

[DOI]

Proceedings of the 24th Euromicro International Conference on Parallel, 2016

Daichi Fujiki

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...