Daichi Fujiki

Orcid: 0000-0001-7949-0417

According to our database1, Daichi Fujiki authored at least 40 papers between 2016 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Context Memorization for Efficient Long Context Generation.
CoRR, May, 2026

Memory-Efficient and Trustworthy Neural Networks via Random Seed-Based Design.
IEEE Access, 2026

Efficient Vision Transformers via Token Merging with Head-Wise Attention Correction.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

AQPIM: Breaking the PIM Capacity Wall for LLMs with in-Memory Activation Quantization.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2026

A 22nm Continual Learning Accelerator for Autonomous Systems with 69.2TOPS/W Dynamic-Sparse-Weight-Updat and Dual-Mode Vector-Scaled-INT4 Processing.
Proceedings of the IEEE Custom Integrated Circuits Conference, 2026

The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size.
CoRR, September, 2025

DMSA: An Efficient Architecture for Sparse-Sparse Matrix Multiplication Based on Distribute-Merge Product Dataflow.
IEEE Trans. Very Large Scale Integr. Syst., July, 2025

Partially Frozen Random Networks Contain Compact Strong Lottery Tickets.
Trans. Mach. Learn. Res., 2025

TicketLLM: Next-Generation Sparse and Low-bit Transformers with Supermask-based Method.
Trans. Mach. Learn. Res., 2025

Uncovering Strong Lottery Tickets in Graph Transformers: A Path to Memory Efficient and Robust Graph Learning.
Trans. Mach. Learn. Res., 2025

SharK: Enabling High-Performance Range Queries in Key-Value Store Through Vlog Resharding.
IEEE Access, 2025

WhiteDwarf: A Holistic Co-Design Approach to Ultra-Compact Neural Inference Acceleration.
IEEE Access, 2025

BingoGCN: Towards Scalable and Efficient GNN Acceleration with Fine-Grained Partitioning and SLT.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

DX100: Programmable Data Access Accelerator for Indirection.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2025

TTF-GNN: Memory-Efficient GNNs via Tensor Train Decomposition and Network Folding.
Proceedings of the IEEE Symposium on Low-Power and High-Speed Chips and Systems, 2025

2024
Restricted Random Pruning at Initialization for High Compression Range.
Trans. Mach. Learn. Res., 2024

Partial Search in a Frozen Network is Enough to Find a Strong Lottery Ticket.
CoRR, 2024

The Case for Coherence Directories in Memory Cubes.
Proceedings of the 17th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2024

Exploiting N:M Sparsity in Quantized-Folded ResNets: Signed Multicoat Supermasks and Iterative Pruning-Quantization.
Proceedings of the Twelfth International Symposium on Computing and Networking, 2024

WhiteDwarf: 12.24 TFLOPS/W 40 nm Versatile Neural Inference Engine for Ultra-Compact Execution of CNNs and MLPs Through Triple Unstructured Sparsity Exploitation and Triple Model Compression.
Proceedings of the IEEE Asian Solid-State Circuits Conference, 2024

OSA-HCIM: On-The-Fly Saliency-Aware Hybrid SRAM CIM with Dynamic Precision Configuration.
Proceedings of the 29th Asia and South Pacific Design Automation Conference, 2024

2023
HALO-CAT: A Hidden Network Processor with Activation-Localized CIM Architecture and Layer-Penetrative Tiling.
CoRR, 2023

MVC: Enabling Fully Coherent Multi-Data-Views through the Memory Hierarchy with Processing in Memory.
Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, 2023

Vector-Processing for Mobile Devices: Benchmark and Analysis.
Proceedings of the IEEE International Symposium on Workload Characterization, 2023

2022
In-Memory Acceleration for General Data Parallel Applications
PhD thesis, 2022

Multi-Layer In-Memory Processing.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

2021
In-/Near-Memory Computing
Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers, ISBN: 978-3-031-01772-8, 2021

A 2.46M Reads/s Seed-Extension Accelerator for Next-Generation Sequencing Using a String-Independent PE Array.
IEEE J. Solid State Circuits, 2021

2020
SeedEx: A Genome Sequencing Accelerator for Optimal Alignments in Subminimal Space.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

A 2.46M reads/s Genome Sequencing Accelerator using a 625 Processing-Element Array.
Proceedings of the 2020 IEEE Custom Integrated Circuits Conference, 2020

2019
Near-memory data transformation for efficient sparse matrix multi-vector multiplication.
Proceedings of the International Conference for High Performance Computing, 2019

Duality cache for data parallel acceleration.
Proceedings of the 46th International Symposium on Computer Architecture, 2019

2018
AxNoC: Low-power Approximate Network-on-Chips using Critical-Path Isolation.
Proceedings of the Twelfth IEEE/ACM International Symposium on Networks-on-Chip, 2018

GenAx: A Genome Sequencing Accelerator.
Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

In-Memory Data Parallel Processor.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017
High-Bandwidth Low-Latency Approximate Interconnection Networks.
Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

In-memory Data Flow Processor.
Proceedings of the 26th International Conference on Parallel Architectures and Compilation Techniques, 2017

2016
Randomizing Packet Memory Networks for Low-Latency Processor-Memory Communication.
Proceedings of the 24th Euromicro International Conference on Parallel, 2016


  Loading...