Hiroki Nakahara

Orcid: 0000-0002-5701-7466

Affiliations:
  • Tokyo Institute of Technology, Japan
  • Ehime University, Matsuyama, Japan
  • Kagoshima University, Japan
  • Kyushu Institute of Technology, Department of Computer Science and Electronics, Fukuoka, Japan (PhD 2007)


According to our database1, Hiroki Nakahara authored at least 105 papers between 2005 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Remarn: A Reconfigurable Multi-threaded Multi-core Accelerator for Recurrent Neural Networks.
ACM Trans. Reconfigurable Technol. Syst., March, 2023

A Many-core Architecture for an Ensemble Ternary Neural Network Toward High-Throughput Inference.
Proceedings of the 16th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2023

A Consideration on Ternary Adversarial Generative Networks.
Proceedings of the 53rd IEEE International Symposium on Multiple-Valued Logic, 2023

A Light-Weight Vision Transformer Toward Near Memory Computation on an FPGA.
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2023

2022
Multilayer Perceptron Training Accelerator Using Systolic Array.
IEICE Trans. Inf. Syst., December, 2022

Recurrent Neural Networks With Column-Wise Matrix-Vector Multiplication on FPGAs.
IEEE Trans. Very Large Scale Integr. Syst., 2022

Optimizations of Ternary Generative Adversarial Networks.
Proceedings of the 52nd IEEE International Symposium on Multiple-Valued Logic, 2022

Fast Interface with Ensemble Ternary Neural Network.
Proceedings of the 52nd IEEE International Symposium on Multiple-Valued Logic, 2022

Message from the General Chair and Program Co-Chairs.
Proceedings of the International Conference on Field-Programmable Technology, 2022

Preface.
Proceedings of the International Conference on Field-Programmable Technology, 2022

2021
FPGA-Based Inter-layer Pipelined Accelerators for Filter-Wise Weight-Balanced Sparse Fully Convolutional Networks with Overlapped Tiling.
J. Signal Process. Syst., 2021

Energy-Efficient ECG Signals Outlier Detection Hardware Using a Sparse Robust Deep Autoencoder.
IEICE Trans. Inf. Syst., 2021

A Low-Latency Inference of Randomly Wired Convolutional Neural Networks on an FPGA.
IEICE Trans. Inf. Syst., 2021

Weight Sparseness for a Feature-Map-Split-CNN Toward Low-Cost Embedded FPGAs.
IEICE Trans. Inf. Syst., 2021

A novel compact planar magic-T using CPS and microstrip-to-CPS transition.
IEICE Electron. Express, 2021

Detection of Wrong Direction Vehicles on Two-Way Traffic.
Proceedings of the 13th International Conference on Knowledge and Systems Engineering, 2021

A High-Throughput Detection Circuit based on 2<sup>q</sup>+1-Valued Deep Neural Networks.
Proceedings of the 51st IEEE International Symposium on Multiple-Valued Logic, 2021

Edge Inference Engine for Deep & Random Sparse Neural Networks with 4-bit Cartesian-Product MAC Array and Pipelined Activation Aligner.
Proceedings of the IEEE Hot Chips 33 Symposium, 2021

2020
SENTEI: Filter-Wise Pruning with Distillation towards Efficient Sparse Convolutional Neural Network Accelerators.
IEICE Trans. Inf. Syst., 2020

A Table Look-Up Based Ternary Neural Network Processor.
Proceedings of the 50th IEEE International Symposium on Multiple-Valued Logic, 2020

2<sup>n</sup>+1-valued SSS-Net: Uniform Shift, Channel Sparseness, and Channel Shuffle.
Proceedings of the 50th IEEE International Symposium on Multiple-Valued Logic, 2020

Fast Monocular Depth Estimation on an FPGA.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Design Method for an LUT Network-Based CNN with a Sparse Local Convolution.
Proceedings of the International Conference on Field-Programmable Technology, 2020

A Reconfigurable Multithreaded Accelerator for Recurrent Neural Networks.
Proceedings of the International Conference on Field-Programmable Technology, 2020

An FPGA-Based Low-Latency Accelerator for Randomly Wired Neural Networks.
Proceedings of the 30th International Conference on Field-Programmable Logic and Applications, 2020

R2CNN: Recurrent Residual Convolutional Neural Network on FPGA.
Proceedings of the FPGA '20: The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2020

Optimizing Reconfigurable Recurrent Neural Networks.
Proceedings of the 28th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2020

High-Throughput Convolutional Neural Network on an FPGA by Customized JPEG Compression.
Proceedings of the 28th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2020

Tiny On-Chip Memory Realization of Weight Sparseness Split-CNNs on Low-end FPGAs.
Proceedings of the 28th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2020

2019
Power Efficient Object Detector with an Event-Driven Camera for Moving Object Surveillance on an FPGA.
IEICE Trans. Inf. Syst., 2019

GUINNESS: A GUI Based Binarized Deep Neural Network Framework for Software Programmers.
IEICE Trans. Inf. Syst., 2019

FPGA-based Accurate Pedestrian Detection with Thermal Camera for Surveillance System.
Proceedings of the 2019 International Conference on ReConFigurable Computing and FPGAs, 2019

Many Universal Convolution Cores for Ensemble Sparse Convolutional Neural Networks.
Proceedings of the 13th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2019

Noise Convolutional Neural Networks and FPGA Implementation.
Proceedings of the 2019 IEEE 49th International Symposium on Multiple-Valued Logic (ISMVL), 2019

A Dataflow Pipelining Architecture for Tile Segmentation with a Sparse MobileNet on an FPGA.
Proceedings of the International Conference on Field-Programmable Technology, 2019

An FPGA Implementation of Real-Time Object Detection with a Thermal Camera.
Proceedings of the 29th International Conference on Field Programmable Logic and Applications, 2019

FPGA-Based Training Accelerator Utilizing Sparseness of Convolutional Neural Network.
Proceedings of the 29th International Conference on Field Programmable Logic and Applications, 2019

Real-Time Multi-Pedestrian Detection in Surveillance Camera using FPGA.
Proceedings of the 29th International Conference on Field Programmable Logic and Applications, 2019

An FPGA-based Fine Tuning Accelerator for a Sparse CNN.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

Filter-Wise Pruning Approach to FPGA Implementation of Fully Convolutional Network for Semantic Segmentation.
Proceedings of the Applied Reconfigurable Computing - 15th International Symposium, 2019

2018
BRein Memory: A Single-Chip Binary/Ternary Reconfigurable in-Memory Deep Neural Network Accelerator Achieving 1.4 TOPS at 0.6 W.
IEEE J. Solid State Circuits, 2018

An FPGA Realization of a Random Forest with <i>k</i>-Means Clustering Using a High-Level Synthesis Design.
IEICE Trans. Inf. Syst., 2018

A Threshold Neuron Pruning for a Binarized Deep Neural Network on an FPGA.
IEICE Trans. Inf. Syst., 2018

New Generation Dynamically Reconfigurable Processor Technology for Accelerating Embedded AI Applications.
Proceedings of the 2018 IEEE Symposium on VLSI Circuits, 2018

A Ternary Weight Binary Input Convolutional Neural Network: Realization on the Embedded Processor.
Proceedings of the 48th IEEE International Symposium on Multiple-Valued Logic, 2018

A High-speed Low-power Deep Neural Network on an FPGA based on the Nested RNS: Applied to an Object Detector.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Power Efficient Object Detector with an Event-Driven Camera on an FPGA.
Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, 2018

A Performance Per Power Efficient Object Detector on an FPGA for Robot Operating System (ROS).
Proceedings of the 9th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, 2018

A Tri-State Weight Convolutional Neural Network for an FPGA: Applied to YOLOv2 Object Detector.
Proceedings of the International Conference on Field-Programmable Technology, 2018

An FPGA Realization of OpenPose Based on a Sparse Weight Convolutional Neural Network.
Proceedings of the International Conference on Field-Programmable Technology, 2018

Demonstration of Object Detection for Event-Driven Cameras on FPGAs and GPUs.
Proceedings of the 28th International Conference on Field Programmable Logic and Applications, 2018

A Demonstration of FPGA-Based You Only Look Once Version2 (YOLOv2).
Proceedings of the 28th International Conference on Field Programmable Logic and Applications, 2018

A Lightweight YOLOv2: A Binarized CNN with A Parallel Support Vector Regression for an FPGA.
Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2018

Hardware Algorithms.
Proceedings of the Principles and Structures of FPGAs., 2018

2017
In-memory area-efficient signal streaming processor design for binary neural networks.
Proceedings of the IEEE 60th International Midwest Symposium on Circuits and Systems, 2017

A Random Forest Using a Multi-valued Decision Diagram on an FPGA.
Proceedings of the 47th IEEE International Symposium on Multiple-Valued Logic, 2017

On-Chip Memory Based Binarized Convolutional Deep Neural Network Applying Batch Normalization Free Technique on an FPGA.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

All binarized convolutional neural network and its implementation on an FPGA.
Proceedings of the International Conference on Field Programmable Technology, 2017

An object detector based on multiscale sliding window search using a fully pipelined binarized CNN on an FPGA.
Proceedings of the International Conference on Field Programmable Technology, 2017

A fully connected layer elimination for a binarizec convolutional neural network on an FPGA.
Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017

A Batch Normalization Free Binarized Convolutional Deep Neural Network on an FPGA (Abstract Only).
Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2017

An FPGA Realization of a Deep Convolutional Neural Network Using a Threshold Neuron Pruning.
Proceedings of the Applied Reconfigurable Computing - 13th International Symposium, 2017

2016
An FFT Circuit for a Spectrometer of a Radio Telescope using the Nested RNS including the Constant Division.
SIGARCH Comput. Archit. News, 2016

An Update Method for a Low Power Cam Emulator Using an LUT Cascade Based on an EVMDD (<i>k</i>).
J. Multiple Valued Log. Soft Comput., 2016

LUT Cascades Based on Edge-Valued Multi-Valued Decision Diagrams: Application to Packet Classification.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2016

An FFT Circuit Using Nested RNS in a Digital Spectrometer for a Radio Telescope.
Proceedings of the 46th IEEE International Symposium on Multiple-Valued Logic, 2016

A memory-based realization of a binarized deep convolutional neural network.
Proceedings of the 2016 International Conference on Field-Programmable Technology, 2016

An acceleration of a random forest classification using Altera SDK for OpenCL.
Proceedings of the 2016 International Conference on Field-Programmable Technology, 2016

2015
A Memory-Based IPv6 Lookup Architecture Using Parallel Index Generation Units.
IEICE Trans. Inf. Syst., 2015

An RNS FFT Circuit Using LUT Cascades Based on a Modulo EVMDD.
Proceedings of the 2015 IEEE International Symposium on Multiple-Valued Logic, 2015

A deep convolutional neural network based on nested residue number system.
Proceedings of the 25th International Conference on Field Programmable Logic and Applications, 2015

A Dynamically Reconfigurable Mixed Analog-Digital Filter Bank.
Proceedings of the Applied Reconfigurable Computing - 11th International Symposium, 2015

2014
A Heterogeneous Multi-valued Decision Diagram Machine for Encoded Characteristic Function for Non-zero Outputs.
J. Multiple Valued Log. Soft Comput., 2014

A Packet Classifier Based on Prefetching EVMDD (<i>k</i>) Machines.
IEICE Trans. Inf. Syst., 2014

An AWF digital spectrometer for a radio telescope.
Proceedings of the 2014 International Conference on ReConFigurable Computing and FPGAs, 2014

An Update Method for a CAM Emulator Using an LUT Cascade Based on an EVMDD (K).
Proceedings of the IEEE 44th International Symposium on Multiple-Valued Logic, 2014

Automatic adjustment system for optical interconnection transmitter using improved particle swarm optimization.
Proceedings of the 2014 International Symposium on Integrated Circuits (ISIC), 2014

2013
A Virus Scanning Engine Using an MPU and an IGU Based on Row-Shift Decomposition.
IEICE Trans. Inf. Syst., 2013

A Machine to Evaluate Decomposed Multi-Terminal Multi-Valued Decision Diagrams for Characteristic Functions.
Proceedings of the 43rd IEEE International Symposium on Multiple-Valued Logic, 2013

A high-speed FFT based on a six-step algorithm: Applied to a radio telescope for a solar radio burst.
Proceedings of the 2013 International Conference on Field-Programmable Technology, 2013

A packet classifier using LUT cascades based on EVMDDS (k).
Proceedings of the 23rd International Conference on Field programmable Logic and Applications, 2013

An Architecture for IPv6 Lookup Using Parallel Index Generation Units.
Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2013

2012
On a wideband fast fourier transform for a radio telescope.
SIGARCH Comput. Archit. News, 2012

A Comparison of Multi-Valued and Heterogeneous Decision Diagram Machines.
J. Multiple Valued Log. Soft Comput., 2012

A regular expression matching circuit: Decomposed non-deterministic realization with prefix sharing and multi-character transition.
Microprocess. Microsystems, 2012

A Design Method of a Regular Expression Matching Circuit Based on Decomposed Automaton.
IEICE Trans. Inf. Syst., 2012

Multi-terminal Multi-valued Decision Diagrams for Characteristic Function Representing Cluster Decomposition.
Proceedings of the 42nd IEEE International Symposium on Multiple-Valued Logic, 2012

On a Wideband Fast Fourier Transform Using Piecewise Linear Approximations: Application to a Radio Telescope Spectrometer.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2012

A Low-Cost and High-Performance Virus Scanning Engine Using a Binary CAM Emulator and an MPU.
Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2012

2011
A Comparison of Heterogeneous Multi-valued Decision Diagram Machines for Multiple-Output Logic Functions.
Proceedings of the 41st IEEE International Symposium on Multiple-Valued Logic, 2011

A Regular Expression Matching Circuit Based on a Decomposed Automaton.
Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2011

2010
A Quaternary Decision Diagram Machine: Optimization of Its Code.
IEICE Trans. Inf. Syst., 2010

A Parallel Branching Program Machine for Sequential Circuits: Implementation and Evaluation.
IEICE Trans. Inf. Syst., 2010

A Comparison of Architectures for Various Decision Diagram Machines.
Proceedings of the 40th IEEE International Symposium on Multiple-Valued Logic, 2010

A Packet Classifier Using a Parallel Branching Program Machine.
Proceedings of the 13th Euromicro Conference on Digital System Design, 2010

2009
A Quaternary Decision Diagram Machine and the Optimization of its Code.
Proceedings of the ISMVL 2009, 2009

A virus scanning engine using a parallel finite-input memory machine and MPUs.
Proceedings of the 19th International Conference on Field Programmable Logic and Applications, 2009

The Parallel Sieve Method for a Virus Scanning Engine.
Proceedings of the 12th Euromicro Conference on Digital System Design, 2009

A Parallel Branching Program Machine for Emulation of Sequential Circuits.
Proceedings of the Reconfigurable Computing: Architectures, 2009

2007
A CAM Emulator Using Look-Up Table Cascades.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Implementations of Reconfigurable Logic Arrays on FPGAs.
Proceedings of the 2007 International Conference on Field-Programmable Technology, 2007

2006
A PC-Based Logic Simulator Using a Look-Up Table Cascade Emulator.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2006

A Soft Error Tolerant LUT Cascade Emulator.
Proceedings of the 15th Asian Test Symposium, 2006

A fast logic simulator using a look up table cascade emulator.
Proceedings of the 2006 Conference on Asia South Pacific Design Automation: ASP-DAC 2006, 2006

2005
A Design Algorithm for Sequential Circuits Using LUT Rings.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005


  Loading...