Masato Motomura

Proceedings of the IEEE Asian Solid-State Circuits Conference, 2024

Sparse-Sparse Matrix Multiplication Accelerator on FPGA featuring Distribute-Merge Product Dataflow.

[BibT_eX]

[DOI]

Proceedings of the 29th Asia and South Pacific Design Automation Conference, 2024

Progressive Variable Precision DNN With Bitwise Ternary Accumulation.

[BibT_eX]

[DOI]

Proceedings of the 6th IEEE International Conference on AI Circuits and Systems, 2024

2023

A Fully-Parallel Annealing Algorithm with Autonomous Pinning Effect Control for Various Combinatorial Optimization Problems.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., December, 2023

TT-MLP: Tensor Train Decomposition on Deep MLPs.

[BibT_eX]

[DOI]

IEEE Access, 2023

Recurrent Residual Networks Contain Stronger Lottery Tickets.

[BibT_eX]

[DOI]

IEEE Access, 2023

Pianissimo: A Sub-mW Class DNN Accelerator with Progressive Bit-by-Bit Datapath Architecture for Adaptive Inference at Edge.

[BibT_eX]

[DOI]

Junnosuke Suzuki

Jaehoon Yu

Mari Yasunaga

Proceedings of the 2023 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits), 2023

A Highly Accurate and Parallel Vision MLP FPGA Accelerator based on FP7/8 SIMD Operations.

[BibT_eX]

[DOI]

Proceedings of the 16th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2023

Multicoated and Folded Graph Neural Networks With Strong Lottery Tickets.

[BibT_eX]

[DOI]

Jiale Yan

Hiroaki Ito

Proceedings of the Learning on Graphs Conference, 27-30 November 2023, Virtual Event., 2023

Amorphica: 4-Replica 512 Fully Connected Spin 336MHz Metamorphic Annealer with Programmable Optimization Strategy and Compressed-Spin-Transfer Multi-Chip Extension.

[BibT_eX]

[DOI]

Bruno Hideki Fukushima-Kimura

Kota Ando

Ryota Yasudo

Thiem Van Chu

Proceedings of the IEEE International Solid- State Circuits Conference, 2023

Decision Forest Training Accelerator Based on Binary Feature Decomposition.

[BibT_eX]

[DOI]

Proceedings of the 31st IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2023

Flexibly Controllable Dynamic Cooling Methods for Solid-State Annealing Processors to Improve Combinatorial Optimization Performance.

[BibT_eX]

[DOI]

Proceedings of the IEEE Symposium in Low-Power and High-Speed Chips, 2023

2022

A Hybrid Integer Encoding Method for Obtaining High-Quality Solutions of Quadratic Knapsack Problems on Solid-State Annealers.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., December, 2022

Real-Time Tone Mapping: A Survey and Cross-Implementation Hardware Benchmark.

[BibT_eX]

[DOI]

Yafei Ou

IEEE Trans. Circuits Syst. Video Technol., 2022

Hiddenite: 4K-PE Hidden Network Inference 4D-Tensor Engine Exploiting On-Chip Model Construction Achieving 34.8-to-16.0TOPS/W for CIFAR-100 and ImageNet.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid-State Circuits Conference, 2022

APC-SCA: A Fully-Parallel Annealing Algorithm with Autonomous Pinning Effect Control.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Multicoated Supermasks Enhance Hidden Networks.

[BibT_eX]

[DOI]

Yasuyuki Okoshi

Proceedings of the International Conference on Machine Learning, 2022

Ring-VCO-based ReLU activation function with linearity improvement for pulsed neuron circuits.

[BibT_eX]

[DOI]

Pitchayapatchaya Srikram

Yuri Kanazawa

Proceedings of the 29th IEEE International Conference on Electronics, Circuits and Systems, 2022

2021

A 96-MB 3D-Stacked SRAM Using Inductive Coupling With 0.4-V Transmitter, Termination Scheme and 12: 1 SerDes in 40-nm CMOS.

[BibT_eX]

[DOI]

Kota Shiba

Tatsuo Omori

Kodai Ueyoshi

Mototsugu Hamada

Tadahiro Kuroda

IEEE Trans. Circuits Syst. I Regul. Pap., 2021

STATICA: A 512-Spin 0.25M-Weight Annealing Processor With an All-Spin-Updates-at-Once Architecture for Combinatorial Optimization With Complete Spin-Spin Interactions.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2021

ProgressiveNN: Achieving Computational Scalability with Dynamic Bit-Precision Adjustment by MSB-first Accumulative Computation.

[BibT_eX]

[DOI]

Int. J. Netw. Comput., 2021

ExtraFerns: Fully Parallel Ensemble Learning Technique with Random Projection and Non-Greedy yet Minimal Memory Access Training.

[BibT_eX]

[DOI]

Int. J. Netw. Comput., 2021

Selective Fine-Tuning on a Classifier Ensemble: Realizing Adaptive Neural Networks With a Diversified Multi-Exit Architecture.

[BibT_eX]

[DOI]

Kazutoshi Hirose

Jaehoon Yu

IEEE Access, 2021

Edge Inference Engine for Deep & Random Sparse Neural Networks with 4-bit Cartesian-Product MAC Array and Pipelined Activation Aligner.

[BibT_eX]

[DOI]

Proceedings of the IEEE Hot Chips 33 Symposium, 2021

A High-Performance and Flexible FPGA Inference Accelerator for Decision Forests Based on Prior Feature Space Partitioning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Field-Programmable Technology, 2021

Hidden-Fold Networks: Random Recurrent Residuals Using Sparse Supermasks.

[BibT_eX]

[DOI]

Masanori Hashimoto

Jaehoon Yu

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

An Adaptive Global and Local Tone Mapping Algorithm Implemented on FPGA.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

A Hardware-efficient Weight Sampling Circuit for Bayesian Neural Networks.

[BibT_eX]

[DOI]

Int. J. Netw. Comput., 2020

7.3 STATICA: A 512-Spin 0.25M-Weight Full-Digital Annealing Processor with a Near-Memory All-Spin-Updates-at-Once Architecture for Combinatorial Optimization with Complete Spin-Spin Interactions.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Solid- State Circuits Conference, 2020

A 3D-Stacked SRAM using Inductive Coupling with Low-Voltage Transmitter and 12: 1 SerDes.

[BibT_eX]

[DOI]

Mototsugu Hamada

Tadahiro Kuroda

Proceedings of the IEEE International Symposium on Circuits and Systems, 2020

ProgressiveNN: Achieving Computational Scalability without Network Alteration by MSB-first Accumulative Computation.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Symposium on Computing and Networking, 2020

ExtraFerns: Fully Parallel Ensemble Learning Technique with Non-Greedy yet Minimal Memory Access Training.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Symposium on Computing and Networking, 2020

Hardware/Algorithm Co-optimization for Fully-Parallelized Compact Decision Tree Ensembles on FPGAs.

[BibT_eX]

[DOI]

Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2020

2019

QUEST: Multi-Purpose Log-Quantized DNN Inference Engine Stacked on 96-MB 3-D SRAM Using Inductive Coupling Technology in 40-nm CMOS.

[BibT_eX]

[DOI]

Kodai Ueyoshi

Kota Ando

Kazutoshi Hirose

Mototsugu Hamada

Tadahiro Kuroda

IEEE J. Solid State Circuits, 2019

Introduction to the Special Issue on the 2018 International Solid-State Circuits Conference (ISSCC).

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2019

FPGA-Based Annealing Processor with Time-Division Multiplexing.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2019

Foreword.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2019

Dither NN: Hardware/Algorithm Co-Design for Accurate Quantized Neural Networks.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2019

A Resource-Efficient Weight Sampling Method for Bayesian Neural Network Accelerators.

[BibT_eX]

[DOI]

Yuki Hirayama

Proceedings of the 2019 Seventh International Symposium on Computing and Networking, 2019

Radiography Contrast Enhancement: Smoothed LHE Filter a Practical Solution for Digital X-Rays with Mach Band.

[BibT_eX]

[DOI]

Proceedings of the 2019 Digital Image Computing: Techniques and Applications, 2019

DeltaNet: Differential Binary Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 30th IEEE International Conference on Application-specific Systems, 2019

2018

Real-time HDTV to 4K and 8K-UHD conversions using anti-aliasing based super resolution algorithm on FPGA.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2018

BRein Memory: A Single-Chip Binary/Ternary Reconfigurable in-Memory Deep Neural Network Accelerator Achieving 1.4 TOPS at 0.6 W.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2018

Protocomputing Architecture over a Digital Medium Aiming at Real-Time Video Processing.

[BibT_eX]

[DOI]

Aoi Tanibata

Alexandre Schmid

Complex., 2018

New Generation Dynamically Reconfigurable Processor Technology for Accelerating Embedded AI Applications.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Symposium on VLSI Circuits, 2018

Analysis of Smoothed LHE Methods for Processing Images with Optical Illusions.

[BibT_eX]

[DOI]

Proceedings of the IEEE Visual Communications and Image Processing, 2018

Area and Energy Optimization for Bit-Serial Log-Quantized DNN Accelerator with Shared Accumulators.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2018

QUEST: A 7.49TOPS multi-purpose log-quantized DNN inference engine stacked on 96MB 3D SRAM using inductive-coupling technology in 40nm CMOS.

[BibT_eX]

[DOI]

Kodai Ueyoshi

Kota Ando

Kazutoshi Hirose

Proceedings of the 2018 IEEE International Solid-State Circuits Conference, 2018

Session 13 overview: Machine learning and signal processing: Digital architectures and systems subcommittee.

[BibT_eX]

[DOI]

Dejan Markovic

Byeong-Gyu Nam

Proceedings of the 2018 IEEE International Solid-State Circuits Conference, 2018

Sparse Disparity Estimation Using Global Phase Only Correlation for Stereo Matching Acceleration.

[BibT_eX]

[DOI]

Takeshi Shimada

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Dither NN: An Accurate Neural Network with Dithering for Low Bit-Precision Hardware.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Field-Programmable Technology, 2018

Advanced Devices and Architectures.

[BibT_eX]

[DOI]

Masanori Hariyama

Minoru Watanabe

Proceedings of the Principles and Structures of FPGAs., 2018

2017

Error Tolerance Analysis of Deep Learning Hardware Using a Restricted Boltzmann Machine Toward Low-Power Memory Implementation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2017

Quantization Error-Based Regularization in Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Artificial Intelligence XXXIV, 2017

In-memory area-efficient signal streaming processor design for binary neural networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE 60th International Midwest Symposium on Circuits and Systems, 2017

F3: Beyond the horizon of conventional computing: From deep learning to neuromorphic systems.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Solid-State Circuits Conference, 2017

Low latency divider using ensemble of moving average curves.

[BibT_eX]

[DOI]

Proceedings of the 18th International Symposium on Quality Electronic Design, 2017

Live demonstration: Feature extraction system using restricted Boltzmann machines on FPGA.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Exploring optimized accelerator design for binarized convolutional neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Logarithmic Compression for Memory Footprint Reduction in Neural Network Training.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Symposium on Computing and Networking, 2017

A Time-Division Multiplexing Ising Machine on FPGAs.

[BibT_eX]

[DOI]

Kasho Yamamoto

Weiqiang Huang

Proceedings of the 8th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, 2017

FPGA implementation of edge-guided pattern generation for motion-vector estimation of textureless objects.

[BibT_eX]

[DOI]

Aoi Tanibata

Alexandre Schmid

Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017

A Batch Normalization Free Binarized Convolutional Deep Neural Network on an FPGA (Abstract Only).

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2017

An image sensor/processor 3D stacked module featuring ThruChip interfaces.

[BibT_eX]

[DOI]

Proceedings of the 22nd Asia and South Pacific Design Automation Conference, 2017

An FPGA Realization of a Deep Convolutional Neural Network Using a Threshold Neuron Pruning.

[BibT_eX]

[DOI]

Proceedings of the Applied Reconfigurable Computing - 13th International Symposium, 2017

Accelerating deep learning by binarized hardware.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Introduction to the Special Issue on the 2015 Symposium on VLSI Circuits.

[BibT_eX]

[DOI]

Andreia Cathelin

IEEE J. Solid State Circuits, 2016

Motion Vector Estimation of Textureless Objects Exploiting Reaction-Diffusion Cellular Automata.

[BibT_eX]

[DOI]

Int. J. Unconv. Comput., 2016

Foreword.

[BibT_eX]

[DOI]

Jeffrey C. Gealow

Proceedings of the 2016 IEEE Symposium on VLSI Circuits, 2016

FPGA architecture for feed-forward sequential memory network targeting long-term time-series forecasting.

[BibT_eX]

[DOI]

Proceedings of the International Conference on ReConFigurable Computing and FPGAs, 2016

An FPGA-optimized architecture of anti-aliasing based super resolution for real-time HDTV to 4K- and 8K-UHD conversions.

[BibT_eX]

[DOI]

Proceedings of the International Conference on ReConFigurable Computing and FPGAs, 2016

Memory-error tolerance of scalable and highly parallel architecture for restricted Boltzmann machines in Deep Belief Network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

A memory-based realization of a binarized deep convolutional neural network.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Field-Programmable Technology, 2016

Motion-vector estimation and cognitive classification on an image sensor/processor 3D stacked system featuring ThruChip interfaces.

[BibT_eX]

[DOI]

Proceedings of the ESSCIRC Conference 2016: 42<sup>nd</sup> European Solid-State Circuits Conference, 2016

A 2-clock-cycle Naïve Bayes classifier for dynamic branch prediction in pipelined RISC microprocessors.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Asia Pacific Conference on Circuits and Systems, 2016

2015

Crosstalk Rejection in 3-D-Stacked Interchip Communication With Blind Source Separation.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2015

Introduction to the Special Issue on the 2014 Symposium on VLSI Circuits.

[BibT_eX]

[DOI]

Jeffrey C. Gealow

Lizeth Gonzalez-Carabarin

IEEE J. Solid State Circuits, 2015

Enhancing Memcached by Caching Its Data and Functionalities at Network Interface.

[BibT_eX]

[DOI]

J. Inf. Process., 2015

Through Chip Interface Based Three-Dimensional FPGA Architecture Exploration.

[BibT_eX]

[DOI]

IEICE Trans. Electron., 2015

Image sensor/digital logic 3D stacked module featuring inductive coupling channels for high speed/low-noise image transfer.

[BibT_eX]

[DOI]

Proceedings of the Symposium on VLSI Circuits, 2015

2014

Low-power asynchronous digital pipeline based on mismatch-tolerant logic gates.

[BibT_eX]

[DOI]

IEICE Electron. Express, 2014

Achieving higher performance of memcached by caching at network interface.

[BibT_eX]

[DOI]

Proceedings of the 2014 International Conference on Field-Programmable Technology, 2014

Caching memcached at reconfigurable network interface.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Field Programmable Logic and Applications, 2014

A 4.5 to 13 times energy-efficient embedded microprocessor with mainly-static/partially-dynamic reconfigurable array accelerator.

[BibT_eX]

[DOI]

Proceedings of the IEEE Asian Solid-State Circuits Conference, 2014

2013

C-Based Complex Event Processing on Reconfigurable Hardware.

[BibT_eX]

[DOI]

Hiroaki Inoue

Takashi Takenaka

IEEE Trans. Very Large Scale Integr. Syst., 2013

A restricted dynamically reconfigurable architecture for low power processors.

[BibT_eX]

[DOI]

Proceedings of the 2012 International Conference on Reconfigurable Computing and FPGAs, 2013

Exploiting hardware reconfigurability on window join.

[BibT_eX]

[DOI]

Proceedings of the International Conference on High Performance Computing & Simulation, 2013

Hardware-oriented stereo vision algorithm based on 1-D guided filtering and its FPGA implementation.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Conference on Electronics, 2013

C-Based Adaptive Stream Processing on Dynamically Reconfigurable Hardware: A Case Study on Window Join.

[BibT_eX]

[DOI]

Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2013

2011

Test compression for dynamically reconfigurable processors.

[BibT_eX]

[DOI]

ACM Trans. Reconfigurable Technol. Syst., 2011

Programmable cell array using rewritable solid-electrolyte switch integrated in 90nm CMOS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid-State Circuits Conference, 2011

20Gbps C-Based Complex Event Processing.

[BibT_eX]

[DOI]

Hiroaki Inoue

Takashi Takenaka

Proceedings of the International Conference on Field Programmable Logic and Applications, 2011

Panel discussions: Impact on society by fusion and harmony of mobile devices, servers, and networks - Their direction of evolutions and optimal roles.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE Symposium on Low-Power and High-Speed Chips, 2011

2004

Efficient metrics and high-level synthesis for dynamically reconfigurable logic.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2004

A Combined Approach to High-Level Synthesis for Dynamically Reconfigurable Systems.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2004

Wrapper-based bus implementation techniques for performance improvement and cost reduction.

[BibT_eX]

[DOI]

Kenichiro Anjo

Atsushi Okamura

IEEE J. Solid State Circuits, 2004

Stream applications on the dynamically reconfigurable processor.

[BibT_eX]

[DOI]

Kazutoshi Wakabayashi

Takao Toi

Toru Awashima

Proceedings of the 2004 IEEE International Conference on Field-Programmable Technology, 2004

Implementing and Evaluating Stream Applications on the Dynamically Reconfigurable Processor.

[BibT_eX]

[DOI]

Kazutoshi Wakabayashi

Takeo Toi

Toru Awashima

Proceedings of the 12th IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2004), 2004

2003

New design methodology with efficient prediction of quality metrics for logic level design towards dynamic reconfigurable logic.

[BibT_eX]

[DOI]

J. Syst. Archit., 2003

A Hierarchical Cost Estimation Technique for High Level Synthesis.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2003

A New Reconfigurable Hardware Architecture for High Throughput Networking Applications and its Design Methodology.

[BibT_eX]

[DOI]

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

A New Hardware Algorithm for Fast IP Routing Targeting Programmable Routers.

[BibT_eX]

Proceedings of the Network Control and Engineering for QoS, 2003

2000

A Study of Channeled DRAM Memory Architectures.

[BibT_eX]

[DOI]

Lars Friebe

Yoshikazu Yabe

Proceedings of the IEEE International Conference On Computer Design: VLSI In Computers & Processors, 2000

A Virtual Hardware System on a Dynamically Reconfigurable Logic Device.

[BibT_eX]

[DOI]

Proceedings of the 8th IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2000), 2000

Spatial-temporal mapping of real applications on a Dynamically Reconfigurable Logic Engine (DRLE) LSI.

[BibT_eX]

[DOI]

Koichiro Furuta

Taro Fujii

Kazutoshi Wakabayashi

Masakazu Yamashina

Proceedings of the IEEE 2000 Custom Integrated Circuits Conference, 2000

Reconfigurable computing: its concept and a practical embodiment using newly developed dynamically reconfigurable logic (DRL) LSI: invited talk.

[BibT_eX]

[DOI]

Masakazu Yamashina

Proceedings of ASP-DAC 2000, 2000

1998

An 800-MOPS, 110-mW, 1.5-V, parallel DSP for mobile multimedia processing.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 1998

An Embedded DRAM-FPGA Chip with Instantaneous Logic Reconfiguration.

[BibT_eX]

[DOI]

Proceedings of the 6th IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM '98), 1998

1995

Cache-processor coupling: a fast and wide on-chip data cache design.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, April, 1995

Ordered multithreading: a novel technique for exploiting thread-level parallelism.

[BibT_eX]

[DOI]

Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques, 1995

1994

A 300-MHz 16-b 0.5-μm BiCMOS digital signal processor core LSI.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, March, 1994

1993

A 300-MHz 16-b BiCMOS video signal processor.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, December, 1993

1992

A 2 K-word dictionary search processor (DISP) LSI with an approximate word search capability.

[BibT_eX]

[DOI]