Shreyas K. Venkataramanaiah

According to our database, Shreyas K. Venkataramanaiah authored at least 15 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
A 28-nm 8-bit Floating-Point Tensor Core-Based Programmable CNN Training Processor With Dynamic Structured Sparsity.
IEEE J. Solid State Circuits, 2023

FP-IMC: A 28nm All-Digital Configurable Floating-Point In-Memory Computing Macro.
Proceedings of the 49th IEEE European Solid State Circuits Conference, 2023

2022
Efficient continual learning at the edge with progressive segmented training.
Neuromorph. Comput. Eng., December, 2022

A 28nm 8-bit Floating-Point Tensor Core based CNN Training Processor with Dynamic Activation/Weight Sparsification.
Proceedings of the 48th IEEE European Solid State Circuits Conference, 2022

2021
Algorithm-Hardware Co-Optimization for Energy-Efficient Drone Detection on Resource-Constrained FPGA.
Proceedings of the International Conference on Field-Programmable Technology, 2021

FixyFPGA: Efficient FPGA Accelerator for Deep Neural Networks with High Element-Wise Sparsity and without External Memory Access.
Proceedings of the 31st International Conference on Field-Programmable Logic and Applications, 2021

2020
Deep Neural Network Training Accelerator Designs in ASIC and FPGA.
Proceedings of the International SoC Design Conference, 2020

Online Knowledge Acquisition with the Selective Inherited Model.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Efficient and Modularized Training on FPGA for Real-time Applications.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

FPGA-based Low-Batch Training Accelerator for Modern CNNs Featuring High Bandwidth Memory.
Proceedings of the IEEE/ACM International Conference On Computer Aided Design, 2020

2019
FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning.
CoRR, 2019

FixyNN: Energy-Efficient Real-Time Mobile Computer Vision Hardware Acceleration via Transfer Learning.
Proceedings of Machine Learning and Systems 2019, 2019

Automatic Compiler Based FPGA Accelerator for CNN Training.
Proceedings of the 29th International Conference on Field Programmable Logic and Applications, 2019

2017
Algorithm and hardware design of discrete-time spiking neural networks based on back propagation with binary activations.
Proceedings of the IEEE Biomedical Circuits and Systems Conference, 2017

Minimizing area and energy of deep learning hardware design using collective low precision and structured compression.
Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017
