Runbin Shi

Orcid: 0000-0003-4901-5476

According to our database1, Runbin Shi authored at least 22 papers between 2015 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Co-design Hardware and Algorithm for Vector Search.
CoRR, 2023

Co-design Hardware and Algorithm for Vector Search.
Proceedings of the International Conference for High Performance Computing, 2023

2022
Exploiting HBM on FPGAs for Data Processing.
ACM Trans. Reconfigurable Technol. Syst., 2022

Radio Frequency Fingerprinting on the Edge.
IEEE Trans. Mob. Comput., 2022

ECI: a Customizable Cache Coherency Stack for Hybrid FPGA-CPU Architectures.
CoRR, 2022

2021
O3BNN-R: An Out-of-Order Architecture for High-Performance and Regularized BNN Inference.
IEEE Trans. Parallel Distributed Syst., 2021

Adversarial Hardware With Functional and Topological Camouflage.
IEEE Trans. Circuits Syst. II Express Briefs, 2021

Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

2020
MSP: An FPGA-Specific Mixed-Scheme, Multi-Precision Deep Neural Network Quantization Framework.
CoRR, 2020

AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

CSB-RNN: a faster-than-realtime RNN acceleration framework with compressed structured blocks.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers.
Proceedings of the 8th International Conference on Learning Representations, 2020

FTDL: An FPGA-tailored Architecture for Deep Learning Systems.
Proceedings of the FPGA '20: The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2020

FTDL: A Tailored FPGA-Overlay for Deep Learning with High Scalability.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019
A Real-Time Coprime Line Scan Super-Resolution System for Ultra-Fast Microscopy.
IEEE Trans. Biomed. Circuits Syst., 2019

High-Throughput Line Buffer Microarchitecture for Arbitrary Sized Streaming Image Processing.
J. Imaging, 2019

E-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System.
Proceedings of the 56th Annual Design Automation Conference 2019, 2019

2017
Ultra-low latency continuous block-parallel stream windowing using FPGA on-chip memory.
Proceedings of the International Conference on Field Programmable Technology, 2017

2016
A Heterogeneous System for Real-Time Detection with AdaBoost.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

The neuro vector engine: Flexibility to improve convolutional net efficiency for wearable vision.
Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016

2015
A Locality Aware Convolutional Neural Networks Accelerator.
Proceedings of the 2015 Euromicro Conference on Digital System Design, 2015

A programmable baseband processor for massive MIMO uplink multi-user detection.
Proceedings of the 2015 IEEE 11th International Conference on ASIC, 2015


  Loading...