Kai Zhong

Orcid: 0000-0002-8448-9530

Affiliations:
  • Tsinghua University, Department of Electronic Engineering, Beijing, China


According to our database1, Kai Zhong authored at least 24 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DySpMM: From Fix to Dynamic for Sparse Matrix-Matrix Multiplication Accelerators.
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024

FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
CoGNN: An Algorithm-Hardware Co-Design Approach to Accelerate GNN Inference With Minibatch Sampling.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., December, 2023

An Efficient Accelerator for Point-based and Voxel-based Point Cloud Neural Networks.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

NTGAT: A Graph Attention Network Accelerator with Runtime Node Tailoring.
Proceedings of the 28th Asia and South Pacific Design Automation Conference, 2023

2022
A Unified FPGA Virtualization Framework for General-Purpose Deep Neural Networks in the Cloud.
ACM Trans. Reconfigurable Technol. Syst., 2022

Exploring the Potential of Low-Bit Training of Convolutional Neural Networks.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Exploiting Parallelism with Vertex-Clustering in Processing-In-Memory-based GCN Accelerators.
Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

2021
Machine Learning for Electronic Design Automation: A Survey.
ACM Trans. Design Autom. Electr. Syst., 2021

BoolNet: Minimizing The Energy Consumption of Binary Neural Networks.
CoRR, 2021

2020
Towards Lower Bit Multiplication for Convolutional Neural Network Training.
CoRR, 2020

Enable Efficient and Flexible FPGA Virtualization for Deep Learning in the Cloud.
Proceedings of the FPGA '20: The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2020

Enabling Efficient and Flexible FPGA Virtualization for Deep Learning in the Cloud.
Proceedings of the 28th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2020

2019
Provable Non-linear Inductive Matrix Completion.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Nonlinear Inductive Matrix Completion based on One-layer Neural Networks.
CoRR, 2018

Power Grid Reduction by Sparse Convex Optimization.
Proceedings of the 2018 International Symposium on Physical Design, 2018

2017
Learning Non-overlapping Convolutional Neural Networks with Multiple Kernels.
CoRR, 2017

Fast second-order cone programming for safe mission planning.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Recovery Guarantees for One-hidden-layer Neural Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Efficient and Non-Convex Coordinate Descent for Symmetric Nonnegative Matrix Factorization.
IEEE Trans. Signal Process., 2016

Mixed Linear Regression with Multiple Components.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Practical public PUF enabled by solving max-flow problem on chip.
Proceedings of the 53rd Annual Design Automation Conference, 2016

2015
Coordinate Descent Methods for Symmetric Nonnegative Matrix Factorization.
CoRR, 2015

Efficient Matrix Sensing Using Rank-1 Gaussian Measurements.
Proceedings of the Algorithmic Learning Theory - 26th International Conference, 2015


  Loading...