Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

DOTA: detect and omit weak attentions for scalable transformer acceleration.

[BibT_eX]

[DOI]

Zheng Qu

Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

2021

Transformer Acceleration with Dynamic Sparse Attention.

[BibT_eX]

[DOI]

CoRR, 2021

Π-RT: A Runtime Framework to Enable Energy-Efficient Real-Time Robotic Vision Applications on Heterogeneous Architectures.

[BibT_eX]

[DOI]

Computer, 2021

Efficient tensor core-based GPU kernels for structured sparsity under reduced precision.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2021

ENMC: Extreme Near-Memory Classification via Approximate Screening.

[BibT_eX]

[DOI]

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

2020

SemiMap: A Semi-Folded Convolution Mapping for Speed-Overhead Balance on Crossbars.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Computation on Sparse Neural Networks: an Inspiration for Future Hardware.

[BibT_eX]

[DOI]

CoRR, 2020

DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Boosting Deep Neural Network Efficiency with Dual-Module Inference.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

INVITED: Computation on Sparse Neural Networks and its Implications for Future Hardware.

[BibT_eX]

[DOI]

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019

L1-Norm Batch Normalization for Efficient Training of Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., 2019

Dynamic Sparse Graph for Efficient Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

2018

L1-Norm Batch Normalization for Efficient Training of Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2018

PIRT: A Runtime Framework to Enable Energy-Efficient Real-Time Robotic Applications on Heterogeneous Architectures.

[BibT_eX]

[DOI]

CoRR, 2018

2017

Building energy-efficient multi-level cell STT-RAM caches with data compression.

[BibT_eX]

[DOI]

Proceedings of the 22nd Asia and South Pacific Design Automation Conference, 2017

2016

CNNLab: a Novel Parallel Framework for Neural Networks using GPU and FPGA-a Practical Study with Trade-off Analysis.

[BibT_eX]

[DOI]

CoRR, 2016

NVSim-CAM: a circuit-level simulator for emerging nonvolatile memory based content-addressable memory.

[BibT_eX]

[DOI]

Proceedings of the 35th International Conference on Computer-Aided Design, 2016

Leveraging 3D Technologies for Hardware Security: Opportunities and Challenges.

[BibT_eX]

[DOI]

Proceedings of the 26th edition on Great Lakes Symposium on VLSI, 2016

Liu Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...