Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

FINGERS: exploiting fine-grained parallelism in graph mining accelerators.

[BibT_eX]

[DOI]

Qihang Chen

Boyu Tian

Mingyu Gao

Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

2021

PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections.

[BibT_eX]

[DOI]

Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation, 2021

PipeZK: Accelerating Zero-Knowledge Proof with a Pipelined Architecture.

[BibT_eX]

[DOI]

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

2020

Improving the Accuracy, Scalability, and Performance of Graph Neural Networks with Roc.

[BibT_eX]

[DOI]

Proceedings of the Third Conference on Machine Learning and Systems, 2020

Interstellar: Using Halide's Scheduling Language to Analyze DNN Accelerators.

[BibT_eX]

[DOI]

Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

2019

Optimizing DNN Computation with Relaxed Graph Substitutions.

[BibT_eX]

[DOI]

Proceedings of the Second Conference on Machine Learning and Systems, SysML 2019, 2019

TANGRAM: Optimized Coarse-Grained Dataflow for Scalable NN Accelerators.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

2018

DNN Dataflow Choice Is Overrated.

[BibT_eX]

[DOI]

CoRR, 2018

GraphP: Reducing Communication for PIM-Based Graph Processing with Efficient Data Partition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

2017

3D nanosystems enable <i>embedded</i> abundant-data computing: special session paper.

[BibT_eX]

[DOI]

Proceedings of the Twelfth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis Companion, 2017

TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

2016

DRAF: A Low-Power DRAM-Based Reconfigurable Acceleration Fabric.

[BibT_eX]

[DOI]

Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

HRL: Efficient and flexible reconfigurable logic for near-data processing.

[BibT_eX]

[DOI]

Mingyu Gao

Christos Kozyrakis

Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016

2015

Energy-Efficient Abundant-Data Computing: The N3XT 1, 000x.

[BibT_eX]

[DOI]

Computer, 2015

Practical Near-Data Processing for In-Memory Analytics Frameworks.

[BibT_eX]

[DOI]

Mingyu Gao

Grant Ayers

Christos Kozyrakis

Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

Mingyu Gao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...