Proceedings of the SIGMETRICS/PERFORMANCE '22: ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, Mumbai, India, June 6, 2022

Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling.

[BibT_eX]

[DOI]

Yannan Nellie Wu

Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

Ruby: Improving Hardware Efficiency for Tensor Algebra Accelerators Through Imperfect Factorization.

[BibT_eX]

[DOI]

Proceedings of the International IEEE Symposium on Performance Analysis of Systems and Software, 2022

Demystifying Map Space Exploration for NPUs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Workload Characterization, 2022

DiGamma: Domain-aware Genetic Algorithm for HW-Mapping Co-optimization for DNN Accelerators.

[BibT_eX]

[DOI]

Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

2021

Flexion: A Quantitative Metric for Flexibility in DNN Accelerators.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2021

Sparseloop: An Analytical, Energy-Focused Design Space Exploration Methodology for Sparse Tensor Accelerators.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2021

Mind mappings: enabling efficient algorithm-accelerator mapping space search.

[BibT_eX]

[DOI]

Christopher W. Fletcher

Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators.

[BibT_eX]

[DOI]

Sivasankaran Rajamanickam

Roberto Gioiosa

Tushar Krishna

Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2020

Data Orchestration in Deep Learning Accelerators

[BibT_eX]

[DOI]

Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers, ISBN: 978-3-031-01767-4, 2020

MAESTRO: A Data-Centric Approach to Understand Reuse, Performance, and Hardware Cost of DNN Mappings.

[BibT_eX]

[DOI]

IEEE Micro, 2020

2019

Understanding Reuse, Performance, and Hardware Cost of DNN Dataflow: A Data-Centric Approach.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Timeloop: A Systematic Approach to DNN Accelerator Evaluation.

[BibT_eX]

[DOI]

Rangharajan Venkatesan

Brucek Khailany

Stephen W. Keckler

Joel S. Emer

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2019

2017

SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks.

[BibT_eX]

[DOI]

Rangharajan Venkatesan

Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

2015

Efficient Control and Communication Paradigms for Coarse-Grained Spatial Architectures.

[BibT_eX]

[DOI]

ACM Trans. Comput. Syst., 2015

2014

Efficient Spatial Processing Element Control via Triggered Instructions.

[BibT_eX]

[DOI]

IEEE Micro, 2014

2013

Triggered instructions: a control paradigm for spatially-programmed architectures.

[BibT_eX]

[DOI]

Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

A Hierarchical Architectural Framework for Reconfigurable Logic Computing.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

2012

Leveraging latency-insensitivity to ease multiple FPGA design.

[BibT_eX]

[DOI]

Kermin Elliott Fleming

Proceedings of the ACM/SIGDA 20th International Symposium on Field Programmable Gate Arrays, 2012

2011

HAsim: FPGA-based high-detail multicore simulation using time-division multiplexing.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on High-Performance Computer Architecture (HPCA-17 2011), 2011

Leap scratchpads: automatic memory and cache management for reconfigurable logic.

[BibT_eX]

[DOI]

Proceedings of the ACM/SIGDA 19th International Symposium on Field Programmable Gate Arrays, 2011

2007

Mechanisms for bounding vulnerabilities of processor structures.

[BibT_eX]

[DOI]

Niranjan Soundararajan

Angshuman Parashar

Anand Sivasubramaniam

Proceedings of the 34th International Symposium on Computer Architecture (ISCA 2007), 2007

2006

SlicK: slice-based locality exploitation for efficient redundant multithreading.

[BibT_eX]

[DOI]

Angshuman Parashar

Anand Sivasubramaniam

Sudhanva Gurumurthi

Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, 2006

2004

A Complexity-Effective Approach to ALU Bandwidth Enhancement for Instruction-Level Temporal Redundancy.

[BibT_eX]

[DOI]

Angshuman Parashar

Sudhanva Gurumurthi

Anand Sivasubramaniam

Proceedings of the 31st International Symposium on Computer Architecture (ISCA 2004), 2004

2002

An Uncalibrated Lightfield Acquisition System.

[BibT_eX]

Proceedings of the ICVGIP 2002, 2002

Angshuman Parashar

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...