We stand with Ukraine

We stand with Ukraine

Shoaib Kamil

Orcid: 0000-0001-5965-3717

Affiliations:

Adobe Systems Inc., Seattle, WA, USA

According to our database¹, Shoaib Kamil authored at least 65 papers between 2002 and 2023.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

On csauthors.net:

Bibliography

2023

Fast Instruction Selection for Fast Digital Signal Processing.

[BibT_eX]

[DOI]

Alexander J. Root

,

Maaz Bin Safeer Ahmad

,

,

,

,

Jonathan Ragan-Kelley

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022

Searching for Fast Demosaicking Algorithms.

[BibT_eX]

[DOI]

,

Michaël Gharbi

,

,

,

,

Connelly Barnes

,

Jonathan Ragan-Kelley

ACM Trans. Graph., 2022

Sparsity-Specific Code Optimization using Expression Trees.

[BibT_eX]

[DOI]

Philipp Herholz

,

,

Teseo Schneider

,

,

Daniele Panozzo

,

Olga Sorkine-Hornung

ACM Trans. Graph., 2022

H rtDown: Document Processor for Executable Linear Algebra Papers.

[BibT_eX]

[DOI]

,

,

,

Yotam I. Gingold

Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

A Cross-Platform Benchmark for Interval Computation Libraries.

[BibT_eX]

[DOI]

,

Zachary Ferguson

,

Teseo Schneider

,

,

,

Daniele Panozzo

Proceedings of the Parallel Processing and Applied Mathematics, 2022

Vector instruction selection for digital signal processors using program synthesis.

[BibT_eX]

[DOI]

Maaz Bin Safeer Ahmad

,

Alexander J. Root

,

,

,

Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

2021

I♥LA: compilable markdown for linear algebra.

[BibT_eX]

[DOI]

,

,

,

Yotam I. Gingold

ACM Trans. Graph., 2021

I$\heartsuit$LA: Compilable Markdown for Linear Algebra.

[BibT_eX]

[DOI]

,

,

,

Yotam I. Gingold

CoRR, 2021

Domain-Specific Language Abstractions for Compression.

[BibT_eX]

[DOI]

,

Ajay Brahmakshatriya

,

,

,

,

,

Saman P. Amarasinghe

Proceedings of the 31st Data Compression Conference, 2021

Compiling Graph Applications for GPU s with GraphIt.

[BibT_eX]

[DOI]

Ajay Brahmakshatriya

,

,

,

,

,

Saman P. Amarasinghe

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021

2020

NASOQ: numerically accurate sparsity-oriented QP solver.

[BibT_eX]

[DOI]

,

Danny M. Kaufman

,

,

Maryam Mehri Dehnavi

ACM Trans. Graph., 2020

A sparse iteration space transformation framework for sparse tensor algebra.

[BibT_eX]

[DOI]

Ryan Senanayake

,

,

,

,

,

,

Saman P. Amarasinghe

,

Fredrik Kjolstad

Proc. ACM Program. Lang., 2020

Verifying and improving Halide's term rewriting system with program synthesis.

[BibT_eX]

[DOI]

Julie L. Newcomb

,

,

,

Rastislav Bodík

,

Proc. ACM Program. Lang., 2020

Compliation Techniques for Graphs Algorithms on GPUs.

[BibT_eX]

[DOI]

Ajay Brahmakshatriya

,

,

,

,

,

Saman P. Amarasinghe

CoRR, 2020

A Unified Iteration Space Transformation Framework for Sparse and Dense Tensor Algebra.

[BibT_eX]

[DOI]

Ryan Senanayake

,

Fredrik Kjolstad

,

,

,

Saman P. Amarasinghe

CoRR, 2020

EGGS: Sparsity-Specific Code Generation.

[BibT_eX]

[DOI]

,

Teseo Schneider

,

,

,

,

Daniele Panozzo

Comput. Graph. Forum, 2020

Optimizing ordered graph algorithms with GraphIt.

[BibT_eX]

[DOI]

,

Ajay Brahmakshatriya

,

,

Laxman Dhulipala

,

,

Saman P. Amarasinghe

,

Proceedings of the CGO '20: 18th ACM/IEEE International Symposium on Code Generation and Optimization, 2020

2019

Automatically translating image processing libraries to halide.

[BibT_eX]

[DOI]

Maaz Bin Safeer Ahmad

,

Jonathan Ragan-Kelley

,

,

ACM Trans. Graph., 2019

Modular verification of web page layout.

[BibT_eX]

[DOI]

Pavel Panchekha

,

Michael D. Ernst

,

Zachary Tatlock

,

Proc. ACM Program. Lang., 2019

PriorityGraph: A Unified Programming Model for Optimizing Ordered Graph Algorithms.

[BibT_eX]

[DOI]

,

Ajay Brahmakshatriya

,

,

Laxman Dhulipala

,

,

Saman P. Amarasinghe

,

CoRR, 2019

Tensor Algebra Compilation with Workspaces.

[BibT_eX]

[DOI]

Fredrik Kjolstad

,

,

,

Saman P. Amarasinghe

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

Tiramisu: A Polyhedral Compiler for Expressing Fast and Portable Code.

[BibT_eX]

[DOI]

Riyadh Baghdadi

,

,

Malek Ben Romdhane

,

Emanuele Del Sozzo

,

Abdurrahman Akkas

,

,

Patricia Suriana

,

,

Saman P. Amarasinghe

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

2018

GraphIt: a high-performance graph DSL.

[BibT_eX]

[DOI]

,

,

Riyadh Baghdadi

,

,

,

Saman P. Amarasinghe

Proc. ACM Program. Lang., 2018

GraphIt - A High-Performance DSL for Graph Analytics.

[BibT_eX]

[DOI]

,

,

Riyadh Baghdadi

,

,

,

Saman P. Amarasinghe

CoRR, 2018

Automatic Generation of Sparse Tensor Kernels with Workspaces.

[BibT_eX]

[DOI]

Fredrik Kjolstad

,

,

Saman P. Amarasinghe

CoRR, 2018

ParSy: inspection and transformation of sparse matrix computations for parallelism.

[BibT_eX]

[DOI]

,

,

Michelle Mills Strout

,

Maryam Mehri Dehnavi

Proceedings of the International Conference for High Performance Computing, 2018

Verifying that web pages have accessible layout.

[BibT_eX]

[DOI]

Pavel Panchekha

,

,

Michael D. Ernst

,

Zachary Tatlock

,

Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2018

2017

The tensor algebra compiler.

[BibT_eX]

[DOI]

Fredrik Kjolstad

,

,

,

,

Saman P. Amarasinghe

Proc. ACM Program. Lang., 2017

Sympiler: transforming sparse matrix codes by decoupling symbolic analysis.

[BibT_eX]

[DOI]

,

,

Michelle Mills Strout

,

Maryam Mehri Dehnavi

Proceedings of the International Conference for High Performance Computing, 2017

taco: a tool to generate tensor algebra kernels.

[BibT_eX]

[DOI]

Fredrik Kjolstad

,

,

,

,

Saman P. Amarasinghe

Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, 2017

Parallel associative reductions in halide.

[BibT_eX]

[DOI]

Patricia Suriana

,

,

Proceedings of the 2017 International Symposium on Code Generation and Optimization, 2017

2016

Simit: A Language for Physical Simulation.

[BibT_eX]

[DOI]

Fredrik Kjolstad

,

,

Jonathan Ragan-Kelley

,

David I. W. Levin

,

,

,

,

Danny M. Kaufman

,

,

Wojciech Matusik

,

Saman P. Amarasinghe

ACM Trans. Graph., 2016

Distributed Halide.

[BibT_eX]

[DOI]

Tyler Denniston

,

,

Saman P. Amarasinghe

Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Verified lifting of stencil computations.

[BibT_eX]

[DOI]

,

,

Shachar Itzhaky

,

Armando Solar-Lezama

Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2016

2015

Parallel processing of filtered queries in attributed semantic graphs.

[BibT_eX]

[DOI]

,

,

,

Samuel Williams

,

Erika Duriakova

,

,

,

John R. Gilbert

J. Parallel Distributed Comput., 2015

Bridging the Gap Between General-Purpose and Domain-Specific Compilers with Synthesis.

[BibT_eX]

[DOI]

,

,

Armando Solar-Lezama

Proceedings of the 1st Summit on Advances in Programming Languages, 2015

Helium: lifting high-performance stencil kernels from stripped x86 binaries to halide DSL code.

[BibT_eX]

[DOI]

,

Jeffrey Bosboom

,

,

,

Jonathan Ragan-Kelley

,

,

,

Saman P. Amarasinghe

Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015

2014

MSL: A Synthesis Enabled Language for Distributed Implementations.

[BibT_eX]

[DOI]

,

,

Armando Solar-Lezama

Proceedings of the International Conference for High Performance Computing, 2014

WOSC 2014: second workshop on optimizing stencil computations.

[BibT_eX]

[DOI]

,

Saman P. Amarasinghe

,

Proceedings of the SPLASH'14, 2014

OpenTuner: an extensible framework for program autotuning.

[BibT_eX]

[DOI]

,

,

Kalyan Veeramachaneni

,

Jonathan Ragan-Kelley

,

Jeffrey Bosboom

,

Una-May O'Reilly

,

Saman P. Amarasinghe

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

Communication-Optimal Parallel Recursive Rectangular Matrix Multiplication.

[BibT_eX]

[DOI]

,

,

,

,

Benjamin Lipshitz

,

,

Omer Spillinger

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

High-Productivity and High-Performance Analysis of Filtered Semantic Graphs.

[BibT_eX]

[DOI]

,

Erika Duriakova

,

,

John R. Gilbert

,

,

,

,

Samuel Williams

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

2012

Productive High Performance Parallel Programming with Auto-tuned Domain-Specific Embedded Languages.

[BibT_eX]

[DOI]

Shoaib Ashraf Kamil

PhD thesis, 2012

Auto-tuning the Matrix Powers Kernel with SEJITS.

[BibT_eX]

[DOI]

,

,

Proceedings of the High Performance Computing for Computational Science, 2012

Parallel High Performance Bootstrapping in Python.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 11th Python in Science Conference 2012 (SciPy 2012), 2012

Poster: Beating MKL and ScaLAPACK at Rectangular Matrix Multiplication Using the BFS/DFS Approach.

[BibT_eX]

[DOI]

,

,

,

,

Benjamin Lipshitz

,

,

Omer Spillinger

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Portable parallel performance from sequential, productive, embedded domain-specific languages.

[BibT_eX]

[DOI]

,

Derrick Coetzee

,

,

,

Ekaterina Gonina

,

Jonathan Harper

,

,

Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

High-performance analysis of filtered semantic graphs.

[BibT_eX]

[DOI]

,

,

John R. Gilbert

,

,

,

,

Samuel Williams

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011

Bringing Parallel Performance to Python with Domain-Specific Selective Embedded Just-in-Time Specialization.

[BibT_eX]

[DOI]

,

Derrick Coetzee

,

Proceedings of the 10th Python in Science Conference 2011 (SciPy 2011), Austin, Texas, July 11, 2011

CUDA-level Performance with Python-level Productivity for Gaussian Mixture Model Applications.

[BibT_eX]

[DOI]

,

Ekaterina Gonina

,

,

Gerald Friedland

,

David A. Patterson

,

Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism, 2011

2010

Communication Requirements and Interconnect Optimization for High-End Scientific Applications.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Parallel Distributed Syst., 2010

An auto-tuning framework for parallel multicore stencil computations.

[BibT_eX]

[DOI]

,

,

,

,

Samuel Williams

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Silicon Nanophotonic Network-on-Chip Using TDM Arbitration.

[BibT_eX]

[DOI]

,

,

,

,

,

Luca P. Carloni

,

Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010

2009

Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors.

[BibT_eX]

[DOI]

,

,

Samuel Williams

,

,

,

Katherine A. Yelick

SIAM Rev., 2009

Energy-Efficient Computing for Extreme-Scale Science.

[BibT_eX]

[DOI]

,

,

,

Michael F. Wehner

,

,

,

,

Marghoob Mohiyuddin

Computer, 2009

Analysis of photonic networks for a chip multiprocessor using scientific applications.

[BibT_eX]

[DOI]

,

,

Aleksandr Biberman

,

,

Benjamin G. Lee

,

Marghoob Mohiyuddin

,

,

,

Luca P. Carloni

,

John Kubiatowicz

,

,

Proceedings of the Third International Symposium on Networks-on-Chips, 2009

2008

Power efficiency in high performance computing.

[BibT_eX]

[DOI]

,

,

Erich Strohmaier

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2007

Scientific Computing Kernels on the Cell Processor.

[BibT_eX]

[DOI]

Samuel Williams

,

,

,

,

,

Katherine A. Yelick

Int. J. Parallel Program., 2007

Scientific Application Performance on Candidate PetaScale Platforms.

[BibT_eX]

[DOI]

,

,

Jonathan Carter

,

,

Michael Lijewski

,

,

,

,

Erich Strohmaier

,

Stéphane Ethier

,

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Reconfigurable hybrid interconnection for static and dynamic scientific applications.

[BibT_eX]

[DOI]

,

,

Daniel K. Gunter

,

Michael Lijewski

,

,

Proceedings of the 4th Conference on Computing Frontiers, 2007

2006

The potential of the cell processor for scientific computing.

[BibT_eX]

[DOI]

Samuel Williams

,

,

,

,

,

Katherine A. Yelick

Proceedings of the Third Conference on Computing Frontiers, 2006

Implicit and explicit optimizations for stencil computations.

[BibT_eX]

[DOI]

,

,

Samuel Williams

,

,

,

Katherine A. Yelick

Proceedings of the 2006 workshop on Memory System Performance and Correctness, 2006

2005

Analyzing Ultra-Scale Application Communication Requirements for a Reconfigurable Hybrid Interconnect.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Impact of modern memory subsystems on cache optimizations for stencil computations.

[BibT_eX]

[DOI]

,

,

,

,

Katherine A. Yelick

Proceedings of the 2005 workshop on Memory System Performance, 2005

2002

Performance optimizations and bounds for sparse matrix-vector multiply.

[BibT_eX]

[DOI]

,

,

Katherine A. Yelick

,

,

Rajesh Nishtala

,

Benjamin C. Lee

Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

Loading...