Saman P. Amarasinghe

Proceedings of the PLDI '22: 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation, San Diego, CA, USA, June 13, 2022

Unified Compilation for Lossless Compression and Sparse Computing.

Daniel Donenfeld

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022

GraphIt to CUDA Compiler in 2021 LOC: A Case for High-Performance DSL Implementation via Staging with BuilDSL.

Ajay Brahmakshatriya

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022

2021

GraphIt to CUDA Compiler in 2021 LOC: A Case for High-Performance DSL Implementation via Staging with BuilDSL.

Ajay Brahmakshatriya

Dataset, December, 2021

Compilation of sparse array programming models.

Proc. ACM Program. Lang., 2021

Dynamic Sparse Tensor Algebra Compilation.

CoRR, 2021

An Asymptotic Cost Model for Autoscheduling Sparse Tensor Programs.

Willow Ahrens

CoRR, 2021

An Attempt to Generate Code for Symmetric Tensor Computations.

CoRR, 2021

A Deep Learning Based Cost Model for Automatic Code Optimization.

Massinissa Merouani

Mohamed-Hicham Leghettas

Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

Taming the Zoo: The Unified GraphIt Compiler Framework for Novel Architectures.

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

A Deep Dive Into Understanding The Random Walk-Based Temporal Graph Learning.

Proceedings of the IEEE International Symposium on Workload Characterization, 2021

Domain-Specific Language Abstractions for Compression.

Proceedings of the 31st Data Compression Conference, 2021

Compiling Graph Applications for GPUs with GraphIt.

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021

BuildIt: A Type-Based Multi-stage Programming Framework for Code Generation in C++.

Ajay Brahmakshatriya

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021

VeGen: a vectorizer generator for SIMD and beyond.

Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

2020

A sparse iteration space transformation framework for sparse tensor algebra.

Proc. ACM Program. Lang., 2020

Compilation Techniques for Graphs Algorithms on GPUs.

CoRR, 2020

TIRAMISU: A Polyhedral Compiler for Dense and Sparse Deep Learning.

Abdelkader Nadir Debbagh

Kamel Abdous

Fatima-Zohra Benhamida

Alex Renda

Jonathan Elliott Frankle

Michael Carbin

CoRR, 2020

A Unified Iteration Space Transformation Framework for Sparse and Dense Tensor Algebra.

CoRR, 2020

Sparse Tensor Transpositions.

Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020

Automatic generation of efficient sparse tensor format conversion routines.

Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2020

Compiler 2.0: Using Machine Learning to Modernize Compiler Technology.

Proceedings of the 21st ACM SIGPLAN/SIGBED International Conference on Languages, 2020

SALSA: A Domain Specific Architecture for Sequence Alignment.

Lorenzo Di Tucci

Marco D. Santambrogio

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

GrAPL 2020 Keynote Speaker The GraphIt Universal Graph Framework: Achieving High-Performance across Algorithms, Graph Types, and Architectures.

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Optimizing ordered graph algorithms with GraphIt.

Proceedings of the CGO '20: 18th ACM/IEEE International Symposium on Code Generation and Optimization, 2020

2019

Seq (OOPSLA 2019 Artifact).

Dataset, August, 2019

Seq: a high-performance language for bioinformatics.

Proc. ACM Program. Lang., 2019

PriorityGraph: A Unified Programming Model for Optimizing Ordered Graph Algorithms.

CoRR, 2019

Compiler Auto-Vectorization with Imitation Learning.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

BHive: A Benchmark Suite and Measurement Framework for Validating x86-64 Basic Block Performance Models.

Proceedings of the IEEE International Symposium on Workload Characterization, 2019

Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks.

Proceedings of the 36th International Conference on Machine Learning, 2019

Accelerated CNN Training through Gradient Approximation.

Sree Harsha Nelaturu

Ziheng Wang

Proceedings of the 2nd Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications, 2019

Tensor Algebra Compilation with Workspaces.

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

Tiramisu: A Polyhedral Compiler for Expressing Fast and Portable Code.

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

Revec: program rejuvenation through revectorization.

Proceedings of the 28th International Conference on Compiler Construction, 2019

The sparse tensor algebra compiler (keynote).

Proceedings of the 28th International Conference on Compiler Construction, 2019

2018

Evaluating End-to-End Optimization for Data Analytics Applications in Weld.

Proc. VLDB Endow., 2018

GraphIt: a high-performance graph DSL.

Proc. ACM Program. Lang., 2018

goSLP: globally optimized superword level parallelism framework.

Charith Mendis

Proc. ACM Program. Lang., 2018

Format abstraction for sparse tensor algebra compilers.

Proc. ACM Program. Lang., 2018

Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks.

Charith Mendis

Michael Carbin

CoRR, 2018

Cimple: Instruction and Memory Level Parallelism.

CoRR, 2018

GraphIt - A High-Performance DSL for Graph Analytics.

CoRR, 2018

Unified Sparse Formats for Tensor Algebra Compilers.

CoRR, 2018

The Three Pillars of Machine-Based Programming.

CoRR, 2018

Automatic Generation of Sparse Tensor Kernels with Workspaces.

CoRR, 2018

Halide: decoupling algorithms from schedules for high-performance image processing.

Commun. ACM, 2018

The three pillars of machine programming.

Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, 2018

DAWG: A Defense Against Cache Timing Attacks in Speculative Execution Processors.

Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

Gloss: Seamless Live Reconfiguration and Reoptimization of Stream Programs.

Sumanaruban Rajadurai

Jeffrey Bosboom

Weng-Fai Wong

Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

A Unified Backend for Targeting FPGAs from DSLs.

Emanuele Del Sozzo

Marco D. Santambrogio

Proceedings of the 29th IEEE International Conference on Application-specific Systems, 2018

Cimple: instruction and memory level parallelism: a DSL for uncovering ILP and MLP.

Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018

2017

The tensor algebra compiler.

Proc. ACM Program. Lang., 2017

Weld: Rethinking the Interface Between Data-Intensive Applications.

CoRR, 2017

taco: a tool to generate tensor algebra kernels.

Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, 2017

A Common Backend for Hardware Acceleration on FPGA.

Emanuele Del Sozzo

Marco D. Santambrogio

Proceedings of the 2017 IEEE International Conference on Computer Design, 2017

A Common Runtime for High Performance Data Analysis.

Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

Making caches work for graph analytics.

Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

2016

Simit: A Language for Physical Simulation.

ACM Trans. Graph., 2016

Optimizing Cache Performance for Graph Analytics.

CoRR, 2016

Distributed Halide.

Tyler Denniston

Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Optimizing Indirect Memory References with milk.

Vladimir Kiriansky

Yunming Zhang

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015

Helium: lifting high-performance stencil kernels from stripped x86 binaries to halide DSL code.

Sylvain Paris

Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015

Autotuning algorithmic choice for input sensitivity.

Yufei Ding

Kalyan Veeramachaneni

Xipeng Shen

Una-May O'Reilly

Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015

2014

WOSC 2014: second workshop on optimizing stencil computations.

P. Sadayappan

Proceedings of the SPLASH'14, 2014

StreamJIT: a commensal compiler for high-performance stream programming.

Jeffrey Bosboom

Sumanaruban Rajadurai

Weng-Fai Wong

Proceedings of the 2014 ACM International Conference on Object Oriented Programming Systems Languages & Applications, 2014

OpenTuner: an extensible framework for program autotuning.

Kalyan Veeramachaneni

Jeffrey Bosboom

Una-May O'Reilly

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

Detection of false sharing using machine learning.

Proceedings of the International Conference for High Performance Computing, 2013

Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines.

Phitchaya Mangpo Phothilimthana

Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2013

Dynamic expressivity with static optimization for streaming languages.

Proceedings of the 7th ACM International Conference on Distributed Event-Based Systems, 2013

Portable performance on heterogeneous architectures.

Proceedings of the Architectural Support for Programming Languages and Operating Systems, 2013

2012

Decoupling algorithms from schedules for easy optimization of image processing pipelines.

ACM Trans. Graph., 2012

Transparent dynamic instrumentation.

Proceedings of the 8th International Conference on Virtual Execution Environments, 2012

Hyperparameter Tuning in Bandit-Based Adaptive Operator Selection.

Proceedings of the Applications of Evolutionary Computation, 2012

Siblingrivalry: online autotuning through local competitions.

Proceedings of the 15th International Conference on Compilers, 2012

Aikido: accelerating shared data dynamic analyses.

Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, 2012

2011

Dynamic cache contention detection in multi-threaded applications.

Proceedings of the 7th International Conference on Virtual Execution Environments, 2011

Multicore Performance Optimization Using Partner Cores.

Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism, 2011

PetaBricks: a language and compiler based on autotuning.

Proceedings of the High Performance Embedded Architectures and Compilers, 2011

An efficient evolutionary algorithm for solving incrementally structured problems.

Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference, 2011

Language and compiler support for auto-tuning variable-accuracy algorithms.

Proceedings of the CGO 2011, 2011

2010

Efficient memory shadowing for 64-bit architectures.

Proceedings of the 9th International Symposium on Memory Management, 2010

Evaluation of IVR data collection UIs for untrained rural users.

Adam Lerer

Molly Ward

Proceedings of the First ACM Annual Symposium on Computing for Development, 2010

Umbra: efficient and scalable memory shadowing.

Proceedings of the CGO 2010, 2010

An empirical characterization of stream programs and its implications for language and compiler design.

Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009

Tiled Multicore Processors.

Proceedings of the Multicore Processors and Systems, 2009

Automatically patching errors in deployed software.

Proceedings of the 22nd ACM Symposium on Operating Systems Principles 2009, 2009

Autotuning multigrid with PetaBricks.

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

PetaBricks: a language and compiler for algorithmic choice.

Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2009

Manipulating lossless video in the compressed domain.

Steven Hall

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Computer-aided design for microfluidic chips based on multilayer soft lithography.

Nada Amin

Proceedings of the 27th International Conference on Computer Design, 2009

Kendo: efficient deterministic multithreading in software.

Marek Olszewski

Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems, 2009

2008

A lightweight streaming layer for multicore execution.

SIGARCH Comput. Archit. News, 2008

Abstraction layers for scalable microfluidic biocomputing.

Nat. Comput., 2008

How to Do a Million Watchpoints: Efficient Debugging Using Dynamic Instrumentation.

Proceedings of the Compiler Construction, 17th International Conference, 2008

(How) can programmers conquer the multicore menace?

Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, 2008

2007

A step towards unifying schedule and storage optimization.

Frédéric Vivien

ACM Trans. Program. Lang. Syst., 2007

A Practical Approach to Exploiting Coarse-Grained Pipeline Parallelism in C Programs.

Vikram Chandrasekhar

Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-40 2007), 2007

Ubiquitous Memory Introspection.

Proceedings of the Fifth International Symposium on Code Generation and Optimization (CGO 2007), 2007

2006

MPEG-2 decoding in a stream programming language.

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Abstraction Layers for Scalable Microfluidic Biocomputers.

Proceedings of the DNA Computing, 12th International Meeting on DNA Computing, 2006

Exploiting coarse-grained task, data, and pipeline parallelism in stream programs.

Michael I. Gordon

Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, 2006

2005

Scalar Operand Networks.

IEEE Trans. Parallel Distributed Syst., 2005

Interprocedural parallelization analysis in SUIF.

ACM Trans. Program. Lang. Syst., 2005

Teleport messaging for distributed stream programs.

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2005

Exploiting Vector Parallelism in Software Pipelined Loops.

Samuel Larsen

Rodric M. Rabbah

Proceedings of the 38th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-38 2005), 2005

Cache aware optimization of stream programs.

Proceedings of the 2005 ACM SIGPLAN/SIGBED Conference on Languages, 2005

Predicting Unroll Factors Using Supervised Classification.

Mark Stephenson

Proceedings of the 3rd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2005), 2005

Maintaining Consistency and Bounding Capacity of Software Code Caches.

Proceedings of the 3rd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2005), 2005

Multicores from the Compiler's Perspective: A Blessing or a Curse?

Proceedings of the 3rd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2005), 2005

Optimizing stream programs using linear state space analysis.

Sitij Agrawal

Proceedings of the 2005 International Conference on Compilers, 2005

2004

Convergent Scheduling.

Diego Puppin

Mark Stephenson

J. Instr. Level Parallelism, 2004

Evaluation of the Raw Microprocessor: An Exposed-Wire-Delay Architecture for ILP and Streams.

Proceedings of the 31st International Symposium on Computer Architecture (ISCA 2004), 2004

Language and Compiler Design for Streaming Applications.

Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

2003

Meta optimization: improving compiler heuristics with machine learning.

Proceedings of the ACM SIGPLAN 2003 Conference on Programming Language Design and Implementation 2003, 2003

Linear analysis and optimization of stream programs.

Andrew A. Lamb

Proceedings of the ACM SIGPLAN 2003 Conference on Programming Language Design and Implementation 2003, 2003

Phased scheduling of stream programs.

Michal Karczmarek

Proceedings of the 2003 Conference on Languages, 2003

Adapting Convergent Scheduling Using Machine-Learning.

Proceedings of the Languages and Compilers for Parallel Computing, 2003

Dynamic native optimization of interpreters.

Proceedings of the 2003 Workshop on Interpreters, Virtual Machines and Emulators, 2003

High-Bandwidth Packet Switching on the Raw General-Purpose Architecture.

Gleb A. Chuvpilo

Proceedings of the 32nd International Conference on Parallel Processing (ICPP 2003), 2003

Scalar Operand Networks: On-Chip Interconnect for ILP in Partitioned Architecture.

Michael Bedford Taylor

Walter Lee

Anant Agarwal

Proceedings of the Ninth International Symposium on High-Performance Computer Architecture (HPCA'03), 2003

Genetic Programming Applied to Compiler Heuristic Optimization.

Proceedings of the Genetic Programming, 6th European Conference, EuroGP 2003, 2003

An Infrastructure for Adaptive Dynamic Optimization.

Timothy Garnett

Proceedings of the 1st IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2003), 2003

2002

A common machine language for grid-based architectures.

SIGARCH Comput. Archit. News, 2002

The Raw Microprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs.

IEEE Micro, 2002

Secure Execution via Program Shepherding.

Vladimir Kiriansky

Proceedings of the 11th USENIX Security Symposium, 2002

Defying the speed of light: a spatially-aware compiler for wire-exposed architectures.

Proceedings of the ACM SIGPLAN ASIA-PEPM 2002, 2002

Convergent scheduling.

Proceedings of the 35th Annual International Symposium on Microarchitecture, 2002

Providing Web search capability for low-connectivity communities.

Libby Levison

Proceedings of the 2002 International Symposium on Technology and Society, 2002

Efficient Pipelining of Nested Loops: Unroll-and-Squash.

Darin Petkov

Randolph E. Harr

Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

StreamIt: A Language for Streaming Applications.

Michal Karczmarek

Proceedings of the Compiler Construction, 11th International Conference, 2002

A stream compiler for communication-exposed architectures.

Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-X), 2002

Increasing and Detecting Memory Address Congruence.

Samuel Larsen

Emmett Witchel

Proceedings of the 2002 International Conference on Parallel Architectures and Compilation Techniques (PACT 2002), 2002

2001

Compiler Support for Scalable and Efficient Memory Systems.

IEEE Trans. Computers, 2001

A Unified Framework for Schedule and Storage Optimization.

Proceedings of the 2001 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 2001

Strength Reduction of Integer Division and Modulo Operations.

Proceedings of the Languages and Compilers for Parallel Computing, 2001

2000

Bitwidth analysis with application to silicon compilation.

Mark Stephenson

Jonathan Babb

Proceedings of the 2000 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 2000

Exploiting superword level parallelism with multimedia instruction sets.

Samuel Larsen

Proceedings of the 2000 ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 2000

FlexCache: A Framework for Flexible Compiler Generated Data Caching.

Csaba Andras Moritz

Matthew I. Frank

Proceedings of the Intelligent Memory Systems, Second International Workshop, 2000

1999

Maps: A Compiler-Managed Memory System for Raw Machines.

Proceedings of the 26th Annual International Symposium on Computer Architecture, 1999

Parallelizing Applications into Silicon.

Proceedings of the 7th IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM '99), 1999

1998

Memory bank disambiguation using modulo unrolling for Raw machines.

Proceedings of the 5th International Conference On High Performance Computing, 1998

Space-Time Scheduling of Instruction-Level Parallelism on a Raw Machine.

Walter Lee

Rajeev Barua

Matthew I. Frank

Devabhaktuni Srikrishna

Jonathan Babb

Vivek Sarkar