María Jesús Garzarán

Konstantinos I. Karantasis

ACM Trans. Archit. Code Optim., 2014

Parallelization of Reordering Algorithms for Bandwidth and Wavefront Reduction.

Proceedings of the International Conference for High Performance Computing, 2014

Evaluation of a Feature Tracking Vision Application on a Heterogeneous Chip.

Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Improving JavaScript performance by deconstructing the type system.

Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2014

Directive-Based Compilers for GPUs.

Proceedings of the Languages and Compilers for Parallel Computing, 2014

Optimization by runtime specialization for sparse matrix-vector multiplication.

Proceedings of the Generative Programming: Concepts and Experiences, 2014

2013

Easy, fast, and energy-efficient object detection on heterogeneous on-chip architectures.

Ehsan Totoni

Mert Dikmen

ACM Trans. Archit. Code Optim., 2013

2012

Optimization techniques for efficient HTA programs.

Parallel Comput., 2012

Performance Portability with the Chapel Language.

Albert Sidelnik

Saeed Maleki

Bradford L. Chamberlain

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Hierarchical overlapped tiling.

Xing Zhou

Jean Pierre Giacalone

Proceedings of the 10th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2012

2011

Scheduling of stream-based real-time applications for heterogeneous systems.

Bruno Virlet

Xing Zhou

Jean Pierre Giacalone

Bob Kuhn

Proceedings of the ACM SIGPLAN/SIGBED 2011 conference on Languages, 2011

An Evaluation of Vectorizing Compilers.

Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010

A Parallel Numerical Solver Using Hierarchically Tiled Arrays.

Proceedings of the Languages and Compilers for Parallel Computing, 2010

2009

ESoftCheck: Removal of Non-vital Checks for Fault Tolerance.

Marc Snir

Proceedings of the CGO 2009, 2009

Optimization of tele-immersion codes.

Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, 2009

2008

Design Issues in Parallel Array Languages for Shared Memory.

Proceedings of the Embedded Computer Systems: Architectures, 2008

Programming with tiles.

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008

P-Ray: A Software Suite for Multi-core Architecture Characterization.

Proceedings of the Languages and Compilers for Parallel Computing, 2008

Efficient software checking for fault tolerance.

Marc Snir

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Automatic generation of a parallel sorting algorithm.

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2007

Techniques for Efficient Software Checking.

Marc Snir

Proceedings of the Languages and Compilers for Parallel Computing, 2007

Optimizing Sorting with Machine Learning Algorithms.

Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

07361 Abstracts Collection -- Programming Models for Ubiquitous Parallelism.

Proceedings of the Programming Models for Ubiquitous Parallelism, 02.09. - 07.09.2007, 2007

07361 Introduction -- Programming Models for Ubiquitous Parallelism.

Proceedings of the Programming Models for Ubiquitous Parallelism, 02.09. - 07.09.2007, 2007

Compiler Optimizations for Fault Tolerance Software Checking.

Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques (PACT 2007), 2007

2006

In search of a program generator to implement generic transformations for high-performance computing.

Albert Cohen

Sébastien Donadio

Christoph Armin Herrmann

Oleg Kiselyov

Sci. Comput. Program., 2006

Programming for parallelism and locality with hierarchically tiled arrays.

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

Design and Use of htalib - A Library for Hierarchically Tiled Arrays.

Proceedings of the Languages and Compilers for Parallel Computing, 2006

Hierarchically tiled arrays for parallelism and locality.

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005

Tradeoffs in buffering speculative memory state for thread-level speculation in multiprocessors.

ACM Trans. Archit. Code Optim., 2005

Is Search Really Necessary to Generate High-Performance BLAS?

Proc. IEEE, 2005

Optimizing Matrix Multiplication with a Classifier Learning System.

Proceedings of the Languages and Compilers for Parallel Computing, 2005

Analytic Models and Empirical Search: A Hybrid Approach to Code Optimization.

Proceedings of the Languages and Compilers for Parallel Computing, 2005

A Language for the Compact Representation of Multiple Program Versions.

Proceedings of the Languages and Compilers for Parallel Computing, 2005

Optimizing Sorting with Genetic Algorithms.

Proceedings of the 3rd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2005), 2005

2004

The Hierarchically Tiled Arrays programming approach.

Proceedings of the 7th Workshop on languages, 2004

Implementation of Parallel Numerical Algorithms Using Hierarchically Tiled Arrays.

Proceedings of the Languages and Compilers for High Performance Computing, 2004

A Dynamically Tuned Sorting Library.

Proceedings of the 2nd IEEE / ACM International Symposium on Code Generation and Optimization (CGO 2004), 2004

2003

A comparison of empirical and model-driven optimization.

Proceedings of the ACM SIGPLAN 2003 Conference on Programming Language Design and Implementation, 2003

The Power of Belady's Algorithm in Register Allocation for Long Basic Blocks.

Jia Guo