Basilio B. Fraguela

Proceedings of the Applications of Evolutionary Computation - 19th European Conference, 2016

2015

Developing adaptive multi-device applications with the Heterogeneous Programming Library.

J. Supercomput., 2015

On Processing Extreme Data.

Scalable Comput. Pract. Exp., 2015

Automatic Generation of Optimized OpenCL Codes Using OCLoptimizer.

Comput. J., 2015

Enhancing and Evaluating the Configuration Capability of a Skeleton for Irregular Computations.

Proceedings of the 23rd Euromicro International Conference on Parallel, 2015

Improving OpenCL Programmability with the Heterogeneous Programming Library.

Proceedings of the International Conference on Computational Science, 2015

2014

Address independent estimation of the boundaries of cache performance.

Microprocess. Microsystems, 2014

An Algorithm Template for Domain-Based Parallel Irregular Algorithms.

Int. J. Parallel Program., 2014

A fine-grained thread-aware management policy for shared caches.

Concurr. Comput. Pract. Exp., 2014

Writing Self-adaptive Codes for Heterogeneous Systems.

Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013

Numerical simulation of pollutant transport in a shallow-water system on the Cell heterogeneous processor.

J. Supercomput., 2013

Virtually split cache: An efficient mechanism to distribute instructions and data.

ACM Trans. Archit. Code Optim., 2013

A framework for argument-based task synchronization with automatic detection of dependencies.

Parallel Comput., 2013

Accurate prediction of the behavior of multithreaded applications in shared caches.

Parallel Comput., 2013

Exploiting heterogeneous parallelism with the Heterogeneous Programming Library.

Moisés Viñas

Zeki Bozkus

J. Parallel Distributed Comput., 2013

Parallelization of shallow water simulations on current multi-threaded systems.

Int. J. High Perform. Comput. Appl., 2013

A multi-GPU shallow-water simulation with transport of contaminants.

Concurr. Comput. Pract. Exp., 2013

Graphics processing unit computing and exploitation of hardware accelerators.

Enrique S. Quintana-Ortí

Robert Strzodka

Concurr. Comput. Pract. Exp., 2013

OCLoptimizer: An Iterative Optimization Tool for OpenCL.

Jorge F. Fabeiro

Proceedings of the International Conference on Computational Science, 2013

2012

Static analysis of the worst-case memory performance for irregular codes with indirections.

ACM Trans. Archit. Code Optim., 2012

Optimization techniques for efficient HTA programs.

Parallel Comput., 2012

Special issue editorial: Exploitation of hardware accelerators.

Margarita Amor

Microprocess. Microsystems, 2012

Special issue editorial: Accelerators for high-performance computing.

J. Parallel Distributed Comput., 2012

Automatic mapping of parallel applications on multicore architectures using the Servet benchmark suite.

Jorge González-Domínguez

Comput. Electr. Eng., 2012

Using an Analytical Model of Shared Caches for Selecting the Optimal Parallelization Scheme.

Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

A Portable High-Productivity Approach to Program Heterogeneous Systems.

Zeki Bozkus

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Adaptive Set-Granular Cooperative Caching.

Proceedings of the 18th IEEE International Symposium on High Performance Computer Architecture, 2012

2011

An efficient parallel set container for multicore architectures.

Álvaro de Vega

Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

Simulation of pollutant transport in shallow water on a CUDA architecture.

Proceedings of the 2011 International Conference on High Performance Computing & Simulation, 2011

2010

Address-Independent Estimation of the Worst-case Memory Performance.

IEEE Trans. Ind. Informatics, 2010

Servet: A benchmark suite for autotuning on multicore clusters.

Jorge González-Domínguez

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

A Generic Algorithm Template for Divide-and-Conquer in Multicore Systems.

Proceedings of the 12th IEEE International Conference on High Performance Computing and Communications, 2010

Reducing capacity and conflict misses using Set Saturation Levels.

Proceedings of the 2010 International Conference on High Performance Computing, 2010

Streaming-Oriented Parallelization of Domain-Independent Irregular Kernels.

Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

2009

Writing productive stencil codes with overlapped tiling.

Concurr. Comput. Pract. Exp., 2009

Static Prediction of Worst-Case Data Cache Performance in the Absence of Base Address Information.

Proceedings of the 15th IEEE Real-Time and Embedded Technology and Applications Symposium, 2009

Performance Evaluation of MPI, UPC and OpenMP on Multicore Architectures.

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Task-Parallel versus Data-Parallel Library-Based Programming in Multicore Systems.

Proceedings of the 17th Euromicro International Conference on Parallel, 2009

Adaptive line placement with the set balancing cache.

Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Performance Evaluation of Unified Parallel C Collective Communications.

Proceedings of the 11th IEEE International Conference on High Performance Computing and Communications, 2009

Automatic Tuning of Discrete Fourier Transforms Driven by Analytical Modeling.

Yevgen Voronenko

Markus Püschel

Proceedings of the PACT 2009, 2009

2008

Design Issues in Parallel Array Languages for Shared Memory.

Proceedings of the Embedded Computer Systems: Architectures, 2008

Programming with tiles.

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008

2007

Precise automatable analytical modeling of the cache behavior of codes with indirections.

ACM Trans. Archit. Code Optim., 2007

Special Issue: Current Trends in Compilers for Parallel Computers.

Concurr. Comput. Pract. Exp., 2007

Automated and accurate cache behavior analysis for codes with irregular access patterns.

Concurr. Comput. Pract. Exp., 2007

2006

Analytical modeling of codes with arbitrary data-dependent conditional structures.

J. Syst. Archit., 2006

Programming for parallelism and locality with hierarchically tiled arrays.

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

Design and Use of htalib - A Library for Hierarchically Tiled Arrays.

Proceedings of the Languages and Compilers for Parallel Computing, 2006

Cache Behavior Modelling for Codes Involving Banded Matrices.

Proceedings of the Languages and Compilers for Parallel Computing, 2006

Hierarchically tiled arrays for parallelism and locality.

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005

Optimal Tile Size Selection Guided by Analytical Models.

M. G. Carmueja

Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

2004

A compiler tool to predict memory hierarchy performance of scientific codes.

Parallel Comput., 2004

The Hierarchically Tiled Arrays programming approach.

Proceedings of the 7th Workshop on languages, 2004

Implementation of Parallel Numerical Algorithms Using Hierarchically Tiled Arrays.

Proceedings of the Languages and Compilers for High Performance Computing, 2004

Modeling the Cache Behavior of Codes with Arbitrary Data-Dependent Conditional Structures.

Proceedings of the Advances in Computer Systems Architecture, 9th Asia-Pacific Conference, 2004

2003

Probabilistic Miss Equations: Evaluating Memory Hierarchy Performance.

IEEE Trans. Computers, 2003

Cache Behavior Modeling of Codes with Data-Dependent Conditionals.

Proceedings of the Software and Compilers for Embedded Systems, 7th International Workshop, 2003

Programming the FlexRAM parallel intelligent memory system.

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2003

Programming for Locality and Parallelism with Hierarchically Tiled Arrays.

Proceedings of the Languages and Compilers for Parallel Computing, 2003

1999

Memory Hierarchy Performance Prediction for Blocked Sparse Algorithms.

Parallel Process. Lett., 1999

Direct mapped cache performance modeling for sparse matrix operations.

Proceedings of the Seventh Euromicro Workshop on Parallel and Distributed Processing. PDP'99, 1999

Set Associative Cache Behavior Optimization.

Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

Automatic Analytical Modeling for the Estimation of Cache Misses.

Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999

1998

Modeling Set Associative Caches Behavior for Irregular Computations.

Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1998

Cache Misses Prediction for High Performance Sparse Algorithms.

Proceedings of the Euro-Par '98 Parallel Processing, 1998

Cache Probabilistic Modeling for Basic Sparse Algebra Kernels Involving Matrices with a Non Uniform Distribution.

Alejandro Quintela-del-Río

Proceedings of the 24th EUROMICRO '98 Conference, 1998

1996

Evaluation of vectorization/parallelization techniques: application to nonparametric curve estimation.

Ramón Doallo Biempica

B. B. Fraguela-Rodríguez

Stat. Comput., 1996

Parallel Sparse Modified Gram-Schmidt QR Decomposition.

Proceedings of the High-Performance Computing and Networking, 1996

1995

Extending CAML Light to Perform Distributed Computation.

José Luis Freire