Diego R. Llanos Ferraris

According to our database, Diego R. Llanos Ferraris authored at least 48 papers between 1999 and 2017.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Timeline (chart not shown; publication types: Book, In proceedings, Article, PhD thesis, Other)

Bibliography

2017
BFCA+: automatic synthesis of parallel code with TLS capabilities.
The Journal of Supercomputing, 2017

A technique to automatically determine Ad-hoc communication patterns at runtime.
Parallel Computing, 2017

Using the Xeon Phi Platform to Run Speculatively-Parallelized Codes.
International Journal of Parallel Programming, 2017

TORMENT OpenACC2016: A Benchmarking Tool for OpenACC Compilers.
Proceedings of the 25th Euromicro International Conference on Parallel, 2017

Supporting the Xeon Phi Coprocessor in a Heterogeneous Programming Model.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

2016
An OpenMP Extension that Supports Thread-Level Speculation.
IEEE Trans. Parallel Distrib. Syst., 2016

New Data Structures to Handle Speculative Parallelization at Runtime.
International Journal of Parallel Programming, 2016

A Survey on Thread-Level Speculation Techniques.
ACM Comput. Surv., 2016

Comparative Analysis of OpenACC Compilers.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2016

2015
TuCCompi: A Multi-layer Model for Distributed Heterogeneous Computing with Tuning Capabilities.
International Journal of Parallel Programming, 2015

Comprehensive Evaluation of a New GPU-based Approach to the Shortest Path Problem.
International Journal of Parallel Programming, 2015

On the run-time cost of distributed-memory communications generated using the polyhedral model.
Proceedings of the 2015 International Conference on High Performance Computing & Simulation, 2015

Moody Scheduling for Speculative Parallelization.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014
The Shortest-Path Problem: Analysis and Comparison of Methods.
Synthesis Lectures on Theoretical Computer Science, Morgan & Claypool Publishers, 2014

An Extensible System for Multilevel Automatic Data Partition and Mapping.
IEEE Trans. Parallel Distrib. Syst., 2014

Blending Extensibility and Performance in Dense and Sparse Parallel Data Management.
IEEE Trans. Parallel Distrib. Syst., 2014

Optimizing an APSP implementation for NVIDIA GPUs using kernel characterization criteria.
The Journal of Supercomputing, 2014

The BonaFide C Analyzer: automatic loop-level characterization and coverage measurement.
The Journal of Supercomputing, 2014

Squashing Alternatives for Software-Based Speculative Parallelization.
IEEE Trans. Computers, 2014

Exploiting distributed and shared memory hierarchies with Hitmap.
Proceedings of the International Conference on High Performance Computing & Simulation, 2014

A New GCC Plugin-Based Compiler Pass to Add Support for Thread-Level Speculation into OpenMP.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
uBench: exposing the impact of CUDA block geometry in terms of performance.
The Journal of Supercomputing, 2013

Extending a hierarchical tiling arrays library to support sparse data partitioning.
The Journal of Supercomputing, 2013

A new GPU-based approach to the Shortest Path problem.
Proceedings of the International Conference on High Performance Computing & Simulation, 2013

2012
Using SPEC CPU2006 to evaluate the sequential and parallel code generated by commercial and open-source compilers.
The Journal of Supercomputing, 2012

Support for Thread-Level Speculation into OpenMP.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

Using Fermi Architecture Knowledge to Speed up CUDA and OpenCL Programs.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Encapsulated Synchronization and Load-Balance in Heterogeneous Programming.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011
Trasgo: a nested-parallel programming system.
The Journal of Supercomputing, 2011

Automatic Data Partitioning Applied to Multigrid PDE Solvers.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011

Towards a Compiler Framework for Thread-Level Speculation.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011

Understanding the impact of CUDA tuning techniques for Fermi.
Proceedings of the 2011 International Conference on High Performance Computing & Simulation, 2011

Exclusive squashing for thread-level speculation.
Proceedings of the 20th ACM International Symposium on High Performance Distributed Computing, 2011

Robust thread-level speculation.
Proceedings of the 18th International Conference on High Performance Computing, 2011

2010
Effortless and Efficient Distributed Data-Partitioning in Linear Algebra.
Proceedings of the 12th IEEE International Conference on High Performance Computing and Communications, 2010

2008
Just-In-Time Scheduling for Loop-based Speculative Parallelization.
Proceedings of the 16th Euromicro International Conference on Parallel, 2008

2007
New Scheduling Strategies for Randomized Incremental Algorithms in the Context of Speculative Parallelization.
IEEE Trans. Computers, 2007

Review of "Grid Computing Security by Anirban Chakrabarti", Springer, 2007, ISBN 3540444920.
ACM Queue, 2007

2006
TPCC-UVa: an open-source TPC-C implementation for global performance measurement of computer systems.
SIGMOD Record, 2006

Speculative Parallelization.
IEEE Computer, 2006

TPCC-UVa: an open-source TPC-C implementation for parallel and distributed systems.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005
Design Space Exploration of a Software Speculative Parallelization Scheme.
IEEE Trans. Parallel Distrib. Syst., 2005

MESETA: A New Scheduling Strategy for Speculative Parallelization of Randomized Incremental Algorithms.
Proceedings of the 34th International Conference on Parallel Processing Workshops (ICPP 2005 Workshops), 2005

2004
Speculative Parallelization of a Randomized Incremental Convex Hull Algorithm.
Proceedings of the Computational Science and Its Applications, 2004

2003
Toward efficient and robust software speculative parallelization on multiprocessors.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2003

2000
Exploiting parallelism in a network of workstations using COMA-BC.
SIGARCH Computer Architecture News, 2000

Reducing the Replacement Overhead on COMA Protocols for Workstation-Based Architectures.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999
A Configurable ACSL-Based Interface Generator for Simulated Systems.
Simulation, 1999

