José Gracia

According to our database1, José Gracia
  • authored at least 31 papers between 2011 and 2017.
  • has a "Dijkstra number"2 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepage:

On csauthors.net:

Bibliography

2017
Application Productivity and Performance Evaluation of Transparent Locality-aware One-sided Communication Primitives.
IJNC, 2017

Patterns for OpenMP Task Data Dependency Overhead Measurements.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

2016
A Bandwidth-saving Optimization for MPI Broadcast Collective Operation.
CoRR, 2016

Leveraging MPI-3 Shared-Memory Extensions for Efficient PGAS Runtime Systems.
CoRR, 2016

Towards performance portability through locality-awareness for applications using one-sided communication primitives.
CoRR, 2016

Asynchronous progress design for a MPI-based PGAS one-sided communication system.
CoRR, 2016

HPC Benchmarking: Problem Size Matters.
Proceedings of the 7th International Workshop on Performance Modeling, 2016

Asynchronous Progress Design for a MPI-Based PGAS One-Sided Communication System.
Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016

Towards Performance Portability through Locality-Awareness for Applications Using One-Sided Communication Primitives.
Proceedings of the Fourth International Symposium on Computing and Networking, 2016

2015
DART-MPI: An MPI-based Implementation of a PGAS Runtime System.
CoRR, 2015

CppSs - a C++ Library for Efficient Task Parallelism.
CoRR, 2015

A Bandwidth-Saving Optimization for MPI Broadcast Collective Operation.
Proceedings of the 44th International Conference on Parallel Processing Workshops, 2015

Providing Parallel Debugging for DASH Distributed Data Structures with GDB.
Proceedings of the International Conference on Computational Science, 2015

Leveraging MPI-3 Shared-Memory Extensions for Efficient PGAS Runtime Systems.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014
Avoiding Serialization Effects in Data-Dependency aware Task Parallel Algorithms for Spatial Decomposition.
CoRR, 2014

Performance Modeling of the HPCG Benchmark.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

DART-MPI: An MPI-based Implementation of a PGAS Runtime System.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

DASH: Data Structures and Algorithms with Support for Hierarchical Locality.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013
Programmability and portability for exascale: Top down programming methodology and tools with StarSs.
J. Comput. Science, 2013

Cudagrind: A Valgrind Extension for CUDA.
CoRR, 2013

Cudagrind: Memory-Usage Checking for CUDA.
Proceedings of the Tools for High Performance Computing 2013, 2013

POLCA - A Programming Model for Large Scale, Strongly Heterogeneous Infrastructures.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

Cudagrind: A Valgrind Extension for CUDA.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

2012
Hybrid MPI/StarSs - a case study
CoRR, 2012

Task Debugging with TEMANEJO.
Proceedings of the Tools for High Performance Computing 2012, 2012

Avoiding Serialization Effects in Data / Dependency Aware Task Parallel Algorithms for Spatial Decomposition.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Hybrid MPI/StarSs - A Case Study.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Scheduling Overheads for Task-Based Parallel Programming Models.
Proceedings of the Facing the Multicore-Challenge, 2012

2011
TEMANEJO - a debugger for task based parallel programming models
CoRR, 2011

Temanejo: Debugging of Thread-Based Task-Parallel Programs in StarSS.
Proceedings of the Tools for High Performance Computing 2011, 2011

TEMANEJO – a debugger for task based parallel programming models.
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011


  Loading...