Rafael Mayo

According to our database1, Rafael Mayo authored at least 132 papers between 1993 and 2018.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2018
Performance Model of MapReduce Iterative Applications for Hybrid Cloud Bursting.
IEEE Trans. Parallel Distrib. Syst., 2018

DMR API: Improving cluster productivity by turning applications into malleable.
Parallel Computing, 2018

On the adequacy of lightweight thread approaches for high-level parallel programming models.
Future Generation Comp. Syst., 2018

Boosting Advanced Computational Applications and Resources in Latin America through Collaboration and Sharing.
Computing in Science and Engineering, 2018

GSaaS: A Service to Cloudify and Schedule GPUs.
IEEE Access, 2018

2017
Time and energy modeling of a high-performance multi-threaded Cholesky factorization.
The Journal of Supercomputing, 2017

A simple model to exploit reliable algorithms in cloud federations.
Soft Comput., 2017

Scheduling multiple virtual environments in cloud federations for distributed calculations.
Future Generation Comp. Syst., 2017

Efficient Scalable Computing through Flexible Applications and Adaptive Workloads.
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017

GLTO: On the Adequacy of Lightweight Thread Approaches for OpenMP Implementations.
Proceedings of the 46th International Conference on Parallel Processing, 2017

GLT: A Unified API for Lightweight Thread Libraries.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

Cost Model and Analysis of Iterative MapReduce Applications for Hybrid Cloud Bursting.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

Evaluation of Data Locality Strategies for Hybrid Cloud Bursting of Iterative MapReduce.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

Benchmarking Performance: Influence of Task Location on Cluster Throughput.
Proceedings of the High Performance Computing - 4th Latin American Conference, 2017

2016
Adapting Reproducible Research Capabilities to Resilient Distributed Calculations.
IJGHPC, 2016

Mathematical analysis of the spreading of a rumor among different subgroups of spreaders.
CoRR, 2016

The Latin American Giant Observatory: a successful collaboration in Latin America based on Cosmic Rays and computer science domains.
CoRR, 2016

Architecture-aware configuration and scheduling of matrix multiplication on asymmetric multicore processors.
Cluster Computing, 2016

A Review of Lightweight Thread Approaches for High Performance Computing.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

Enabling GPU Virtualization in Cloud Environments.
Proceedings of the CLOSER 2016, 2016

Fostering Collaboration in Energy Research and Technological Developments Applying New Exascale HPC Techniques.
Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

The Latin American Giant Observatory: A Successful Collaboration in Latin America Based on Cosmic Rays and Computer Science Domains.
Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016


On exploiting data locality for iterative mapreduce applications in hybrid clouds.
Proceedings of the 3rd IEEE/ACM International Conference on Big Data Computing, 2016

2015
Time and energy modeling of high-performance Level-3 BLAS on x86 architectures.
Simulation Modelling Practice and Theory, 2015

A Comparative Analysis of Adaptive Solutions for Grid Environments.
International Journal of Parallel Programming, 2015

Reducing the cost of power monitoring with DC wattmeters.
Computer Science - R&D, 2015

GWpilot: Enabling multi-level scheduling in distributed infrastructures with GridWay and pilot jobs.
Future Generation Comp. Syst., 2015

Challenges and characterization of a Biological system on Grid by means of the PhyloGrid application.
CoRR, 2015

Architecture-Aware Configuration and Scheduling of Matrix Multiplication on Asymmetric Multicore Processors.
CoRR, 2015

Performance and Energy Optimization of Matrix Multiplication on Asymmetric big.LITTLE Processors.
CoRR, 2015

A novel pilot job approach for improving the execution of distributed codes: application to the study of ordering in collisional transport in fusion plasmas.
Concurrency and Computation: Practice and Experience, 2015

Improving the user experience of the rCUDA remote GPU virtualization framework.
Concurrency and Computation: Practice and Experience, 2015

Out-of-core macromolecular simulations on multithreaded architectures.
Concurrency and Computation: Practice and Experience, 2015

Time and energy modeling of an INTRA-ONLY HEVC encoder.
Proceedings of the 2015 Visual Communications and Image Processing, 2015

Enabling Big Data Analytics in the Hybrid Cloud Using Iterative MapReduce.
Proceedings of the 8th IEEE/ACM International Conference on Utility and Cloud Computing, 2015

Exploiting Task-Parallelism on GPU Clusters via OmpSs and rCUDA Virtualization.
Proceedings of the 2015 IEEE TrustCom/BigDataSE/ISPA, 2015

Evaluation of an adaptive framework for resilient Monte Carlo executions.
Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

User-Guided Provisioning in Federated Clouds for Distributed Calculations.
Proceedings of the Adaptive Resource Management and Scheduling for Cloud Computing, 2015

Evaluating the Potential of Low Power Systems for Headphone-based Spatial Audio Applications.
Proceedings of the International Conference on Computational Science, 2015

Vectorization of binaural sound virtualization on the ARM Cortex-A15 architecture.
Proceedings of the 23rd European Signal Processing Conference, 2015

A Resilient Methodology for Accessing and Exploiting Data and Scientific Codes on Distributed Environments.
Proceedings of the 18th IEEE International Conference on Computational Science and Engineering, 2015

Exploring the Suitability of Remote GPGPU Virtualization for the OpenACC Programming Model Using rCUDA.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
A complete and efficient CUDA-sharing solution for HPC clusters.
Parallel Computing, 2014

Automatic detection of power bottlenecks in parallel scientific applications.
Computer Science - R&D, 2014

Modeling power and energy of the task-parallel Cholesky factorization on multicore processors.
Computer Science - R&D, 2014

Modeling power and energy consumption of dense matrix factorizations on multicore processors.
Concurrency and Computation: Practice and Experience, 2014

Enhancing performance and energy consumption of runtime schedulers for dense linear algebra.
Concurrency and Computation: Practice and Experience, 2014

Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems.
Cluster Computing, 2014

Adaptive Downtime for Live Migration of Virtual Machines.
Proceedings of the 7th IEEE/ACM International Conference on Utility and Cloud Computing, 2014

SLURM Support for Remote GPU Virtualization: Implementation and Performance Study.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Analyzing the Energy Efficiency of the Memory Subsystem in Multicore Processors.
Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2014

Author's retrospective for biomedical image analysis on a cooperative cluster of gpus and multicores.
Proceedings of the ACM International Conference on Supercomputing 25th Anniversary Volume, 2014

Parallel performance and energy efficiency of modern video encoders on multithreaded architectures.
Proceedings of the 22nd European Signal Processing Conference, 2014

Evaluating the Impact of Virtualization on Performance and Power Dissipation.
Proceedings of the CLOSER 2014, 2014

2013
Energy-efficient execution of dense linear algebra algorithms on multi-core processors.
Cluster Computing, 2013

Montera: A Framework for Efficient Execution of Monte Carlo Codes on Grid Infrastructures.
Computing and Informatics, 2013

Solving Some Mysteries in Power Monitoring of Servers: Take Care of Your Wattmeters!
Proceedings of the Energy Efficiency in Large Scale Distributed Systems, 2013

Runtime Scheduling of the LU Factorization: Performance and Energy.
Proceedings of the Energy Efficiency in Large Scale Distributed Systems, 2013

Influence of InfiniBand FDR on the performance of remote GPU virtualization.
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

2012
A simulator to assess energy saving strategies and policies in HPC workloads.
Operating Systems Review, 2012

Color and texture analysis on emerging parallel architectures.
IJHPCA, 2012

Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors.
Computer Science - R&D, 2012

DVFS-control techniques for dense linear algebra operations on multi-core processors.
Computer Science - R&D, 2012

Analysis of Strategies to Save Energy for Message-Passing Dense Linear Algebra Kernels.
Proceedings of the 20th Euromicro International Conference on Parallel, 2012

Saving Energy in the LU Factorization with Partial Pivoting on Multi-core Processors.
Proceedings of the 20th Euromicro International Conference on Parallel, 2012

Binding Performance and Power of Dense Linear Algebra Operations.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Reducing Energy Consumption of Dense Linear Algebra Operations on Hybrid CPU-GPU Platforms.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Performance improvements for the neoclassical transport calculation on Grid by means of pilot jobs.
Proceedings of the 2012 International Conference on High Performance Computing & Simulation, 2012

Leveraging Task-Parallelism in Energy-Efficient ILU Preconditioners.
Proceedings of the ICT as Key Technology against Global Warming, 2012

Tools for Power-Energy Modelling and Analysis of Parallel Scientific Applications.
Proceedings of the 41st International Conference on Parallel Processing, 2012

CU2rCU: Towards the complete rCUDA remote GPU virtualization and sharing solution.
Proceedings of the 19th International Conference on High Performance Computing, 2012

2011
Color and texture analysis using emerging parallel architectures.
IJHPCA, 2011

Using a Simple Prioritisation Mechanism to Effectively Interoperate Service and Opportunistic Grids in the EELA-2 e-Infrastructure.
J. Grid Comput., 2011

A parallel solver for huge dense linear systems.
Computer Physics Communications, 2011

Large-scale linear system solver using secondary storage: Self-energy in hybrid nanostructures.
Computer Physics Communications, 2011

More Efficient Executions of Monte Carlo Fusion Codes by Means of Montera: The ISDEP Use Case.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011

Symmetric Rank-k Update on Clusters of Multicore Processors with SMPSs.
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

Power-aware Dense Linear Algebra Implementations on Multi-core and Many-core Processors.
Proceedings of the 3rd Many-core Applications Research Community (MARC) Symposium. Proceedings of the 3rd MARC Symposium, 2011

Evaluation of the Energy Performance of Dense Linear Algebra Kernels on Multi-core and Many-Core Processors.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Improving power efficiency of dense linear algebra algorithms on multi-core processors via slack control.
Proceedings of the 2011 International Conference on High Performance Computing & Simulation, 2011

Performance of CUDA Virtualized Remote GPUs in High Performance Clusters.
Proceedings of the International Conference on Parallel Processing, 2011

Enabling CUDA acceleration within virtual machines using rCUDA.
Proceedings of the 18th International Conference on High Performance Computing, 2011

Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms.
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011

2010
Advances in the Biomedical Applications of the EELA Project
CoRR, 2010

PhyloGrid: a development for a workflow in Phylogeny
CoRR, 2010

Executions of a Fusion Drift Kinetic Equation Solver on Grid.
Proceedings of the 18th Euromicro Conference on Parallel, 2010

From GEM to gGEM.
Proceedings of the 18th Euromicro Conference on Parallel, 2010

A Grid Version of the Fusion Code FAFNER.
Proceedings of the 18th Euromicro Conference on Parallel, 2010

Scalable Phylogenetics through Input Preprocessing.
Proceedings of the Advances in Bioinformatics, 2010

FAFNER2: A comparison between the Grid and the MPI versions of the code.
Proceedings of the 2010 International Conference on High Performance Computing & Simulation, 2010

rCUDA: Reducing the number of GPU-based accelerators in high performance clusters.
Proceedings of the 2010 International Conference on High Performance Computing & Simulation, 2010

Grid selection of models of nucleotide substitution.
Proceedings of the Healthgrid Applications and Core Technologies, 2010

Characterization of antigenetic serotypes from the dengue virus in Venezuela by means of Grid Computing.
Proceedings of the Healthgrid Applications and Core Technologies, 2010

EnergySaving Cluster Roll: Power Saving System for Clusters.
Proceedings of the Architecture of Computing Systems, 2010

2009
Toward the parallelization of GSL.
The Journal of Supercomputing, 2009

Out-of-core solution of linear systems on graphics processors.
IJPEDS, 2009

Exploiting the capabilities of modern GPUs for dense matrix computations.
Concurrency and Computation: Practice and Experience, 2009

Exploring the GPU for Enhancing Parallelism on Color and Texture Analysis.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures.
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

Computational Challenges on Grid Computing for Workflows Applied to Phylogeny.
Proceedings of the Distributed Computing, 2009

The evolution of HPV by means of a phylogenetic study.
Proceedings of the Healthgrid Research, Innovation and Business Case - Proceedings of HealthGrid 2009, Berlin, Germany, 29 June, 2009

An Efficient Implementation of GPU Virtualization in High Performance Clusters.
Proceedings of the Euro-Par 2009, 2009

An Extension of the StarSs Programming Model for Platforms with Multiple GPUs.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008
Attaining High Performance in General-Purpose Computations on Current Graphics Processors.
Proceedings of the High Performance Computing for Computational Science, 2008

Evaluation and tuning of the Level 3 CUBLAS for graphics processors.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Biomedical image analysis on a cooperative cluster of GPUs and multicores.
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Solving Dense Linear Systems on Graphics Processors.
Proceedings of the Euro-Par 2008, 2008

2007
Stabilizing large-scale generalized systems on parallel computers using multithreading and message-passing.
Concurrency and Computation: Practice and Experience, 2007

Strategies for Parallelizing the Solution of Rational Matrix Equations.
Proceedings of the Parallel Computing: Architectures, 2007

Parallel Implementation of LQG Balanced Truncation for Large-Scale Systems.
Proceedings of the Large-Scale Scientific Computing, 6th International Conference, 2007

Advances in the biomedical applications of the EELA Project.
Proceedings of the From Genes to Personalized HealthCare: Grid Solutions for the Life Sciences, 2007

Applications ported to the EELA e-Infrastructure.
Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

2006
Parallelization of GSL: The Web Service Interface.
Proceedings of the 14th Euromicro International Conference on Parallel, 2006

Biomedical Aplications in EELA.
Proceedings of the Challenges and Opportunities of HealthGrids, 2006

Parallel Solution of Large-Scale and Sparse Generalized Algebraic Riccati Equations.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

2005
Parallelization of GSL on Clusters of Symmetric Multiprocessors.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Parallel Order Reduction via Balanced Truncation for Optimal Cooling of Steel Profiles.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

2004
Improving Instruction Set Architecture learning results.
Proceedings of the 2004 workshop on Computer architecture education, 2004

Parallelization of GSL: Architecture, Interfaces, and Programming Models.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Parallel Algorithms for Balanced Truncation Model Reduction of Sparse Systems.
Proceedings of the Applied Parallel Computing, 2004


Parallelization of the GNU Scientific Library on Heterogeneous Systems.
Proceedings of the 3rd International Symposium on Parallel and Distributed Computing (ISPDC 2004), 2004

2003
Remote Model Reduction of Very Large Linear Systems.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

2002
Parallel Algorithms for LQ Optimal Control of Discrete-Time Periodic Linear Systems.
J. Parallel Distrib. Comput., 2002

Remote Parallel Model Reduction of Linear Time-Invariant Systems Made Easy.
Proceedings of the High Performance Computing for Computational Science, 2002

Enhanced Services for Remote Model Reduction of Large-Scale Dense Linear Systems.
Proceedings of the Applied Parallel Computing Advanced Scientific Computing, 2002

Solving Large Sparse Lyapunov Equations on Parallel Computers (Research Note).
Proceedings of the Euro-Par 2002, 2002

2001
Parallel solvers for discrete-time algebric Riccati equations.
Concurrency and Computation: Practice and Experience, 2001

2000
Solving Discrete-Time Periodic Riccati Equations on a Cluster (Research Note).
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1993
A tool-kit for the design and simulation of systolic algorithms.
Proceedings of the 1993 Euromicro Workshop on Parallel and Distributed Processing, 1993


  Loading...