Ravi Reddy

Orcid: 0000-0001-9181-3290

Affiliations:
  • University College Dublin, Ireland


According to our database1, Ravi Reddy authored at least 63 papers between 2000 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SUARA: A scalable universal allreduce communication algorithm for acceleration of parallel deep learning applications.
J. Parallel Distributed Comput., January, 2024

OpenH: A Novel Programming Model and API for Developing Portable Parallel Programs on Heterogeneous Hybrid Servers.
IEEE Access, 2024

2023
Energy-Efficient Parallel Computing: Challenges to Scaling.
Inf., April, 2023

Efficient exact algorithms for continuous bi-objective performance-energy optimization of applications with linear energy and monotonically increasing performance profiles on heterogeneous high performance computing platforms.
Concurr. Comput. Pract. Exp., 2023

Acceleration of Bi-Objective Optimization of Data-Parallel Applications for Performance and Energy on Heterogeneous Hybrid Platforms.
IEEE Access, 2023

2022
Novel bi-objective optimization algorithms minimizing the max and sum of vectors of functions.
CoRR, 2022

On Energy Nonproportionality of CPUs and GPUs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

2021
Bi-Objective Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Performance and Energy Through Workload Distribution.
IEEE Trans. Parallel Distributed Syst., 2021

Improving the accuracy of energy predictive models for multicore CPUs by combining utilization and performance events model variables.
J. Parallel Distributed Comput., 2021

Energy Predictive Models of Computing: Theory, Practical Implications and Experimental Analysis on Multicore Processors.
IEEE Access, 2021

A Novel Algorithm for Bi-objective Performance-Energy Optimization of Applications with Continuous Performance and Linear Energy Profiles on Heterogeneous HPC Platforms.
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021

2020
The 27th International Heterogeneity in Computing Workshop and the 16th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms.
Concurr. Comput. Pract. Exp., 2020

A novel data partitioning algorithm for dynamic energy optimization on heterogeneous high-performance computing platforms.
Concurr. Comput. Pract. Exp., 2020

A Comparative Study of Techniques for Energy Predictive Modeling Using Performance Monitoring Counters on Modern Multicore CPUs.
IEEE Access, 2020

A Hierarchical Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous Multi-Accelerator NUMA Nodes.
IEEE Access, 2020

Accurate Energy Modelling of Hybrid Parallel Applications on Modern Heterogeneous Computing Platforms Using System-Level Measurements.
IEEE Access, 2020

2019
A Survey of Communication Performance Models for High-Performance Computing.
ACM Comput. Surv., 2019

Modern Multicore CPUs are not Energy Proportional: Opportunity for Bi-objective Optimization for Performance and Energy.
CoRR, 2019

Bi-objective Optimisation of Data-parallel Applications on Heterogeneous Platforms for Performance and Energy via Workload Distribution.
CoRR, 2019

Energy of Computing on Multicore CPUs: Predictive Models and Energy Conservation Law.
CoRR, 2019

Design of self-adaptable data parallel applications on multicore clusters automatically optimized for performance and energy through load distribution.
Concurr. Comput. Pract. Exp., 2019

Improving the Accuracy of Energy Predictive Models for Multicore CPUs Using Additivity of Performance Monitoring Counters.
Proceedings of the Parallel Computing Technologies, 2019

SummaGen: Parallel Matrix-Matrix Multiplication Based on Non-rectangular Partitions for Heterogeneous HPC Platforms.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms for Dynamic Energy Through Workload Distribution.
Proceedings of the Euro-Par 2019: Parallel Processing Workshops, 2019



2018
A Novel Data-Partitioning Algorithm for Performance Optimization of Data-Parallel Applications on Heterogeneous HPC Platforms.
IEEE Trans. Parallel Distributed Syst., 2018

Out-of-core implementation for accelerator kernels on heterogeneous clouds.
J. Supercomput., 2018

Hierarchical multicore thread mapping via estimation of remote communication.
J. Supercomput., 2018

Bi-Objective Optimization of Data-Parallel Applications on Homogeneous Multicore Clusters for Performance and Energy.
IEEE Trans. Computers, 2018

Novel Model-based Methods for Performance Optimization of Multithreaded 2D Discrete Fourier Transform on Multicore Processors.
CoRR, 2018

libhclooc: Software Library Facilitating Out-of-core Implementations of Accelerator Kernels on Hybrid Computing Platforms.
CoRR, 2018

Parallel Data Partitioning Algorithms for Optimization of Data-Parallel Applications on Modern Extreme-Scale Multicore Platforms for Performance and Energy.
IEEE Access, 2018

Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method.
IEEE Access, 2018

How Pre-multicore Methods and Algorithms Perform in Multicore Era.
Proceedings of the High Performance Computing, 2018

Performance Optimization of Multithreaded 2D FFT on Multicore Processors: Challenges and Solution Approaches.
Proceedings of the 25th IEEE International Conference on High Performance Computing Workshops, 2018

2017
New Model-Based Methods and Algorithms for Performance and Energy Optimization of Data Parallel Applications on Homogeneous Multicore Clusters.
IEEE Trans. Parallel Distributed Syst., 2017

Additivity: A Selection Criterion for Performance Events for Reliable Energy Predictive Modeling.
Supercomput. Front. Innov., 2017

A Survey of Power and Energy Predictive Models in HPC Systems and Applications.
ACM Comput. Surv., 2017

2016
Design of a dual-hormone model predictive control for artificial pancreas with exercise model.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016

2011
Design and implementation of self-adaptable parallel algorithms for scientific computing on highly heterogeneous HPC platforms
CoRR, 2011

2010
Experimental Study of Six Different Implementations of Parallel Matrix Multiplication on Heterogeneous Computational Clusters of Multicore Processors.
Proceedings of the 18th Euromicro Conference on Parallel, 2010

2009
HeteroPBLAS: A Set of Parallel Basic Linear Algebra Subprograms Optimized for Heterogeneous Computational Clusters.
Scalable Comput. Pract. Exp., 2009

Parallel solvers for dense linear systems for heterogeneous computational clusters.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Two-Dimensional Matrix Partitioning for Parallel Computing on Heterogeneous Processors Based on Their Functional Performance Models.
Proceedings of the Euro-Par 2009, 2009

Distributed Data Partitioning for Heterogeneous Processors Based on Partial Estimation of Their Functional Performance Models.
Proceedings of the Euro-Par 2009, 2009

2008
Heterogeneous PBLAS: Optimization of PBLAS for Heterogeneous Computational Clusters.
Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC 2008), 2008

Scalable Dense Factorizations for Heterogeneous Computational Clusters.
Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC 2008), 2008

2007
Data distribution for dense factorization on computers with memory heterogeneity.
Parallel Comput., 2007

Data Partitioning with a Functional Performance Model of Heterogeneous Processors.
Int. J. High Perform. Comput. Appl., 2007

A Novel Algorithm of Optimal Matrix Partitioning for Parallel Dense Factorization on Heterogeneous Processors.
Proceedings of the Parallel Computing Technologies, 2007

2006
HeteroMPI: Towards a message-passing library for heterogeneous networks of computers.
J. Parallel Distributed Comput., 2006

Building the functional performance model of a processor.
Proceedings of the 2006 ACM Symposium on Applied Computing (SAC), 2006

HeteroMPI+ScaLAPACK: Towards a ScaLAPACK (Dense Linear Solvers) on Heterogeneous Networks of Computers.
Proceedings of the High Performance Computing, 2006

2005
Data partitioning for multiprocessors with memory heterogeneity and memory constraints.
Sci. Program., 2005

A Variable Group Block Distribution Strategy for Dense Factorizations on Networks of Heterogeneous Computers.
Proceedings of the Parallel Processing and Applied Mathematics, 2005

2004
On performance analysis of heterogeneous parallel algorithms.
Parallel Comput., 2004

Data Partitioning with a Realistic Performance Model of Networks of Heterogeneous Computers with Task Size Limits.
Proceedings of the 3rd International Symposium on Parallel and Distributed Computing (ISPDC 2004), 2004

Data Partitioning with a Realistic Performance Model of Networks of Heterogeneous Computers.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

2003
Classification of Partitioning Problems for Networks of Heterogeneous Computers.
Proceedings of the Parallel Processing and Applied Mathematics, 2003

An Approach to Assessment of Heterogeneous Parallel Algorithms.
Proceedings of the Parallel Computing Technologies, 2003

HMPI: Towards a Message-Passing Library for Heterogeneous Networks of Computers.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

2000
A phased approach towards an open standards based, highly available, scalable architecture with asynchronous message processing.
Proceedings of the Networked Planet: Management Beyond 2000, 2000


  Loading...