Gihan R. Mudalige

Orcid: 0000-0002-1398-5174

According to our database, Gihan R. Mudalige authored at least 49 papers between 2006 and 2023.

Bibliography

2023
Predictive Analysis of Code Optimisations on Large-Scale Coupled CFD-Combustion Simulations using the CPX Mini-App.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Communication-Avoiding Optimizations for Large-Scale Unstructured-Mesh Applications with OP2.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

2022
Scalable Many-Core Algorithms for Tridiagonal Solvers.
Comput. Sci. Eng., 2022

High Throughput Multidimensional Tridiagonal Systems Solvers on FPGAs.
CoRR, 2022

FPGA Acceleration of Structured-Mesh-Based Explicit and Implicit Numerical Solvers using SYCL.
Proceedings of the IWOCL'22: International Workshop on OpenCL, Bristol, United Kingdom, May 10, 2022

High throughput multidimensional tridiagonal system solvers on FPGAs.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

Towards Virtual Certification of Gas Turbine Engines With Performance-Portable Simulations.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2021
Under the Hood of SYCL - An Initial Performance Analysis with An Unstructured-Mesh CFD Application.
Proceedings of the High Performance Computing - 36th International Conference, 2021

High-Level FPGA Accelerator Design for Structured-Mesh-Based Explicit Numerical Solvers.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Predictive Analysis of Large-Scale Coupled CFD Simulations with the CPX Mini-App.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

2020
Warwick Data Store: A Data Structure Abstraction Library.
Proceedings of the 2020 IEEE/ACM Performance Modeling, 2020

Modernising an Industrial CFD Application.
Proceedings of the Eighth International Symposium on Computing and Networking Workshops, 2020

Bitwise Reproducible task execution on unstructured mesh applications.
Proceedings of the 20th IEEE/ACM International Symposium on Cluster, 2020

2019
Locality optimized unstructured mesh algorithms on GPUs.
J. Parallel Distributed Comput., 2019

Improving resilience of scientific software through a domain-specific approach.
J. Parallel Distributed Comput., 2019

Large-scale performance of a DSL-based multi-block structured-mesh application for Direct Numerical Simulation.
J. Parallel Distributed Comput., 2019

Batch Solution of Small PDEs with the OPS DSL.
Proceedings of the High Performance Computing, 2019

2018
Loop Tiling in Large-Scale Stencil Codes at Run-Time with OPS.
IEEE Trans. Parallel Distributed Syst., 2018

Improving Locality of Unstructured Mesh Algorithms on GPUs.
CoRR, 2018

2017
Beyond 16GB: Out-of-Core Stencil Computations.
Proceedings of the Workshop on Memory Centric Programming for HPC, 2017

Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2017

Achieving Performance Portability for a Heat Conduction Solver Mini-Application on Modern Multi-core Systems.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016
Acceleration of a Full-Scale Industrial CFD Application with OP2.
IEEE Trans. Parallel Distributed Syst., 2016

Vectorizing unstructured mesh computations for many-core architectures.
Concurr. Comput. Pract. Exp., 2016

Auto-vectorizing a large-scale production unstructured-mesh CFD application.
Proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing, 2016

2015
Design and Development of Domain Specific Active Libraries with Proxy Applications.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
The OPS domain specific abstraction for multi-block structured grid computations.
Proceedings of the Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, 2014

Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

2013
Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems.
Parallel Comput., 2013

Designing OP2 for GPU architectures.
J. Parallel Distributed Comput., 2013

Loop Chaining: A Programming Abstraction for Balancing Locality and Parallelism.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

2012
Predictive modeling and analysis of OP2 on distributed memory GPU clusters.
SIGMETRICS Perform. Evaluation Rev., 2012

On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures.
Comput. J., 2012

Performance Analysis and Optimization of the OP2 Framework on Many-Core Architectures.
Comput. J., 2012

An Analytical Study of Loop Tiling for a Large-Scale Unstructured Mesh Application.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Compiler Optimizations for Industrial Unstructured Mesh CFD Applications on GPUs.
Proceedings of the Languages and Compilers for Parallel Computing, 2012

Mesh independent loop fusion for unstructured mesh applications.
Proceedings of the Computing Frontiers Conference, CF'12, 2012

2011
Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark.
SIGMETRICS Perform. Evaluation Rev., 2011

Performance analysis of the OP2 framework on many-core architectures.
SIGMETRICS Perform. Evaluation Rev., 2011

Predictive analysis of a hydrodynamics application on large-scale CMP clusters.
Comput. Sci. Res. Dev., 2011

Design and Performance of the OP2 Library for Unstructured Mesh Applications.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

2010
To upgrade or not to upgrade? Catamount vs. Cray Linux Environment.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009
Predictive analysis and optimisation of pipelined wavefront applications using reusable analytic models.
PhD thesis, 2009

Performance prediction and procurement in practice: assessing the suitability of commodity cluster components for wavefront codes.
IET Softw., 2009

WARPP: a toolkit for simulating high-performance parallel scientific codes.
Proceedings of the 2nd International Conference on Simulation Tools and Techniques for Communications, 2009

Predictive analysis and optimisation of pipelined wavefront computations.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Predictive Simulation of HPC Applications.
Proceedings of the IEEE 23rd International Conference on Advanced Information Networking and Applications, 2009

2008
A plug-and-play model for evaluating wavefront computations on parallel architectures.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2006
Predictive Performance Analysis of a Parallel Pipelined Synchronous Wavefront Application for Commodity Processor Cluster Systems.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

