I. Z. Reguly

Orcid: 0000-0002-4385-4204

According to our database, I. Z. Reguly authored at least 48 papers between 2012 and 2023.

Bibliography

2023
Comparative evaluation of bandwidth-bound applications on the Intel Xeon CPU MAX Series.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Evaluating the performance portability of SYCL across CPUs and GPUs on bandwidth-bound applications.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Quantifying and comparing the impact of combinations of non-pharmaceutical interventions on the spread of COVID-19.
Proceedings of the 31st Mediterranean Conference on Control and Automation, 2023

Communication-Avoiding Optimizations for Large-Scale Unstructured-Mesh Applications with OP2.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

2022
Microsimulation based quantitative analysis of COVID-19 management strategies.
PLoS Comput. Biol., 2022

Scalable Many-Core Algorithms for Tridiagonal Solvers.
Comput. Sci. Eng., 2022

High Throughput Multidimensional Tridiagonal Systems Solvers on FPGAs.
CoRR, 2022

FPGA Acceleration of Structured-Mesh-Based Explicit and Implicit Numerical Solvers using SYCL.
Proceedings of the IWOCL'22: International Workshop on OpenCL, Bristol, United Kingdom, May 10, 2022

High throughput multidimensional tridiagonal system solvers on FPGAs.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

Towards Virtual Certification of Gas Turbine Engines With Performance-Portable Simulations.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2021
Under the Hood of SYCL - An Initial Performance Analysis with An Unstructured-Mesh CFD Application.
Proceedings of the High Performance Computing - 36th International Conference, 2021

High-Level FPGA Accelerator Design for Structured-Mesh-Based Explicit Numerical Solvers.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Predictive Analysis of Large-Scale Coupled CFD Simulations with the CPX Mini-App.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

Automatic Parallelisation of Structured Mesh Computations with SYCL.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Performance Portability of the MG-CFD Mini-App with SYCL.
Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020

Modernising an Industrial CFD Application.
Proceedings of the Eighth International Symposium on Computing and Networking Workshops, 2020

Bitwise Reproducible task execution on unstructured mesh applications.
Proceedings of the 20th IEEE/ACM International Symposium on Cluster, 2020

Automatic parallel implementations of adjoint codes for structured mesh applications.
Proceedings of the 20th IEEE/ACM International Symposium on Cluster, 2020

2019
Using GPUs to accelerate computational diffusion MRI: From microstructure estimation to tractography and connectomes.
NeuroImage, 2019

Locality optimized unstructured mesh algorithms on GPUs.
J. Parallel Distributed Comput., 2019

Improving resilience of scientific software through a domain-specific approach.
J. Parallel Distributed Comput., 2019

Large-scale performance of a DSL-based multi-block structured-mesh application for Direct Numerical Simulation.
J. Parallel Distributed Comput., 2019

Batch Solution of Small PDEs with the OPS DSL.
Proceedings of the High Performance Computing, 2019

Performance Portability of Multi-Material Kernels.
Proceedings of the 2019 IEEE/ACM International Workshop on Performance, 2019

PPCU Sam: Open-source face recognition framework.
Proceedings of the Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 23rd International Conference KES-2019, 2019

GPU Support for Automatic Generation of Finite-Differences Stencil Kernels.
Proceedings of the High Performance Computing - 6th Latin American Conference, 2019

2018
Loop Tiling in Large-Scale Stencil Codes at Run-Time with OPS.
IEEE Trans. Parallel Distributed Syst., 2018

Improving Locality of Unstructured Mesh Algorithms on GPUs.
CoRR, 2018

2017
Beyond 16GB: Out-of-Core Stencil Computations.
Proceedings of the Workshop on Memory Centric Programming for HPC, 2017

Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2017

Achieving Performance Portability for a Heat Conduction Solver Mini-Application on Modern Multi-core Systems.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016
Acceleration of a Full-Scale Industrial CFD Application with OP2.
IEEE Trans. Parallel Distributed Syst., 2016

Vectorizing unstructured mesh computations for many-core architectures.
Concurr. Comput. Pract. Exp., 2016

High Performance Computing on the IBM Power8 Platform.
Proceedings of the High Performance Computing, 2016

Auto-vectorizing a large-scale production unstructured-mesh CFD application.
Proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing, 2016

2015
AmgX: A Library for GPU Accelerated Algebraic Multigrid and Preconditioned Iterative Methods.
SIAM J. Sci. Comput., 2015

A comparison between parallelization approaches in molecular dynamics simulations on GPUs.
J. Comput. Chem., 2015

Finite Element Algorithms and Data Structures on Graphical Processing Units.
Int. J. Parallel Program., 2015

Analysis of parallel processor architectures for the solution of the Black-Scholes PDE.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Design and Development of Domain Specific Active Libraries with Proxy Applications.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Benchmarking the IBM Power8 processor.
Proceedings of 25th Annual International Conference on Computer Science and Software Engineering, 2015

2014
Abstraction and implementation of unstructured grid algorithms on massively parallel heterogeneous architectures.
PhD thesis, 2014

The OPS domain specific abstraction for multi-block structured grid computations.
Proceedings of the Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, 2014

Performance Analysis of a High-Level Abstractions-Based Hydrocode on Future Computing Systems.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

GPU implementation of finite difference solvers.
Proceedings of the 7th Workshop on High Performance Computational Finance, 2014

2013
Design and initial performance of a high-level unstructured mesh framework on heterogeneous parallel systems.
Parallel Comput., 2013

Designing OP2 for GPU architectures.
J. Parallel Distributed Comput., 2013

2012
An Analytical Study of Loop Tiling for a Large-Scale Unstructured Mesh Application.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

