Goran Flegar

ORCID: 0000-0002-4154-0420

According to our database, Goran Flegar authored at least 25 papers between 2017 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2022
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing.
ACM Trans. Math. Softw., 2022

2021
Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software.
ACM Trans. Math. Softw., 2021

Crediting pull requests to open source research software as an academic contribution.
J. Comput. Sci., 2021

2020
Acceleration of PageRank with Customized Precision Based on Mantissa Segmentation.
ACM Trans. Parallel Comput., 2020

Load-balancing Sparse Matrix Vector Product Kernels on GPUs.
ACM Trans. Parallel Comput., 2020

Ginkgo: A high performance numerical linear algebra library.
J. Open Source Softw., 2020

A customized precision format based on mantissa segmentation for accelerating sparse linear algebra.
Concurr. Comput. Pract. Exp., 2020

Multiprecision Block-Jacobi for Iterative Triangular Solves.
Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019
Sparse Linear System Solvers on GPUs: Parallel Preconditioning, Workload Balancing, and Communication Reduction.
PhD thesis, 2019

FloatX: A C++ Library for Customized Floating-Point Arithmetic.
ACM Trans. Math. Softw., 2019

Variable-size batched Gauss-Jordan elimination for block-Jacobi preconditioning on graphics processors.
Parallel Comput., 2019

Toward a modular precision ecosystem for high-performance computing.
Int. J. High Perform. Comput. Appl., 2019

Adaptive precision in block-Jacobi preconditioning for iterative sparse linear system solvers.
Concurr. Comput. Pract. Exp., 2019

Towards Continuous Benchmarking: An Automated Performance Evaluation Framework for High Performance Software.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2019

ParILUT - A Parallel Threshold ILU for GPUs.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Are we Doing the Right Thing? - A Critical Analysis of the Academic HPC Community.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

2018
Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear Systems.
Proceedings of the High Performance Computing, 2018

Variable-Size Batched Condition Number Calculation on GPUs.
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

2017
Communication in task-parallel ILU-preconditioned CG solvers using MPI + OmpSs.
Concurr. Comput. Pract. Exp., 2017

Overcoming Load Imbalance for Irregular Sparse Matrices.
Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

Flexible batched sparse matrix-vector product on GPUs.
Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2017

Batched Gauss-Jordan Elimination for Block-Jacobi Preconditioner Generation on GPUs.
Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores, 2017

Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning.
Proceedings of the 46th International Conference on Parallel Processing, 2017

Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning.
Proceedings of the International Conference on Computational Science, 2017

Balanced CSR Sparse Matrix-Vector Product on Graphics Processors.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

