Terry Cojean

Orcid: 0000-0002-1560-921X

Affiliations:
  • Karlsruhe Institute of Technology, Germany


According to our database1, Terry Cojean authored at least 25 papers between 2016 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Providing performance portable numerics for Intel GPUs.
Concurr. Comput. Pract. Exp., 2023

2022
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing.
ACM Trans. Math. Softw., 2022

Ginkgo - A math library designed for platform portability.
Parallel Comput., 2022

2021
Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software.
ACM Trans. Math. Softw., 2021

Evaluating asynchronous Schwarz solvers on GPUs.
Int. J. High Perform. Comput. Appl., 2021

A survey of numerical linear algebra methods utilizing mixed-precision arithmetic.
Int. J. High Perform. Comput. Appl., 2021

Porting a sparse linear algebra math library to Intel GPUs.
CoRR, 2021

Porting Sparse Linear Algebra to Intel GPUs.
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021

2020
Acceleration of PageRank with Customized Precision Based on Mantissa Segmentation.
ACM Trans. Parallel Comput., 2020

Load-balancing Sparse Matrix Vector Product Kernels on GPUs.
ACM Trans. Parallel Comput., 2020

Ginkgo: A high performance numerical linear algebra library.
J. Open Source Softw., 2020

Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse Linear Algebra Computations.
CoRR, 2020

A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic.
CoRR, 2020

Evaluating Abstract Asynchronous Schwarz solvers.
CoRR, 2020

A customized precision format based on mantissa segmentation for accelerating sparse linear algebra.
Concurr. Comput. Pract. Exp., 2020

Sparse Linear Algebra on AMD and NVIDIA GPUs - The Race Is On.
Proceedings of the High Performance Computing - 35th International Conference, 2020

Two-stage Asynchronous Iterative Solvers for multi-GPU Clusters.
Proceedings of the 11th IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2020

Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations.
Proceedings of the 2020 IEEE/ACM Performance Modeling, 2020

Preparing Ginkgo for AMD GPUs - A Testimonial on Porting CUDA Code to HIP.
Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020

Multiprecision Block-Jacobi for Iterative Triangular Solves.
Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019
Resource aggregation for task-based Cholesky Factorization on top of modern architectures.
Parallel Comput., 2019

Towards Continuous Benchmarking: An Automated Performance Evaluation Framework for High Performance Software.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2019

2018
Programmation des architectures hétérogènes à l'aide de tâches divisibles ou modulables. (Programmation of heterogeneous architectures using moldable tasks).
PhD thesis, 2018

2016
Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources.
Proceedings of the 23rd IEEE International Conference on High Performance Computing, 2016

Resource Aggregation for Task-Based Cholesky Factorization on Top of Heterogeneous Machines.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016


  Loading...