Hartwig Anzt

According to our database1, Hartwig Anzt authored at least 49 papers between 2010 and 2018.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2018
Incomplete Sparse Approximate Inverses for Parallel Preconditioning.
Parallel Computing, 2018

Using Jacobi iterations and blocking for solving sparse triangular systems in incomplete factorization preconditioning.
J. Parallel Distrib. Comput., 2018

Optimization and performance evaluation of the IDR iterative Krylov solver on GPUs.
IJHPCA, 2018

2017
Preconditioned Krylov solvers on GPUs.
Parallel Computing, 2017

On the performance and energy efficiency of sparse linear algebra on GPUs.
IJHPCA, 2017

With Extreme Computing, the Rules Have Changed.
Computing in Science and Engineering, 2017

Overcoming Load Imbalance for Irregular Sparse Matrices.
Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

Flexible batched sparse matrix-vector product on GPUs.
Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2017

Batched Gauss-Jordan Elimination for Block-Jacobi Preconditioner Generation on GPUs.
Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores, 2017

Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning.
Proceedings of the 46th International Conference on Parallel Processing, 2017

Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning.
Proceedings of the International Conference on Computational Science, 2017

Bringing High Performance Computing to Big Data Algorithms.
Proceedings of the Handbook of Big Data Technologies, 2017

2016
Domain Overlap for Iterative Sparse Triangular Solves on GPUs.
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs.
IEEE Trans. Parallel Distrib. Syst., 2016

Updating incomplete factorization preconditioners for model order reduction.
Numerical Algorithms, 2016

Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016

Batched Generation of Incomplete Sparse Approximate Inverses on GPUs.
Proceedings of the 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2016

Heterogeneous Streaming.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Efficiency of General Krylov Methods on GPUs - An Experimental Study.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015
Acceleration of GPU-based Krylov solvers via data transfer reduction.
IJHPCA, 2015

Experiences in autotuning matrix multiplication for energy minimization on GPUs.
Concurrency and Computation: Practice and Experience, 2015

Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processors.
Concurrency and Computation: Practice and Experience, 2015

Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs.
Proceedings of the High Performance Computing - 30th International Conference, 2015

Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product.
Proceedings of the Symposium on High Performance Computing, 2015

GPU-accelerated co-design of induced dimension reduction: algorithmic fusion and kernel overlap.
Proceedings of the 2nd International Workshop on Hardware-Software Co-Design for High Performance Computing, 2015

Tuning stationary iterative solvers for fault resilience.
Proceedings of the 6th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2015

Adaptive precision solvers for sparse linear systems.
Proceedings of the 3rd International Workshop on Energy Efficient Supercomputing, 2015

Energy efficiency and performance frontiers for sparse computations on GPU supercomputers.
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

Iterative Sparse Triangular Solves for Preconditioning.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

Accelerating collaborative filtering using concepts from high performance computing.
Proceedings of the 2015 IEEE International Conference on Big Data, 2015

2014
A unified energy footprint for simulation software.
Computer Science - R&D, 2014

Self-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30, 2014

Improving the Performance of CA-GMRES on Multicores with Multiple GPUs.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Hybrid Multi-elimination ILU Preconditioners on GPUs.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Optimizing Krylov Subspace Solvers on Graphics Processing Units.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013
A block-asynchronous relaxation method for graphics processing units.
J. Parallel Distrib. Comput., 2013

Performance and Energy Analysis of the Iterative Solution of Sparse Linear Systems on Multicore and Manycore Architectures.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

Reformulated Conjugate Gradient for the Energy-Aware Solution of Linear Systems on GPUs.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

2012
Block-asynchronous Multigrid Smoothers for GPU-accelerated Systems.
Proceedings of the International Conference on Computational Science, 2012

Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors.
Computer Science - R&D, 2012

A Block-Asynchronous Relaxation Method for Graphics Processing Units.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Weighted Block-Asynchronous Iteration on GPU-Accelerated Systems.
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011

Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms.
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011

2010
Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms - An evaluation of different solver and hardware configurations.
Computer Science - R&D, 2010

An Error Correction Solver for Linear Systems: Evaluation of Mixed Precision Implementations.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

Mixed Precision Iterative Refinement Methods for Linear Systems: Convergence Analysis Based on Krylov Subspace Methods.
Proceedings of the Applied Parallel and Scientific Computing, 2010


  Loading...