Harald Köstler

Orcid: 0000-0002-6992-2690

Affiliations:
  • University of Erlangen-Nuremberg, Germany


According to our database1, Harald Köstler authored at least 97 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Towards Code Generation for Octree-Based Multigrid Solvers.
CoRR, 2024

A Continuous Benchmarking Infrastructure for High-Performance Computing Applications.
CoRR, 2024

waLBerla-wind: a lattice-Boltzmann-based high-performance flow solver for wind energy applications.
CoRR, 2024

2023
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms.
Future Gener. Comput. Syst., December, 2023

Detectron2 for Lesion Detection in Diabetic Retinopathy.
Algorithms, March, 2023

p-adaptive discontinuous Galerkin method for the shallow water equations on heterogeneous computing architectures.
CoRR, 2023

Efficient and scalable hybrid fluid-particle simulations with geometrically resolved particles on heterogeneous CPU-GPU architectures.
CoRR, 2023

MD-Bench: Engineering the in-core performance of short-range molecular dynamics kernels from state-of-the-art simulation packages.
CoRR, 2023

Shallow Water DG Simulations on FPGAs: Design and Comparison of a Novel Code Generation Pipeline.
Proceedings of the High Performance Computing - 38th International Conference, 2023

AI Driven Near Real-time Locational Marginal Pricing Method: A Feasibility and Robustness Study.
Proceedings of the IEEE PES Innovative Smart Grid Technologies Europe, 2023

Generating Coupling Interfaces for Multiphysics Simulations with ExaStencils and waLBerla.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Evolving Nonlinear Multigrid Methods With Grammar-Guided Genetic Programming.
Proceedings of the Companion Proceedings of the Conference on Genetic and Evolutionary Computation, 2023

2022
PiSCAT: A Python Package for Interferometric Scattering Microscopy.
J. Open Source Softw., 2022

Genetic programming for iterative numerical methods.
Genet. Program. Evolvable Mach., 2022

MD-Bench: A Generic Proxy-App Toolbox for State-of-the-Art Molecular Dynamics Algorithms.
Proceedings of the Parallel Processing and Applied Mathematics, 2022

Evolving generalizable multigrid-based helmholtz preconditioners with grammar-guided genetic programming.
Proceedings of the GECCO '22: Genetic and Evolutionary Computation Conference, Boston, Massachusetts, USA, July 9, 2022

Closing the Performance Gap Between Lisp and C.
Proceedings of the 15th European Lisp Symposium, 2022

2021
On revisiting energy and performance in microservices applications: A cloud elasticity-driven approach.
Parallel Comput., 2021

tinyMD: Mapping molecular dynamics simulations to heterogeneous hardware using partial evaluation.
J. Comput. Sci., 2021

lbmpy: Automatic code generation for efficient parallel lattice Boltzmann methods.
J. Comput. Sci., 2021

Highly efficient lattice Boltzmann multiphase simulations of immiscible fluids at high-density ratios on CPUs and GPUs through code generation.
Int. J. High Perform. Comput. Appl., 2021

EvoStencils: a grammar-based genetic programming approach for constructing efficient geometric multigrid methods.
Genet. Program. Evolvable Mach., 2021

Deep Learning for Real-Time Aerodynamic Evaluations of Arbitrary Vehicle Shapes.
CoRR, 2021

Known Operator Learning and Hybrid Machine Learning in Medical Imaging - A Review of the Past, the Present, and the Future.
CoRR, 2021

waLBerla: A block-structured high-performance framework for multiphysics simulations.
Comput. Math. Appl., 2021

2020

tinyMD: A Portable and Scalable Implementation for Pairwise Interactions Simulations.
CoRR, 2020

Quantum simulation and circuit design for solving multidimensional Poisson equations.
CoRR, 2020

lbmpy: A flexible code generation toolkit for highly efficient lattice Boltzmann simulations.
CoRR, 2020

Constructing efficient multigrid solvers with genetic programming.
Proceedings of the GECCO '20: Genetic and Evolutionary Computation Conference, 2020

2019
A scalable and extensible checkpointing scheme for massively parallel simulations.
Int. J. High Perform. Comput. Appl., 2019

Optimizing Geometric Multigrid Methods with Evolutionary Computation.
CoRR, 2019

Towards whole program generation of quadrature-free discontinuous Galerkin methods for the shallow water equations.
CoRR, 2019

Code generation for massively parallel phase-field simulations.
Proceedings of the International Conference for High Performance Computing, 2019

Unified Generation of DG-Kernels for Different HPC Frameworks.
Proceedings of the Parallel Computing: Technology Trends, 2019

2018
Reconfigurable Hardware Generation of Multigrid Solvers with Conjugate Gradient Coarse-Grid Solution.
Parallel Process. Lett., 2018

Automatic Data Layout Transformations in the ExaStencils Code Generator.
Parallel Process. Lett., 2018

Petalisp: run time code generation for operations on strided arrays.
Proceedings of the 5th ACM SIGPLAN International Workshop on Libraries, 2018

Knowledge Amalgamation for Computational Science and Engineering.
Proceedings of the Intelligent Computer Mathematics - 11th International Conference, 2018

Unified Code Generation for the Parallel Computation of Pairwise Interactions Using Partial Evaluation.
Proceedings of the 17th International Symposium on Parallel and Distributed Computing, 2018

Whole Program Generation of Massively Parallel Shallow Water Equation Solvers.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017
A matrix-free approach to efficient affine-linear image registration on CPU and GPU.
J. Real Time Image Process., 2017

A Scala prototype to generate multigrid solver implementations for different problems and target multi-core platforms.
Int. J. Comput. Sci. Eng., 2017

Lattice Boltzmann Benchmark Kernels as a Testbed for Performance Analysis.
CoRR, 2017

Towards generating efficient flow solvers with the ExaStencils approach.
Concurr. Comput. Pract. Exp., 2017

Genetic programming meets linear algebra: how genetic programming can be used to find improved iterative numerical methods.
Proceedings of the Genetic and Evolutionary Computation Conference, 2017

2016
Systems of Partial Differential Equations in ExaSlang.
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

Performance Prediction of Multigrid-Solver Configurations.
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

A Python extension for the massively parallel multiphysics simulation framework waLBerla.
Int. J. Parallel Emergent Distributed Syst., 2016

Performance engineering to achieve real-time high dynamic range imaging.
J. Real Time Image Process., 2016

When do microswimmers exit the Stokes regime?
CoRR, 2016

Automatic Generation of Massively Parallel Codes from ExaSlang.
Comput., 2016

A multi-objective genetic algorithm for simulating optimal fights in StarCraft II.
Proceedings of the IEEE Conference on Computational Intelligence and Games, 2016

2015
A Gauss-Seidel Iteration Scheme for Reference-Free 3-D Histological Image Reconstruction.
IEEE Trans. Medical Imaging, 2015

Comparison Study for Whitney (Raviart-Thomas)-Type Source Models in Finite-Element-Method-Based EEG Forward Modeling.
IEEE Trans. Biomed. Eng., 2015

Performance modeling and analysis of heterogeneous lattice Boltzmann simulations on CPU-GPU clusters.
Parallel Comput., 2015

Tsunami and Storm Surge Simulation Using Low Power Architectures - Concept and Evaluation.
Proceedings of the SIMULTECH 2015 - Proceedings of the 5th International Conference on Simulation and Modeling Methodologies, Technologies and Applications, Colmar, Alsace, France, 21, 2015

Massively parallel phase-field simulations for ternary eutectic directional solidification.
Proceedings of the International Conference for High Performance Computing, 2015

Potential-Field-Based Unit Behavior Optimization for Balancing in StarCraft II.
Proceedings of the Genetic and Evolutionary Computation Conference, 2015

2014
Guest Editors' Note: Special Issue On High-Performance Stencil Computations.
Parallel Process. Lett., 2014

Experiments on Optimizing the Performance of Stencil Codes with SPL Conqueror.
Parallel Process. Lett., 2014

Towards a performance-portable description of geometric multigrid algorithms using a domain-specific language.
J. Parallel Distributed Comput., 2014

Real-time simulation of temperature in hot rolling rolls.
J. Comput. Sci., 2014

A Scala Prototype to Generate Multigrid Solver Implementations for Different Problems and Target Multi-Core Platforms.
CoRR, 2014

Parallel multigrid on hierarchical hybrid grids: a performance study on current high performance computing clusters.
Concurr. Comput. Pract. Exp., 2014

ExaSlang: a domain-specific language for highly scalable multigrid solvers.
Proceedings of the Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, 2014

An Evaluation of Domain-Specific Language Technologies for Code Generation.
Proceedings of the 2014 14th International Conference on Computational Science and Its Applications, Guimaraes, Portugal, June 30, 2014

ExaStencils: Advanced Stencil-Code Engineering.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013
Interactive particle dynamics using OpenCL and Kinect.
Int. J. Parallel Emergent Distributed Syst., 2013

A Multi-objective Genetic Algorithm for Build Order Optimization in StarCraft II.
Künstliche Intell., 2013

The CSE Software Challenge - Covering the Complete Stack.
it Inf. Technol., 2013

A framework for hybrid parallel flow simulations with a trillion cells in complex geometries.
Proceedings of the International Conference for High Performance Computing, 2013

Parallel Simulations of Self-propelled Microorganisms.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

A Generic Prototype to Benchmark Algorithms and Data Structures for Hierarchical Hybrid Grids.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

2012
Optimizing Opening Strategies in a Real-time Strategy Game by a Multi-objective Genetic Algorithm.
Proceedings of the Research and Development in Intelligent Systems XXIX, 2012

Towards Domain-Specific Computing for Stencil Codes in HPC.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Estimating Blood Flow Based on 2D Angiographic Image Sequences.
Proceedings of the Bildverarbeitung für die Medizin 2012 - Algorithmen - Systeme, 2012

2011
A flexible Patch-based lattice Boltzmann parallelization approach for heterogeneous GPU-CPU clusters.
Parallel Comput., 2011

WaLBerla: HPC software design for computational engineering simulations.
J. Comput. Sci., 2011

Performance engineering for the Lattice Boltzmann method on GPGPUs: Architectural requirements and performance results
CoRR, 2011

A Geometric Multigrid Solver on Tsubame 2.0.
Proceedings of the Efficient Algorithms for Global Optimization Methods in Computer Vision, 2011

2010
A practical framework for the construction of prolongation operators for multigrid based on canonical basis functions.
Comput. Vis. Sci., 2010

Fast Wavelet Transform Utilizing a Multicore-Aware Framework.
Proceedings of the Applied Parallel and Scientific Computing, 2010

Modeling Multigrid Algorithms for Variational Imaging.
Proceedings of the 21st Australian Software Engineering Conference (ASWEC 2010), 2010

2009
An Orthogonal Matching Pursuit Algorithm for Image Denoising on the Cell Broadband Engine.
Proceedings of the Parallel Processing and Applied Mathematics, 2009

2008
A multigrid framework for variational approaches in medical image processing and computer vision.
PhD thesis, 2008

A fast full multigrid solver for applications in image processing.
Numer. Linear Algebra Appl., 2008

Multigrid solution of the optical flow system using a combined diffusion- and curvature-based regularizer.
Numer. Linear Algebra Appl., 2008

Computer-aided evaluation of anatomical accuracy of image fusion between X-ray CT and SPECT.
Comput. Medical Imaging Graph., 2008

2007
Numerical Mathematics of the Subtraction Method for the Modeling of a Current Dipole in EEG Source Reconstruction Using Finite Element Head Models.
SIAM J. Sci. Comput., 2007

3D optical flow computation using a parallel variational multigrid scheme with application to cardiac C-arm CT motion.
Image Vis. Comput., 2007

Nonlinear Diffusion vs. Wavelet Based Noise Reduction in CT Using Correlation Analysis.
Proceedings of the 12th International Fall Workshop on Vision, Modeling, and Visualization, 2007

2006
An accurate multigrid solver for computing singular solutions of elliptic problems.
Numer. Linear Algebra Appl., 2006

Adaptive variational sinogram interpolation of sparsely sampled CT data.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Teaching the Foundations of Computational Science on the Undergraduate Level.
Proceedings of the Computational Science, 2006

2005
High Performance Computing Education for Students in Computational Engineering.
Proceedings of the Computational Science, 2005

2004
Extrapolation Techniques for Computing Accurate Solutions of Elliptic Problems with Singular Solutions.
Proceedings of the Computational Science, 2004


  Loading...