Thorsten Kurth

According to our database1, Thorsten Kurth authored at least 33 papers between 2016 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2020
IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads.
CoRR, 2020

Using Machine Learning to Augment Coarse-Grid Computational Fluid Dynamics Simulations.
CoRR, 2020

Hierarchical Roofline Performance Analysis for Deep Learning Applications.
CoRR, 2020

Hierarchical Roofline analysis for GPUs: Accelerating performance optimization for the NERSC-9 Perlmutter system.
Concurr. Comput. Pract. Exp., 2020

Time-Based Roofline for Deep Learning Performance Analysis.
Proceedings of the Fourth IEEE/ACM Workshop on Deep Learning on Supercomputers, 2020

2019
Highly-scalable, physics-informed GANs for learning solutions of stochastic PDEs.
CoRR, 2019

TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML.
Concurr. Comput. Pract. Exp., 2019

Eigensolver performance comparison on Cray XC systems.
Concurr. Comput. Pract. Exp., 2019

Acceleration of Scientific Deep Learning Models on Heterogeneous Computing Platform with Intel<sup>®</sup> FPGAs.
Proceedings of the High Performance Computing, 2019

Highly-Ccalable, Physics-Informed GANs for Learning Solutions of Stochastic PDEs.
Proceedings of the Third IEEE/ACM Workshop on Deep Learning on Supercomputers, 2019

Performance Portability of a Wilson Dslash Stencil Operator Mini-App Using Kokkos and SYCL.
Proceedings of the 2019 IEEE/ACM International Workshop on Performance, 2019

PCS: A Productive Computational Science Platform.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

2018
A per-cent-level determination of the nucleon axial coupling from quantum chromodynamics.
Nat., 2018

Simulating the weak death of the neutron in a femtoscale universe with near-Exascale computing.
CoRR, 2018

Preparing NERSC users for Cori, a Cray XC40 system with Intel many integrated cores.
Concurr. Comput. Pract. Exp., 2018

Lessons Learned from Optimizing Kernels for Adaptive Aggregation Multi-grid Solvers in Lattice QCD.
Proceedings of the High Performance Computing, 2018

Sparse CSB_Coo Matrix-Vector and Matrix-Matrix Performance on Intel Xeon Architectures.
Proceedings of the High Performance Computing, 2018

Exascale deep learning for climate analytics.
Proceedings of the International Conference for High Performance Computing, 2018

A Case Study for Performance Portability Using OpenMP 4.5.
Proceedings of the Accelerator Programming Using Directives - 5th International Workshop, 2018

Simulating the <i>weak</i> death of the Neutron in a femtoscale universe with near-exascale computing.
Proceedings of the International Conference for High Performance Computing, 2018

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity.
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

2017
Improved treatment of exact exchange in Quantum ESPRESSO.
Comput. Phys. Commun., 2017

Scaling GRPC Tensorflow on 512 nodes of Cori Supercomputer.
CoRR, 2017

Deep Neural Networks for Physics Analysis on low-level whole-detector data at the LHC.
CoRR, 2017

Analyzing Performance of Selected NESAP Applications on the Cori HPC System.
Proceedings of the High Performance Computing, 2017

Performance Variability on Xeon Phi.
Proceedings of the High Performance Computing, 2017

Deep learning at 15PF: supervised and semi-supervised classification for scientific data.
Proceedings of the International Conference for High Performance Computing, 2017

2016
Optimization of the Sparse Matrix-Vector Products of an IDR Krylov Iterative Solver in EMGeo for the Intel KNL Manycore Processor.
Proceedings of the High Performance Computing, 2016

Optimizing Wilson-Dirac Operator and Linear Solvers for Intel® KNL.
Proceedings of the High Performance Computing, 2016

Applying the Roofline Performance Model to the Intel Xeon Phi Knights Landing Processor.
Proceedings of the High Performance Computing, 2016


MPI usage at NERSC: Present and Future.
Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

OpenMP Parallelization and Optimization of Graph-Based Machine Learning Algorithms.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016


  Loading...