David Böhme

Orcid: 0000-0002-4159-1519

According to our database1, David Böhme authored at least 37 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
MPI Implementation Profiling for Better Application Performance.
CoRR, 2024

Empirical Study of Molecular Dynamics Workflow Data Movement: DYAD vs. Traditional I/O Systems.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Non-Blocking GPU-CPU Notifications to Enable More GPU-CPU Parallelism.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2024

A Mechanism to Generate Interception Based Tools for HPC Libraries.
Proceedings of the Euro-Par 2024: Parallel Processing, 2024

2023
Thicket: Seeing the Performance Experiment Forest for the Individual Run Trees.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

2021
Ubiquitous Performance Analysis.
Proceedings of the High Performance Computing - 36th International Conference, 2021

Did the GPU obfuscate the load imbalance in my MPI simulation?
Proceedings of the IEEE/ACM International Workshop on Hierarchical Parallelism for Exascale Computing, 2021

2020
CodeSeer: input-dependent code variants selection via machine learning.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

2019

The Case for a Common Instrumentation Interface for HPC Codes.
Proceedings of the IEEE/ACM International Workshop on Programming and Performance Visualization Tools, 2019

FuncyTuner: Auto-tuning Scientific Applications With Per-loop Compilation.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
Scalasca analysis report of the ASCI Sweep3D benchmark on 294,912 processes in virtual-node mode on IBM Blue Gene/P with manually annotated iterations.
Dataset, August, 2018

Scalasca analysis report of the ASCI Sweep3D benchmark on 65,536 processes in virtual-node mode on IBM Blue Gene/P.
Dataset, April, 2018

Scalasca analysis report of the ASCI Sweep3D benchmark on 294,912 processes in virtual-node mode on IBM Blue Gene/P.
Dataset, April, 2018

Scalasca analysis report for SPEC MPI.2007 benchmark 132.zeump2 on 512 processes in virtual-node mode on Blue Gene/P.
Dataset, April, 2018

Visual Analytics Challenges in Analyzing Calling Context Trees.
Proceedings of the Programming and Performance Visualization Tools, 2018

2017
Predicting the performance impact of different fat-tree configurations.
Proceedings of the International Conference for High Performance Computing, 2017

Flexible Data Aggregation for Performance Profiling.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016
Identifying the Root Causes of Wait States in Large-Scale Parallel Applications.
ACM Trans. Parallel Comput., 2016

VIPACT: A Visualization Interface for Analyzing Calling Context Trees.
Proceedings of the Third Workshop on Visual Performance Analysis, 2016

Caliper: performance introspection for HPC software stacks.
Proceedings of the International Conference for High Performance Computing, 2016

HIPS Introduction and Committees.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015
Recovering logical structure from Charm++ event traces.
Proceedings of the International Conference for High Performance Computing, 2015

2014
Characterizing Load and Communication Imbalance in Parallel Applications.
PhD thesis, 2014

Catching Idlers with Ease: A Lightweight Wait-State Profiler for MPI Programs.
Proceedings of the 21st European MPI Users' Group Meeting, 2014

2013
Understanding the formation of wait states in applications with one-sided communication.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

2012
Extending Scalasca's Analysis Features.
Proceedings of the Tools for High Performance Computing 2012, 2012

Scalable Critical-Path Based Performance Analysis.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Characterizing Load and Communication Imbalance in Large-Scale Parallel Applications.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

2011
Scalasca.
Proceedings of the Entwicklung und Evolution von Forschungssoftware: Tagungsband des Workshops, 2011

2010
Large-Scale Performance Analysis of Sweep3D with the Scalasca Toolset.
Parallel Process. Lett., 2010

Performance analysis of Sweep3D on Blue Gene/P with the Scalasca toolset.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Identifying the Root Causes of Wait States in Large-Scale Parallel Applications.
Proceedings of the 39th International Conference on Parallel Processing, 2010

2009
Parallel software for retrieval of aerosol distribution from LIDAR data in the framework of EARLINET-ASOS.
Comput. Phys. Commun., 2009

Recent Developments in the Scalasca Toolset.
Proceedings of the Tools for High Performance Computing 2009, 2009

Performance Simulation of Non-blocking Communication in Message-Passing Applications.
Proceedings of the Euro-Par 2009, 2009

2008
Performance Issues of Synchronisation in the MPI-2 One-Sided Communication API.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008


  Loading...