Matthias S. Müller

According to our database1, Matthias S. Müller
  • authored at least 120 papers between 1999 and 2017.
  • has a "Dijkstra number"2 of three.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepage:

On csauthors.net:

Bibliography

2017
Assessing the Performance of OpenMP Programs on the Knights Landing Architecture.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

OpenMP Tools Interface: Synchronization Information for Data Race Detection.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

A Pattern for Overlapping Communication and Computation with OpenMP ^* Target Directives.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

2016
Editorial for the special issue on energy-aware high performance computing.
Computer Science - R&D, 2016

Editorial for the special issue on Energy-aware high performance computing.
Computer Science - R&D, 2016

Software Cost Analysis of GPU-Accelerated Aeroacoustics Simulations in C++ with OpenACC.
Proceedings of the High Performance Computing, 2016

From Describing to Prescribing Parallelism: Translating the SPEC ACCEL OpenACC Suite to OpenMP Target Directives.
Proceedings of the High Performance Computing, 2016

Development effort estimation in HPC.
Proceedings of the International Conference for High Performance Computing, 2016

Using Directed Variance to Identify Meaningful Views in Call-Path Performance Profiles.
Proceedings of the Third Workshop on Visual Performance Analysis, 2016

Correlating sub-phenomena in performance data in the frequency domain.
Proceedings of the 6th IEEE Symposium on Large Data Analysis and Visualization, 2016

Visualizing Performance Data with Respect to the Simulated Geometry.
Proceedings of the High-Performance Scientific Computing, 2016

Performance Optimization of Parallel Applications in Diverse On-Demand Development Teams.
Proceedings of the High-Performance Scientific Computing, 2016

NUMA-Aware Task Performance Analysis.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

Testing Infrastructure for OpenMP Debugging Interface Implementations.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

ARCHER: Effectively Spotting Data Races in Large OpenMP Applications.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

An OpenMP Epoch Model for Correctness Checking.
Proceedings of the 45th International Conference on Parallel Processing Workshops, 2016

The Scientific Programming Integrated Degree Program - A Pioneering Approach to join Theory and Practice.
Proceedings of the International Conference on Computational Science 2016, 2016

2015
Editorial for the fifth international conference on energy-aware high performance computing.
Computer Science - R&D, 2015

Modeling the Productivity of HPC Systems on a Computing Center Scale.
Proceedings of the High Performance Computing - 30th International Conference, 2015

Effective communication for a system of cluster-on-a-chip processors.
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

Evaluating OpenMP Performance on Thousands of Cores on the Numascale Architecture.
Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Evaluating the Energy Consumption of OpenMP Applications on Haswell Processors.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Lessons Learned from Implementing OMPD: A Debugging Interface for OpenMP.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Performance Analysis for Target Devices with the OpenMP Tools Interface.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Event-Action Mappings for Parallel Tools Infrastructures.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014
Towards an accurate simulation of the crystallisation process in injection moulded plastic components by hybrid parallelisation.
IJHPCA, 2014

Editorial for the Fourth International Conference on Energy-Aware High Performance Computing.
Computer Science - R&D, 2014

Visualization of memory access behavior on hierarchical NUMA architectures.
Proceedings of the First Workshop on Visual Performance Analysis, 2014

Towards providing low-overhead data race detection for large OpenMP applications.
Proceedings of the 2014 LLVM Compiler Infrastructure in HPC, 2014

SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

An OpenMP Extension Library for Memory Affinity.
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

Classification of Common Errors in OpenMP Applications.
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

MPI Runtime Error Detection with MUST: A Scalable and Crash-Safe Approach.
Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

A Pattern-Based Comparison of OpenACC and OpenMP for Accelerator Computing.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

Analysis of Parallel Applications on a High Performance-Low Energy Computer.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

Memory Usage Optimizations for Online Event Analysis.
Proceedings of the Solving Software Challenges for Exascale, 2014

2013
MPI runtime error detection with MUST: Advances in deadlock detection.
Scientific Programming, 2013

Performance and quality of service of data and video movement over a 100 Gbps testbed.
Future Generation Comp. Syst., 2013

Accelerators for Technical Computing: Is It Worth the Pain? A TCO Perspective.
Proceedings of the Supercomputing - 28th International Supercomputing Conference, 2013

Distributed wait state tracking for runtime MPI deadlock detection.
Proceedings of the International Conference for High Performance Computing, 2013

Runtime MPI collective checking with tree-based overlay networks.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

Suitability of Performance Tools for OpenMP Task-Parallel Programs.
Proceedings of the Tools for High Performance Computing 2013, 2013

Towards a Performance Engineering Workflow for OpenMP 4.0.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

Performance Characteristics of Large SMP Machines.
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

Accelerators, quo vadis? Performance vs. productivity.
Proceedings of the International Conference on High Performance Computing & Simulation, 2013

Intralayer Communication for Tree-Based Overlay Networks.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

Assessing the Performance of OpenMP Programs on the Intel Xeon Phi.
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

2012
MPI runtime error detection with MUST: advances in deadlock detection.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

MPI Runtime Error Detection with MUST: Advanced Error Reports.
Proceedings of the Tools for High Performance Computing 2012, 2012

SPEC OMP2012 - An Application Benchmark Suite for Parallel Systems Using OpenMP.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

Holistic Debugging of MPI Derived Datatypes.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

HIPS Introduction.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

GTI: A Generic Tools Infrastructure for Event-Based Tools in Parallel Systems.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011
SPEC Benchmarks.
Proceedings of the Encyclopedia of Parallel Computing, 2011

Trace-based performance analysis for the petascale simulation code FLASH.
IJHPCA, 2011

The International Exascale Software Project roadmap.
IJHPCA, 2011

Order Preserving Event Aggregation in TBONs.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Memory Performance and SPEC OpenMP Scalability on Quad-Socket x86_64 Systems.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2011

2010
A generic attribute extension to OTF and its use for MPI replay.
Proceedings of the International Conference on Computational Science, 2010

Guest Editors' Introduction.
International Journal of Parallel Programming, 2010

Quantifying power consumption variations of HPC systems using SPEC MPI benchmarks.
Computer Science - R&D, 2010

Implementation, performance, and science results from a 30.7 TFLOPS IBM BladeCenter cluster.
Concurrency and Computation: Practice and Experience, 2010

Preface.
Concurrency and Computation: Practice and Experience, 2010

SPEC MPI2007 - an application benchmark suite for parallel systems using MPI.
Concurrency and Computation: Practice and Experience, 2010

Highly Scalable Dynamic Load Balancing in the Atmospheric Modeling System COSMO-SPECS+FD4.
Proceedings of the Applied Parallel and Scientific Computing, 2010

Characterizing the energy consumption of data transfers and arithmetic operations on x86-64 processors.
Proceedings of the International Green Computing Conference 2010, 2010

PROPER 2010: Third Workshop on Productivity and Performance - Tools for HPC Application Development.
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

2009
MPI Correctness Checking for OpenMP/MPI Applications.
International Journal of Parallel Programming, 2009

Performance at Exascale.
IJHPCA, 2009

Tools for scalable parallel program analysis: Vampir NG, MARMOT, and DeWiz.
IJCSE, 2009

MUST: A Scalable Approach to Runtime Error Detection in MPI Programs.
Proceedings of the Tools for High Performance Computing 2009, 2009

A framework for detailed multiphase cloud modeling on HPC systems.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

An Interface for Integrated MPI Correctness Checking.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

GeneIndex: An Open Source Parallel Program for Enumerating and Locating Words in a Genome.
Proceedings of the International Joint Conferences on Bioinformatics, 2009

A graph based approach for MPI deadlock detection.
Proceedings of the 23rd international conference on Supercomputing, 2009

PROPER 2009: Workshop on Productivity and Performance - Tools for HPC Application Development.
Proceedings of the Euro-Par 2009, 2009

Pattern Matching and I/O Replay for POSIX I/O in Parallel Programs.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

Memory Performance and Cache Coherency Effects on an Intel Nehalem Multiprocessor System.
Proceedings of the PACT 2009, 2009

2008
Performance evaluation of supercomputers using HPCC and IMB Benchmarks.
J. Comput. Syst. Sci., 2008

Internal Timer Synchronization for Parallel Event Tracing.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

MPI Correctness Checking with Marmot.
Proceedings of the Tools for High Performance Computing, 2008

The Vampir Performance Analysis Tool-Set.
Proceedings of the Tools for High Performance Computing, 2008

Detection of Violations to the MPI Standard in Hybrid OpenMP/MPI Applications.
Proceedings of the OpenMP in a New Era of Parallelism, 4th International Workshop, 2008

Workshop on Productivity and Performance (PROPER 2008).
Proceedings of the Euro-Par 2008 Workshops, 2008

Trace-Based Analysis and Optimization for the Semtex CFD Application - Hidden Remote Memory Accesses and I/O Performance.
Proceedings of the Euro-Par 2008 Workshops, 2008

2007
Introduction.
International Journal of Parallel Programming, 2007

Special Issue on OpenMP - Guest Editors' Introduction.
International Journal of Parallel Programming, 2007

Scalability and Usability of HPC Programming Tools.
Proceedings of the Parallel Computing: Architectures, 2007

Developing Scalable Applications with Vampir, VampirServer and VampirTrace.
Proceedings of the Parallel Computing: Architectures, 2007

Analyzing Mutual Influences of High Performance Computing Programs on SGI Altix 3700 and 4700 Systems with PARbench.
Proceedings of the Parallel Computing: Architectures, 2007

Memory Allocation Tracing with VampirTrace.
Proceedings of the Computational Science - ICCS 2007, 7th International Conference, Beijing, China, May 27, 2007

Quality Assurance for Clusters: Acceptance-, Stress-, and Burn-In Tests for General Purpose Clusters.
Proceedings of the High Performance Computing and Communications, 2007

I/O Induced Scalability Limits of Bioinformatics Applications.
Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering, 2007

2006
Performance evaluation of supercomputers using HPCC and IMB benchmarks.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Progress Towards Petascale Applications in Biology: Status in 2006.
Proceedings of the Euro-Par 2006 Workshops: Parallel Processing, 2006

High Throughput Image Analysis on PetaFLOPS Systems.
Proceedings of the Euro-Par 2006 Workshops: Parallel Processing, 2006

2005
The Grid.
it - Information Technology, 2005

Network Bandwidth Measurements and Ratio Analysis with the HPC Challenge Benchmark Suite (HPCC).
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

MPI Application Development with MARMOT.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

SPEC OpenMP Benchmarks on Four Generations of NEC SX Parallel Vector Systems.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

2004
SPEC HPG benchmarks for high-performance systems.
IJHPCN, 2004

The emerging role of biogrids.
Commun. ACM, 2004

MPI I/O Analysis and Error Detection with MARMOT.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

MPI Application Development Using the Analysis Tool MARMOT.
Proceedings of the Computational Science, 2004

A Global Grid for Analysis of Arthropod Evolution.
Proceedings of the 5th International Workshop on Grid Computing (GRID 2004), 2004

2003
An OpenMP compiler benchmark.
Scientific Programming, 2003

Towards Efficient Execution of MPI Applications on the Grid: Porting and Optimization Issues.
J. Grid Comput., 2003

MARMOT: An MPI Analysis and Checking Tool.
Proceedings of the Parallel Computing: Software Technology, 2003

SPEC HPG Benchmarks for Large Systems.
Proceedings of the High Performance Computing, 5th International Symposium, 2003

Software Development in the Grid: The DAMIEN Tool-Set.
Proceedings of the Computational Science - ICCS 2003, 2003

Performance Analysis of a Parallel Application in the GRID.
Proceedings of the Computational Science - ICCS 2003, 2003

Performance Prediction in a Grid Environment.
Proceedings of the Grid Computing, 2003

Grid enabled MPI solutions for Clusters.
Proceedings of the 3rd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2003), 2003

2002
A software development environment for Grid computing.
Concurrency and Computation: Practice and Experience, 2002

Experiences Using OpenMP Based on Compiler Directed Software DSM on a PC Cluster.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2002

A Shared Memory Benchmark in OpenMP.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

2001
Metacomputing across intercontinental networks.
Future Generation Comp. Syst., 2001

Some Simple OpenMP Optimization Techniques.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2001

2000
The Problems and the Solutions of the Metacomputing Experiment in SC99.
Proceedings of the High-Performance Computing and Networking, 8th International Conference, 2000

1999
Parallel / High-Performance Object-Oriented Scientific Computing.
Proceedings of the Object-Oriented Technology, ECOOP'99 Workshop Reader, 1999


  Loading...