Karl Fürlinger

Proceedings of the Euro-Par 2017: Parallel Processing Workshops, 2017

2016

Tool Support for Developing DASH Applications.

Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

Expressing and Exploiting Multi-Dimensional Locality in DASH.

Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

Online MPI Trace Compression Using Event Flow Graphs and Wavelets.

Proceedings of the International Conference on Computational Science 2016, 2016

DASH: A C++ PGAS Library for Distributed Data Structures and Parallel Algorithms.

Roger Kowalewski

Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

A Multi-dimensional Distributed Array Abstraction for PGAS.

Nasty-MPI: Debugging Synchronization Errors in MPI-3 One-Sided Applications.

Roger Kowalewski

Proceedings of the Euro-Par 2016: Parallel Processing, 2016

2015

DART-MPI: An MPI-based Implementation of a PGAS Runtime System.

CoRR, 2015

DART-CUDA: A PGAS Runtime System for Multi-GPU Systems.

Lei Zhou

Proceedings of the 14th International Symposium on Parallel and Distributed Computing, 2015

Visual MPI Performance Analysis using Event Flow Graphs.

Proceedings of the International Conference on Computational Science, 2015

Automatic On-Line Detection of MPI Application Structure with Event Flow Graphs.

Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014

A framework for comparative performance study on virtualised machines.

Jiaqi Zhao

Jie Tao

Int. J. Ad Hoc Ubiquitous Comput., 2014

DART-MPI: An MPI-based Implementation of a PGAS Runtime System.

Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

DASH: Data Structures and Algorithms with Support for Hierarchical Locality.

Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

MPI Trace Compression Using Event Flow Graphs.

Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013

Online Performance Introspection with IPM.

Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

Topic 1: Support Tools and Environments - (Introduction).

Bronis R. de Supinski

Bettina Krammer

Dimitrios S. Nikolopoulos

Jesús Labarta

Proceedings of the Euro-Par 2013 Parallel Processing, 2013

2012

A Performance Study of Virtual Machines on Multicore Architectures.

Proceedings of the 20th Euromicro International Conference on Parallel, 2012

Trends in Computation, Communication and Storage and the Consequences for Data-intensive Science.

Simone Ferlin Oliveira

Dieter Kranzlmüller

Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

2011

OpenMP Profiling with OmpP.

Encyclopedia of Parallel Computing, 2011

Performance Evaluation of OpenMP Applications on Virtualized Multicore Machines.

Jie Tao

Holger Marten

Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011

Comprehensive Performance Monitoring for GPU Cluster Systems.

Nicholas J. Wright

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Towards Energy Efficient Parallel Computing on Consumer Electronic Devices.

Christof Klausecker

Dieter Kranzlmüller

Proceedings of the Information and Communication on Technology for the Fight against Global Warming, 2011

Investigating the Scalability of OpenFOAM for the Solution of Transport Equations and Large Eddy Simulations.

Orlando Rivera

Dieter Kranzlmüller

Proceedings of the Algorithms and Architectures for Parallel Processing, 2011

Parallel Aspects of OpenFOAM with Large Eddy Simulations.

Orlando Rivera

Proceedings of the 13th IEEE International Conference on High Performance Computing & Communication, 2011

2010

A programming model performance study using the NAS parallel benchmarks.

Sci. Program., 2010

OpenMP application profiling - state of the art and directions for the future.

Proceedings of the International Conference on Computational Science, 2010

Recording the control flow of parallel applications to determine iterative and phase-based behavior.

Future Gener. Comput. Syst., 2010

Effective Performance Measurement at Petascale Using IPM.

Nicholas J. Wright

Proceedings of the 16th IEEE International Conference on Parallel and Distributed Systems, 2010

Effective Holistic Performance Measurement at Petascale Using IPM.

Proceedings of the Competence in High Performance Computing 2010, 2010

2009

Capturing and Analyzing the Execution Control Flow of OpenMP Applications.

Int. J. Parallel Program., 2009

Performance Analysis and Workload Characterization with IPM.

Nicholas J. Wright

Proceedings of the Tools for High Performance Computing 2009, 2009

Performance Profiling for OpenMP Tasks.

Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

Capturing and Visualizing Event Flow Graphs of MPI Applications.

Proceedings of the Euro-Par 2009, 2009

2008

Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications.

Proceedings of the Tools for High Performance Computing, 2008

Visualizing the Program Execution Control Flow of OpenMP Applications.

Proceedings of the OpenMP in a New Era of Parallelism, 4th International Workshop, 2008

Detection and Analysis of Iterative Behavior in Parallel Applications.

Proceedings of the Computational Science, 2008

Enabling Data Structure Oriented Performance Analysis with Hardware Performance Counter Support.

Proceedings of the Euro-Par 2008 Workshops, 2008

OpenMP-centric performance analysis of hybrid applications.

Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

2007

Specification and detection of performance problems with ASL.

Concurr. Comput. Pract. Exp., 2007

Continuous Runtime Profiling of OpenMP Applications.

Proceedings of the Parallel Computing: Architectures, 2007

Scalability Analysis of the SPEC OpenMP Benchmarks on Large-Scale Shared Memory Multiprocessors.

Jack J. Dongarra

Proceedings of the Computational Science - ICCS 2007, 7th International Conference, Beijing, China, May 27, 2007

On Using Incremental Profiling for the Performance Analysis of Shared Memory Parallel Applications.

Jack J. Dongarra

Proceedings of the Euro-Par 2007, 2007

2006

Scalable automated online performance analysis of applications using performance properties.

PhD thesis, 2006

Analyzing Overheads and Scalability Characteristics of OpenMP Applications.

Proceedings of the High Performance Computing for Computational Science, 2006

Automated Performance Analysis Using ASL Performance Properties.

Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Finding Inefficiencies in OpenMP Applications Automatically with Periscope.

Proceedings of the Computational Science, 2006

2005

Periscope: Advanced Techniques for Performance Analysis.

Edmond Kereku

Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

ompP: A Profiling Tool for OpenMP.

Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

Performance Analysis of Shared-Memory Parallel Applications Using Performance Properties.

Proceedings of the High Performance Computing and Communications, 2005

2004

Task-Queue Based Hybrid Parallelism: A Case Study.

Olaf Schenk

Michael Hagemann

Proceedings of the Euro-Par 2004 Parallel Processing, 2004

2003

Distributed Configurable Application Monitoring on SMP Clusters.

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29, 2003

Distributed Application Monitoring for Clustered SMP Architectures.
