Wolfgang E. Nagel

Proceedings of the 4. DFN-Forum Kommunikationstechnologien, 2011

2010

Preface.

[BibT_eX]

[DOI]

Matthias S. Müller

Concurr. Comput. Pract. Exp., 2010

Preface.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2010

Highly Scalable Dynamic Load Balancing in the Atmospheric Modeling System COSMO-SPECS+FD4.

[BibT_eX]

[DOI]

Proceedings of the Applied Parallel and Scientific Computing, 2010

Efficient Pattern Based I/O Analysis of Parallel Programs.

[BibT_eX]

[DOI]

Proceedings of the 39th International Conference on Parallel Processing, 2010

eeClust: Energy-Efficient Cluster Computing.

[BibT_eX]

[DOI]

Proceedings of the Competence in High Performance Computing 2010, 2010

Score-P: A Unified Performance Measurement System for Petascale Applications.

[BibT_eX]

[DOI]

Proceedings of the Competence in High Performance Computing 2010, 2010

2009

Performance at Exascale.

[BibT_eX]

[DOI]

Bernd Mohr

Matthias S. Müller

Int. J. High Perform. Comput. Appl., 2009

Tools for scalable parallel program analysis: Vampir NG, MARMOT, and DeWiz.

[BibT_eX]

[DOI]

Int. J. Comput. Sci. Eng., 2009

A framework for detailed multiphase cloud modeling on HPC systems.

[BibT_eX]

[DOI]

Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

An Interface for Integrated MPI Correctness Checking.

[BibT_eX]

[DOI]

Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Comparing cache architectures and coherency protocols on x86-64 multicore SMP systems.

[BibT_eX]

[DOI]

Daniel Hackenberg

Daniel Molka

Proceedings of the 42st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Pattern Matching and I/O Replay for POSIX I/O in Parallel Programs.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008

Improved Performance for Nodal Spectral Element Operators.

[BibT_eX]

[DOI]

Uwe Fladrich

Jörg Stiller

Int. J. High Perform. Comput. Appl., 2008

Internal Timer Synchronization for Parallel Event Tracing.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

The Vampir Performance Analysis Tool-Set.

[BibT_eX]

[DOI]

Proceedings of the Tools for High Performance Computing, 2008

Trace-Based Analysis and Optimization for the Semtex CFD Application - Hidden Remote Memory Accesses and I/O Performance.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2008 Workshops, 2008

Event Tracing and Visualization for Cell Broadband Engine Systems.

[BibT_eX]

[DOI]

Daniel Hackenberg

Proceedings of the Euro-Par 2008, 2008

2007

Analyzing Cache Bandwidth on the Intel Core 2 Architecture.

[BibT_eX]

Robert Schöne

Stefan Pflüger

Proceedings of the Parallel Computing: Architectures, 2007

Developing Scalable Applications with Vampir, VampirServer and VampirTrace.

[BibT_eX]

Proceedings of the Parallel Computing: Architectures, 2007

Analyzing Mutual Influences of High Performance Computing Programs on SGI Altix 3700 and 4700 Systems with PARbench.

[BibT_eX]

Proceedings of the Parallel Computing: Architectures, 2007

Analysis of Linux Scheduling with VAMPIR.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2007, 7th International Conference, Beijing, China, May 27, 2007

Memory Allocation Tracing with VampirTrace.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2007, 7th International Conference, Beijing, China, May 27, 2007

Topic 2 Performance Prediction and Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2007, 2007

Computational Steering and Online Visualization of Scientific Applications on Large-Scale HPC Systems within e-Science Infrastructures.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on e-Science and Grid Computing, 2007

2006

Compressible memory data structures for event-based trace analysis.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2006

Open trace - The open trace format (OTF) and open tracing for HPC.

[BibT_eX]

[DOI]

Allen D. Malony

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

M09 - Program analysis tools for massively parallel applications: how to achieve highest performance.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Visualization of Repetitive Patterns in Event Traces.

[BibT_eX]

[DOI]

Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Introducing the Open Trace Format (OTF).

[BibT_eX]

[DOI]

Proceedings of the Computational Science, 2006

Analyzing the Interaction of OpenMP Programs Within Multiprogramming Environments on a Sun Fire E25K System with PARbench.

[BibT_eX]

[DOI]

Rick Janda

Bernd Trenkler

Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Optimizing OpenMP Parallelized DGEMM Calls on SGI Altix 3700.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

2005

Monitoring cache behavior on parallel SMP architectures and related programming tools.

[BibT_eX]

[DOI]

Ralph Müller-Pfefferkorn

Future Gener. Comput. Syst., 2005

High Performance Event Trace Visualization.

[BibT_eX]

[DOI]

Proceedings of the 13th Euromicro Workshop on Parallel, 2005

Performance Comparison and Optimization: Case Studies using BenchIT.

[BibT_eX]

Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Scheduling issues on IBM p690: Performance Analysis with the PARbench Environment.

[BibT_eX]

H. Dietze

Bernd Trenkler

Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Tracing the Cache Behaviour of Data Structures in Fortran Applications.

[BibT_eX]

L. Barabas

Ralph Müller-Pfefferkorn

Reinhard Neumann

Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Construction and Compression of Complete Call Graphs for Post-Mortem Program Trace Analysis.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

New Algorithms for Performance Trace Analysis Based on Compressed Complete Call Graphs.

[BibT_eX]

[DOI]

Proceedings of the Computational Science, 2005

Statistical Methods for Automatic Performance Bottleneck Detection in MPI Based Programs.

[BibT_eX]

[DOI]

Proceedings of the Computational Science, 2005

Knowledge Based Automatic Scalability Analysis and Extrapolation for MPI Programs.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

05501 Abstracts Collection - Automatic Performance Analysis.

[BibT_eX]

[DOI]

Proceedings of the Automatic Performance Analysis, 12.-16. December 2005, 2005

05501 Summary - Automatic Performance Analysis.

[BibT_eX]

[DOI]

Proceedings of the Automatic Performance Analysis, 12.-16. December 2005, 2005

2004

Grid-Computing.

[BibT_eX]

[DOI]

François Bry

Michael Schroeder

Inform. Spektrum, 2004

Performance Analysis with BenchIT: Portable, Flexible, Easy to Use.

[BibT_eX]

[DOI]

Proceedings of the 1st International Conference on Quantitative Evaluation of Systems (QEST 2004), 2004

Detection of Collective MPI Operation Patterns.

[BibT_eX]

[DOI]

Dieter Kranzlmüller

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Pattern Matching of Collective MPI Operations.

[BibT_eX]

Dieter Kranzlmüller

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2004

A Parallel PSPG Finite Element Method for Direct Simulation of Incompressible Flow.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Topic 2: Performance Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Optimizing Cache Access: A Tool for Source-to-Source Transformations and Real-Life Compiler Tests.

[BibT_eX]

[DOI]

Ralph Müller-Pfefferkorn

Bernd Trenkler

Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Tools for Scalable Parallel Program Analysis - Vampir VNG and DeWiz.

[BibT_eX]

Dieter Kranzlmüller

Proceedings of the Distributed and Parallel Systems: Cluster and Grid Computing (DAPSYS 2004, 2004

2003

BenchIT - Performance Measurements and Comparison for Scientific Applications.

[BibT_eX]

Proceedings of the Parallel Computing: Software Technology, 2003

Scalable Performance Analysis of Parallel Systems: Concepts and Experiences.

[BibT_eX]

Proceedings of the Parallel Computing: Software Technology, 2003

Performance Analysis of a Parallel Application in the GRID.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2003, 2003

A Distributed Performance Analysis Architecture for Clusters.

[BibT_eX]

[DOI]

Allen D. Malony

Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

2002

VGV: Supporting Performance Analysis of Object-Oriented Mixed MPI/OpenMP Parallel Applications.

[BibT_eX]

[DOI]

Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

2001

An Integrated Performance Visualizer for MPI/OpenMP Programs.

[BibT_eX]

[DOI]

Proceedings of the OpenMP Shared Memory Parallel Programming, 2001

Performance Optimization for Large Scale Computing: The Scalable VAMPIR Approach.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2001, 2001

An Hierarchical MPI Communication Model for the Parallelized Solution of Multiple Integrals.

[BibT_eX]

[DOI]

Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

Group-Based Performance Analysis for Multithreaded SMP Cluster Applications.

[BibT_eX]

[DOI]

Hans-Christian Hoppe

Proceedings of the Euro-Par 2001: Parallel Processing, 2001

2000

Performance Tuning on Parallel Systems: All Problems Solved?

[BibT_eX]

[DOI]

Stephan Seidl

Proceedings of the Applied Parallel Computing, 2000

An Efficient Parallel Linear Solver with a Cascadic Conjugate Gradient Method: Experience with Reality.

[BibT_eX]

[DOI]

Peter Gottschling

Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

Performance Evaluation and Prediction.

[BibT_eX]

[DOI]

Thomas Fahringer

Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999

A New Approach for Parallel Multigrid Adaption.

[BibT_eX]

Jörg Stiller

Krzysztof Boryczko

Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

MG - A toolbox for parallel grid adaption and implementing multigrid solvers unstructured.

[BibT_eX]

Jörg Stiller

Proceedings of the Parallel Computing: Fundamentals & Applications, 1999

Effective performance problem detection of MPI programs on MPP systems: From the global view to the details.

[BibT_eX]

Proceedings of the Parallel Computing: Fundamentals & Applications, 1999

Three-dimensional direct numerical simulation of flow problems with electromagnetic control on parallel systems.

[BibT_eX]

Proceedings of the Parallel Computing: Fundamentals & Applications, 1999

1997

Metacomputing in a Regional ATM-Testbed - Experience with Reality.

[BibT_eX]

Proceedings of the Parallel Computing: Fundamentals, 1997

1995

Effektive Nutzung von Parallelrechnern in Rechenzentrumsumgebungen.

[BibT_eX]

Proceedings of the Organisation und Betrieb von DV-Versorungssystemen, 1995

1993

Ein verteiltes Scheduler-System für Mehrprozessorrechner mit gemeinsamem Speicher: Untersuchungen zur Ablaufplanung von parallelen Programmen.

[BibT_eX]

[DOI]

PhD thesis, 1993

1991

Benchmarking parallel programs in a multiprogramming environment: the PAR-Bench system.

[BibT_eX]

[DOI]

Markus A. Linn

Parallel Comput., 1991

Parallel programs and background load: efficiency studies with the PAR-Bench system.

[BibT_eX]

[DOI]

Markus A. Linn

Proceedings of the 5th international conference on Supercomputing, 1991

1990

Exploiting autotasking on a CRAY Y-MP: an improved software interface to multitasking.

[BibT_eX]

[DOI]

Parallel Comput., 1990

Parallelizing QCD with dynamical fermions on a Cray multiprocessor system.

[BibT_eX]

[DOI]

S. Knecht

E. Laermann

Parallel Comput., 1990

Prinzipien der Parallelverarbeitung auf Rechnern mit gemeinsamem Speicher.

[BibT_eX]

[DOI]

Proceedings of the GI, 1990

1989

Multitasking: experience with applications on a CRAY X-MP.

[BibT_eX]

[DOI]

Friedel Hossfeld

Renate Knecht

Parallel Comput., 1989

A comparison of parallel processing on Cray X-MP AND IBM 3090 VF multiprocessors.

[BibT_eX]

[DOI]

Ferenc Szelényi

Proceedings of the 3rd international conference on Supercomputing, 1989

1988

Using multiple CPUs for problem solving: experiences in multitasking on the CRAY X-MP/48.

[BibT_eX]

[DOI]

Parallel Comput., 1988

Three-dimensional numerical simulations of the czochralski bulk flow on a CRAY X-MP multiprocessor architecture.

[BibT_eX]

[DOI]