Martin Schulz

Aswathy Nedumpalli Sankaranarayanan

Proceedings of the Design, Automation & Test in Europe Conference, 2026

Multi-Partner Project: Advancing European Semiconductor and Chiplet Innovation Through the Bavarian Chip Design Center.

[BibT_eX]

[DOI]

Aswathy Nedumpalli Sankaranarayanan

Andreas Korb

Konrad Hohentanner

Proceedings of the Design, Automation & Test in Europe Conference, 2026

POSTER: Towards a RISC-V-based SmartNIC Architecture on FPGA.

[BibT_eX]

[DOI]

Kun Qin

Taiki Okano

Proceedings of the 23rd ACM International Conference on Computing Frontiers, 2026

2025

Toward Dynamic Resource Management: An Asynchronous Many-Task (AMT) Runtime System leveraging Dynamic Processes with PSets (DPP).

[BibT_eX]

[DOI]

SN Comput. Sci., December, 2025

Integration of Quantum Accelerators with High Performance Computing - A Review of Quantum Programming Tools.

[BibT_eX]

[DOI]

Muhammad Arslan Ansari

Christian B. Mendl

ACM Trans. Quantum Comput., September, 2025

HiPC25 Computational Artifact: Dynamic Resource Management in HPC Systems using Dynamic Processes with PSets.

[BibT_eX]

[DOI]

Antonio J. Peña Monferrer

Sergio Iserte

Martin Schreiber

Pierre-François Dutot

Olivier Richard

Keerthi Gaddameedi

Tobias Neckel

Hans-Joachim Bungartz

Dataset, September, 2025

Advancing user-space networking for DDS message-oriented middleware: Further extensions.

[BibT_eX]

[DOI]

Pervasive Mob. Comput., 2025

Dynamic Resource Management: Comparison of Asynchronous Many-Task (AMT) and Dynamic Processes with PSets (DPP).

[BibT_eX]

[DOI]

Proceedings of the Asynchronous Many-Task Systems and Applications, 2025

Towards a Unified Architectural Representation in HPCQC: Extending Sys-Sage for Quantum Technologies.

[BibT_eX]

[DOI]

Proceedings of the ISC High Performance 2025 Research Paper Proceedings (40th International Conference), 2025

Bridging the Gap Between Genericity and Programmability of Dynamic Resources in HPC.

[BibT_eX]

[DOI]

Proceedings of the ISC High Performance 2025 Research Paper Proceedings (40th International Conference), 2025

Application-Focused HPC Network Monitoring.

[BibT_eX]

[DOI]

Philipp A. Friese

Olivier Marsden

Proceedings of the ISC High Performance 2025 Research Paper Proceedings (40th International Conference), 2025

Telemetry for Quantum Systems in HPC Centers.

[BibT_eX]

[DOI]

Mahmoud Abuzayed

Proceedings of the ISC High Performance 2025 Research Paper Proceedings (40th International Conference), 2025

Minimizing Readout and State Preparation Time for Neutral Atom Quantum Computing.

[BibT_eX]

[DOI]

Proceedings of the Software Engineering 2025 Companion Proceedings, 2025

Bridge the Gap Between HPC Systems and Various Quantum Platforms: A Unified Quantum Platform.

[BibT_eX]

[DOI]

Proceedings of the Software Engineering 2025 Companion Proceedings, 2025

MT4G: A Tool for Reliable Auto-Discovery of NVIDIA and AMD GPU Compute and Memory Topologies.

[BibT_eX]

[DOI]

Stepan Vanecek

Manuel Walter Mußbacher

Dominik Größler

Urvij Saroliya

Proceedings of the SC '25 Workshops of the International Conference for High Performance Computing, 2025

A GPU FFT Wrapper to Co-optimize Floating-Point Precision and Library Selection via Predictive Error Modeling.

[BibT_eX]

[DOI]

Julius Lehner

Proceedings of the SC '25 Workshops of the International Conference for High Performance Computing, 2025

Tackling the Challenges of Adding Pulse-level Support to a Heterogeneous HPCQC Software Stack: MQSS Pulse.

[BibT_eX]

[DOI]

Proceedings of the SC '25 Workshops of the International Conference for High Performance Computing, 2025

HiPARS: Highly-Parallel Atom Rearrangement Sequencer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2025

Towards a Unified Multi-Target Mlir-Based Compiler: A Heterogeneous Compilation Framework for High-Performance and Quantum Computing Integration.

[BibT_eX]

[DOI]

Martín Letras

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2025

Qubit Health Analytics and Clustering for HPC-Integrated Quantum Processors.

[BibT_eX]

[DOI]

Xiaolong Deng

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2025

Q-BEAST: A Practical Course on Experimental Evaluation and Characterization of Quantum Computing Systems.

[BibT_eX]

[DOI]

Minh Chung

Yaknan John Gambo

Burak Mete

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2025

Analysis of the RISC-V Vector Extension for Vulkan Graphics Kernels.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2025

SARComp: High-Performance Algorithms for SAR Processing from FFT Kernels to Matched Filtering.

[BibT_eX]

[DOI]

Maron Schlemon

Rolf Scheiber

Proceedings of the IEEE High Performance Extreme Computing Conference, 2025

Dynamic Resource Management in HPC Systems Using Dynamic Processes with PSets.

[BibT_eX]

[DOI]

Keerthi Gaddameedi

Tobias Neckel

Hans-Joachim Bungartz

Pierre-François Dutot

Proceedings of the 32nd IEEE International Conference on High Performance Computing, 2025

Design of an FPGA-Based Neutral Atom Rearrangement Accelerator for Quantum Computing.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference, 2025

KLiNQ: Knowledge Distillation-Assisted Lightweight Neural Network for Qubit Readout on FPGA.

[BibT_eX]

[DOI]

Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

VersaSlot: Efficient Fine-grained FPGA Sharing with Big.Little Slots and Live Migration in FPGA Cluster.

[BibT_eX]

[DOI]

Proceedings of the 62nd ACM/IEEE Design Automation Conference, 2025

Closing the HPC-Cloud Convergence Gap: Multi-Tenant Slingshot RDMA for Kubernetes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2025

POSTER: Performance Comparison of GPU Programming Models Using HeCBench Benchmarks.

[BibT_eX]

[DOI]

Proceedings of the 22nd ACM International Conference on Computing Frontiers, 2025

FERIVer: An FPGA-assisted Emulated Framework for RTL Verification of RISC-V Processors.

[BibT_eX]

[DOI]

Proceedings of the 22nd ACM International Conference on Computing Frontiers, 2025

Cache Miss Curve Analysis via Cardinality Domain.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Parallel Architectures and Compilation Techniques, 2025

2024

Malleability in Modern HPC Systems: Current Experiences, Challenges, and Future Opportunities.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., September, 2024

Malleability techniques applications in high-performance computing.

[BibT_eX]

[DOI]

Jesús Carretero

Estela Suarez

Int. J. High Perform. Comput. Appl., 2024

Design Principles of Dynamic Resource Management for High-Performance Parallel Programming Models.

[BibT_eX]

[DOI]

CoRR, 2024

Every Mapping Counts in Large Amounts: Folio Accounting.

[BibT_eX]

[DOI]

David Hildenbrand

Nadav Amit

Proceedings of the 2024 USENIX Annual Technical Conference, 2024

Calibration and Performance Evaluation of a Superconducting Quantum Processor in an HPC Center.

[BibT_eX]

[DOI]

Proceedings of the ISC High Performance 2024 Research Paper Proceedings (39th International Conference), 2024

Leveraging Hybrid Classical-Quantum Methods for Efficient Load Rebalancing in HPC.

[BibT_eX]

[DOI]

Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024

A Software Platform to Support Disaggregated Quantum Accelerators.

[BibT_eX]

[DOI]

Ercüment Kaya

Aleksandra Swierkowska

Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024

Comparison of Atom Detection Algorithms for Neutral Atom Quantum Computing.

[BibT_eX]

[DOI]

Andrea Alberti

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

QDMI - Quantum Device Management Interface: Hardware-Software Interface for the Munich Quantum Software Stack.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

Achieving Pareto-Optimality in Quantum Circuit Compilation via a Multi-Objective Heuristic Optimization Approach.

[BibT_eX]

[DOI]

Aleksandra Swierkowska

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

QPI: A Programming Interface for Quantum Computers.

[BibT_eX]

[DOI]

Ercüment Kaya

Burak Mete

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

An FPGA-Accelerated Atom Sorting Unit for Neutral Atom Quantum Computers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

An FPGA-based Quantum Control System with a Runtime Configurable Signal Generator.

[BibT_eX]

[DOI]

Taiqian Guo

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

Integration of Quantum Accelerators into HPC: Toward a Unified Quantum Platform.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

Dynamic Resource Management for In-Situ Techniques Using MPI-Sessions.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2024

Power Consumption and Energy Efficiency of Quantum Computing Platforms in High Performance Computing Integration.

[BibT_eX]

[DOI]

Xiaolong Deng

Proceedings of the Parallel Processing and Applied Mathematics, 2024

Adopting User-Space Networking for DDS Message-Oriented Middleware.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Pervasive Computing and Communications, 2024

sys-sage: A Unified Representation of Dynamic Topologies & Attributes on HPC Systems.

[BibT_eX]

[DOI]

Stepan Vanecek

Proceedings of the 38th ACM International Conference on Supercomputing, 2024

Reinforcement Learning-Driven Co-Scheduling and Diverse Resource Assignments on NUMA Systems.

[BibT_eX]

[DOI]

Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

Real-Time Capability of DLR'S Beamforming Synthetic Aperture Radar Processing Architecture.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Non-Blocking GPU-CPU Notifications to Enable More GPU-CPU Parallelism.

[BibT_eX]

[DOI]

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2024

A Mechanism to Generate Interception Based Tools for HPC Libraries.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2024: Parallel Processing, 2024

A Layered Approach for Dynamic Resource Management in HPC.

[BibT_eX]

[DOI]

Hans-Joachim Bungartz

Pierre-François Dutot

Proceedings of the Euro-Par 2024: Parallel Processing Workshops, 2024

Dataset Distillation by Automatic Training Trajectories.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

Distributed Order Recording Techniques for Efficient Record-and-Replay of Multi - Threaded Programs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2024

A Portable Tool to Compare Performance Profiles from GPU Offloading Programming Models.

[BibT_eX]

[DOI]

Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024

The European Chips Act and its Impact on Teaching.

[BibT_eX]

[DOI]

Michael Pehl

Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024

From the Physics Lab to the Computer Lab: Towards Flexible and Comprehensive DevOps for Quantum Computing.

[BibT_eX]

[DOI]

Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024

Exploring the ARM Coherent Mesh Network Topology.

[BibT_eX]

[DOI]

Philipp Friese

Thandayuthapani Subramanian

Proceedings of the Architecture of Computing Systems - 37th International Conference, 2024

2023

Quantum Task Offloading with the OpenMP API.

[BibT_eX]

[DOI]

CoRR, 2023

GreenCourier: Carbon-Aware Scheduling for Serverless Functions.

[BibT_eX]

[DOI]

Mohak Chadha

Proceedings of the 9th International Workshop on Serverless Computing, 2023

Overcoming Weak Scaling Challenges in Tree-Based Nearest Neighbor Time Series Mining.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 38th International Conference, 2023

A Case Study on PMIx-Usage for Dynamic Resource Management.

[BibT_eX]

[DOI]

Martin Schreiber

Proceedings of the High Performance Computing, 2023

Probabilistic Job History Conversion and Performance Model Generation for Malleable Scheduling Simulations.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2023

GPUscout: Locating Data Movement-related Bottlenecks on GPUs.

[BibT_eX]

[DOI]

Soumya Sen

Stepan Vanecek

Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Sustainability in HPC: Vision and Opportunities.

[BibT_eX]

[DOI]

Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

DDS Implementations as Real-Time Middleware - A Systematic Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 29th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2023

Realistic Neutral Atom Image Simulation.

[BibT_eX]

[DOI]

Dimitrios Tsevas

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

Toward a Unified Hybrid HPCQC Toolchain.

[BibT_eX]

[DOI]

Philipp Seitz

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

Towards the Munich Quantum Software Stack: Enabling Efficient Access and Tool Support for Quantum Computers.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

Challenges in HPCQC Integration.

[BibT_eX]

[DOI]

Muhammad Arslan Ansari

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

Quantum Computer Metrics and HPC Center Environmental Sensor Data Analysis Towards Fidelity Prediction.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

Systematic Analysis of DDS Implementations.

[BibT_eX]

[DOI]

Proceedings of the 24th International Middleware Conference, 2023

Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge Environments.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2023

HiSEP-Q: A Highly Scalable and Efficient Quantum Control Processor for Superconducting Qubits.

[BibT_eX]

[DOI]

Kun Qin

Proceedings of the 41st IEEE International Conference on Computer Design, 2023

A Scalable and Cross-Technology Quantum Control Processor.

[BibT_eX]

[DOI]

Proceedings of the 33rd International Conference on Field-Programmable Logic and Applications, 2023

OpenCUBE: Building an Open Source Cloud Blueprint with EPI Systems.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023

Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2023

Copy-on-Pin: The Missing Piece for Correct Copy-on-Write.

[BibT_eX]

[DOI]

David Hildenbrand

Nadav Amit

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022

Operational Data Analytics in practice: Experiences from design to deployment in production HPC environments.

[BibT_eX]

[DOI]

Parallel Comput., 2022

Resiliency in numerical algorithm design for extreme scale simulations.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2022

Accelerating HPC With Quantum Computing: It Is a Software Challenge Too.

[BibT_eX]

[DOI]

Comput. Sci. Eng., 2022

An Emulation Layer for Dynamic Resources with MPI Sessions.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing. ISC High Performance 2022 International Workshops - Hamburg, Germany, May 29, 2022

On the Convergence of Malleability and the HPC PowerStack: Exploiting Dynamism in Over-Provisioned and Power-Constrained HPC Systems.

[BibT_eX]

[DOI]

Isaías Comprés

Proceedings of the High Performance Computing. ISC High Performance 2022 International Workshops - Hamburg, Germany, May 29, 2022

Towards Dynamic Resource Management with MPI Sessions and PMIx.

[BibT_eX]

[DOI]

Proceedings of the EuroMPI/USA'22: 29th European MPI Users' Group Meeting, Chattanooga, TN, USA, September 26, 2022

Exploiting Reduced Precision for GPU-based Time Series Mining.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Optimizing Hardware Resource Partitioning and Job Allocations on Modern GPUs under Power Caps.

[BibT_eX]

[DOI]

Proceedings of the Workshop Proceedings of the 51st International Conference on Parallel Processing, 2022

Resource-Constrained Optimizations For Synthetic Aperture Radar On-Board Image Processing.

[BibT_eX]

[DOI]

Maron Schlemon

Robert Josef Widhopf-Fenk

Rolf Scheiber

Proceedings of the IEEE High Performance Extreme Computing Conference, 2022

Querying Distributed Sensor Streams in the Edge-to-Cloud Continuum.

[BibT_eX]

[DOI]

Roman Karlstetter

Proceedings of the IEEE International Conference on Edge Computing and Communications, 2022

Orchestrated Co-scheduling, Resource Partitioning, and Power Capping on CPU-GPU Heterogeneous Systems via Machine Learning.

[BibT_eX]

[DOI]

Proceedings of the Architecture of Computing Systems - 35th International Conference, 2022

Energy Efficient Frequency Scaling on GPUs in Heterogeneous HPC Systems.

[BibT_eX]

[DOI]

Karlo Kraljic

Daniel Kerger

Proceedings of the Architecture of Computing Systems - 35th International Conference, 2022

2021

Guest Editorial: Special Issue on Computing Frontiers.

[BibT_eX]

[DOI]

Kento Sato

J. Signal Process. Syst., 2021

PredCom: A Predictive Approach to Collecting Approximated Communication Traces.

[BibT_eX]

[DOI]

Shinobu Miwa

IEEE Trans. Parallel Distributed Syst., 2021

Quantum Algorithms for Solving Ordinary Differential Equations via Classical Integration Methods.

[BibT_eX]

[DOI]

Quantum, 2021

Graph-based multi-core higher-order time integration of linear autonomous partial differential equations.

[BibT_eX]

[DOI]

Martin Schreiber

J. Comput. Sci., 2021

Understanding I/O Behavior in Scientific and Data-Intensive Computing (Dagstuhl Seminar 21332).

[BibT_eX]

[DOI]

Dagstuhl Reports, 2021

virtio-mem: paravirtualized memory hot(un)plug.

[BibT_eX]

[DOI]

David Hildenbrand

Proceedings of the VEE '21: 17th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2021

Efficient LLVM-based dynamic binary translation.

[BibT_eX]

[DOI]

Alexis Engelke

Dominik Okwieka

Manjunath Gorentla Venkata

Proceedings of the VEE '21: 17th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2021

A next-generation discontinuous galerkin fluid dynamics solver with application to high-resolution lung airflow simulations.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2021

On the Exploration and Optimization of High-Dimensional Architectural Design Space.

[BibT_eX]

[DOI]

Proceedings of the 1st Workshop on Performance EngineeRing, 2021

Correlation-wise Smoothing: Lightweight Knowledge Extraction for HPC Monitoring Data.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

On the Inevitability of Integrated HPC Systems and How they will Change HPC System Operations.

[BibT_eX]

[DOI]

Proceedings of the HEART '21: 11th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, 2021

Living on the Edge: Efficient Handling of Large Scale Sensor Data.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021

2020

Footprint-Based DIMM Hotplug.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2020

QMPI: A next generation MPI profiling interface for modern HPC platforms.

[BibT_eX]

[DOI]

Parallel Comput., 2020

EReinit: Scalable and efficient fault-tolerance for bulk-synchronous MPI applications.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2020

A survey of MPI usage in the US exascale computing project.

[BibT_eX]

[DOI]

David E. Bernholdt

Swen Boehm

George Bosilca

Concurr. Comput. Pract. Exp., 2020

Instrew: leveraging LLVM for high performance dynamic binary instrumentation.

[BibT_eX]

[DOI]

Alexis Engelke

Proceedings of the VEE '20: 16th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2020

Time Series Mining at Petascale Performance.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 35th International Conference, 2020

Characterizing HPC Performance Variation with Monitoring and Unsupervised Learning.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2020

Footprint-Aware Power Capping for Hybrid Memory Based Systems.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 35th International Conference, 2020

Pattern-Aware Staging for Hybrid Memory Systems.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 35th International Conference, 2020

Workshop 16: SNACS Scalable Networks for Advanced Computing Systems.

[BibT_eX]

[DOI]

Ilkay Altintas

Matthew G. F. Dosanjh

Ryan E. Grant

Taylor L. Groves

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Cache-Aware Matrix Polynomials.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2020, 2020

DCDB Wintermute: Enabling Online and Holistic Operational Data Analytics on HPC Systems.

[BibT_eX]

[DOI]

Proceedings of the HPDC '20: The 29th International Symposium on High-Performance Parallel and Distributed Computing, 2020

2019

The MPI_T events interface: An early evaluation and overview of the interface.

[BibT_eX]

[DOI]

Parallel Comput., 2019

Pruners.

[BibT_eX]

[DOI]

Christopher M. Chambreau

Simone Atzeni

Michael Bentley

Int. J. High Perform. Comput. Appl., 2019

From facility to application sensor data: modular, continuous and holistic monitoring with DCDB.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2019

Preparation and optimization of a diverse workload for a large-scale heterogeneous system.

[BibT_eX]

[DOI]

Ian Karlin

Yoonho Park

Guillaume Thomas-Collignon

Sara Kokkila Schumacher

Proceedings of the International Conference for High Performance Computing, 2019

Predicting faults in high performance computing systems: an in-depth survey of the state-of-the-practice.

[BibT_eX]

[DOI]

David Jauk

Dai Yang

Proceedings of the International Conference for High Performance Computing, 2019

QMPI: a next generation MPI profiling interface for modern HPC platforms.

[BibT_eX]

[DOI]

Bengisu Elis

Dai Yang

Proceedings of the 26th European MPI Users' Group Meeting, 2019

Optimizing computation-communication overlap in asynchronous task-based programs: poster.

[BibT_eX]

[DOI]

Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

Exploring High Bandwidth Memory for PET Image Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the Parallel Computing: Technology Trends, 2019

SAFIRE: Scalable and Accurate Fault Injection for Parallel Multithreaded Applications.

[BibT_eX]

[DOI]

Giorgis Georgakoudis

Hans Vandierendonck

Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Power efficient job scheduling by predicting the impact of processor manufacturing variability.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Supercomputing, 2019

Optimizing computation-communication overlap in asynchronous task-based programs.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Supercomputing, 2019

Reducing False Node Failure Predictions in HPC.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

2018

MemAxes: Visualization and Analytics for Characterizing Complex Memory Performance Behaviors.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., 2018

FlipTracker: understanding natural error resilience in HPC applications.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2018

Enabling callback-driven runtime introspection via MPI_T.

[BibT_eX]

[DOI]

Proceedings of the 25th European MPI Users' Group Meeting, 2018

Analyzing Resource Trade-offs in Hardware Overprovisioned Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Interference between I/O and MPI Traffic on Fat-tree Networks.

[BibT_eX]

[DOI]

Proceedings of the 47th International Conference on Parallel Processing, 2018

Thread-local concurrency: a technique to handle data race detection at programming model abstraction.

[BibT_eX]

[DOI]

Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018

Co-Scheduling in a Task-Based Programming Model.

[BibT_eX]

[DOI]

Proceedings of the 3rd Workshop on Co-Scheduling of HPC Applications, 2018

Panel discussions: "Challenges to the scaling limits: How can we achieve sustainable power-performance improvements?".

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Symposium in Low-Power and High-Speed Chips, 2018

2017

ScrubJay: deriving knowledge from the disarray of HPC performance data.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2017

REFINE: realistic fault injection via compiler-based instrumentation for accuracy, portability and speed.

[BibT_eX]

[DOI]

Giorgis Georgakoudis

Proceedings of the International Conference for High Performance Computing, 2017

Simulating Power Scheduling at Scale.

[BibT_eX]

[DOI]

Proceedings of the 5th International Workshop on Energy Efficient Supercomputing, 2017

Noise Injection Techniques to Expose Subtle and Unintended Message Races.

[BibT_eX]

[DOI]

Christopher M. Chambreau

Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

OpenMP Tools Interface: Synchronization Information for Data Race Detection.

[BibT_eX]

[DOI]

Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

Production Hardware Overprovisioning: Real-World Performance Optimization Using an Extensible Power-Aware Resource Management Framework.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Power Aware High Performance Computing: Challenges and Opportunities for Application and System Developers - Survey & Tutorial.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Conference on High Performance Computing & Simulation, 2017

Accelerating Big Data Infrastructure and Applications (Ongoing Collaboration).

[BibT_eX]

[DOI]

Proceedings of the 37th IEEE International Conference on Distributed Computing Systems Workshops, 2017

Understanding the Spatial Characteristics of DRAM Errors in HPC Clusters.

[BibT_eX]

[DOI]

Proceedings of the ACM Workshop on Fault-Tolerance for HPC at Extreme Scale, 2017

Flexible Data Aggregation for Performance Profiling.

[BibT_eX]

[DOI]

David Böhme

David Beckingsale

Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016

Exploiting Redundancy and Application Scalability for Cost-Effective, Time-Constrained Execution of HPC Applications on Amazon EC2.

[BibT_eX]

[DOI]

Aniruddha Marathe

Rachel Harris

IEEE Trans. Parallel Distributed Syst., 2016

Ordering Traces Logically to Identify Lateness in Message Passing Programs.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2016

Evaluating and extending user-level fault tolerance in MPI applications.

[BibT_eX]

[DOI]

Howard Pritchard

Int. J. High Perform. Comput. Appl., 2016

Exploring the MPI tool information interface: features and capabilities.

[BibT_eX]

[DOI]

Tanzima Z. Islam

Int. J. High Perform. Comput. Appl., 2016

Development effort estimation in HPC.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2016

Economic Viability of Hardware Overprovisioning in Power-Constrained High Performance Computing.

[BibT_eX]

[DOI]

Proceedings of the 4th International Workshop on Energy Efficient Supercomputing, 2016

VIPACT: A Visualization Interface for Analyzing Calling Context Trees.

[BibT_eX]

[DOI]

Proceedings of the Third Workshop on Visual Performance Analysis, 2016

Pinpointing scale-dependent integer overflow bugs in large-scale parallel applications.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2016

A machine learning framework for performance coverage analysis of proxy applications.

[BibT_eX]

[DOI]

Tanzima Z. Islam

Jayaraman J. Thiagarajan

Abhinav Bhatele

Proceedings of the International Conference for High Performance Computing, 2016

A Performance Model for Allocating the Parallelism in a Multigrid-in-Time Solver.

[BibT_eX]

[DOI]

Proceedings of the 7th International Workshop on Performance Modeling, 2016

A Unified Platform for Exploring Power Management Strategies.

[BibT_eX]

[DOI]

Proceedings of the 4th International Workshop on Energy Efficient Supercomputing, 2016

Caliper: performance introspection for HPC software stacks.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2016

Allowing MPI tools builders to forget about Fortran.

[BibT_eX]

[DOI]

Søren Rasmussen

Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

MPI Sessions: Leveraging Runtime Infrastructure to Increase Scalability of Applications at Exascale.

[BibT_eX]

[DOI]

Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

Testing Infrastructure for OpenMP Debugging Interface Implementations.

[BibT_eX]

[DOI]

Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

Structural Clustering: A New Approach to Support Performance Analysis at Scale.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

I/O Aware Power Shifting.

[BibT_eX]

[DOI]

Lee Savoie

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

MPMD Framework for Offloading Load Balance Computation.

[BibT_eX]

[DOI]

Olga Pearce

Nancy M. Amato

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Power Balancing in an Emulated Exascale Environment.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Systemwide Power Management with Argo.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

ARCHER: Effectively Spotting Data Races in Large OpenMP Applications.

[BibT_eX]

[DOI]

Simone Atzeni

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Runtime-Guided Mitigation of Manufacturing Variability in Power-Constrained Multi-Socket NUMA Nodes.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Supercomputing, 2016

Fast Multi-parameter Performance Modeling.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

IPAS: intelligent protection against silent output corruption in scientific applications.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Symposium on Code Generation and Optimization, 2016

2015

Connecting Performance Analysis and Visualization (Dagstuhl Perspectives Workshop 14022).

[BibT_eX]

[DOI]

Dagstuhl Manifestos, 2015

Debugging high-performance computing applications at massive scales.

[BibT_eX]

[DOI]

Commun. ACM, 2015

A Run-Time System for Power-Constrained HPC Applications.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 30th International Conference, 2015

Clock delta compression for scalable order-replay of non-deterministic parallel applications.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Recovering logical structure from Charm++ event traces.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Analyzing and mitigating the impact of manufacturing variability in power-constrained supercomputing.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Dynamic power sharing for higher job throughput.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Finding the limits of power-constrained application performance.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Decoupled load balancing.

[BibT_eX]

[DOI]

Olga Pearce

Nancy M. Amato

Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

Lessons Learned from Implementing OMPD: A Debugging Interface for OpenMP.

[BibT_eX]

[DOI]

Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Predicting Optimal Power Allocation for CPU and DRAM Domains.

[BibT_eX]

[DOI]

Ananta Tiwari

Laura Carrington

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

A Scalable Prescriptive Parallel Debugging Model.

[BibT_eX]

[DOI]

Nicklas Bo Jensen

Niklas Quarfot Nielsen

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Identifying the Culprits Behind Network Congestion.

[BibT_eX]

[DOI]

Abhinav Bhatele

Andrew R. Titus

Jayaraman J. Thiagarajan

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Practical Resource Management in Power-Constrained, High Performance Computing.

[BibT_eX]

[DOI]

Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

POW: System-wide Dynamic Reallocation of Limited Power in HPC.

[BibT_eX]

[DOI]

Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

Event-Action Mappings for Parallel Tools Infrastructures.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2015: Parallel Processing, 2015

Distributed Monitoring and Management of Exascale Systems in the Argo Project.

[BibT_eX]

[DOI]

Proceedings of the Distributed Applications and Interoperable Systems, 2015

An Approach to Selecting Thread + Process Mixes for Hybrid MPI + OpenMP Applications.

[BibT_eX]

[DOI]

Hormozd Gahvari

Ulrike Meier Yang

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014

Combing the Communication Hairball: Visualizing Parallel Execution Traces using Logical Time.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., 2014

Enabling fair pricing on high performance computer systems with node sharing.

[BibT_eX]

[DOI]

Sci. Program., 2014

Connecting Performance Analysis and Visualization to Advance Extreme Scale Computing (Dagstuhl Perspectives Workshop 14022).

[BibT_eX]

[DOI]

Dagstuhl Reports, 2014

State of the Art of Performance Visualization.

[BibT_eX]

[DOI]

Proceedings of the 16th Eurographics Conference on Visualization, 2014

Towards providing low-overhead data race detection for large OpenMP applications.

[BibT_eX]

[DOI]

Proceedings of the 2014 LLVM Compiler Infrastructure in HPC, 2014

Algebraic Multigrid on a Dragonfly Network: First Experiences on a Cray XC30.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

Evaluating User-Level Fault Tolerance for MPI Applications.

[BibT_eX]

[DOI]

Proceedings of the 21st European MPI Users' Group Meeting, 2014

Exploring the Capabilities of the New MPI_T Interface.

[BibT_eX]

[DOI]

Tanzima Z. Islam

Proceedings of the 21st European MPI Users' Group Meeting, 2014

Extracting logical structure and identifying stragglers in parallel execution traces.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Accurate application progress analysis for large-scale parallel debugging.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2014

Overcoming the Scalability Challenges of Epidemic Simulations on Blue Waters.

[BibT_eX]

[DOI]

Lukasz Wesolowski

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

MPI Runtime Error Detection with MUST: A Scalable and Crash-Safe Approach.

[BibT_eX]

[DOI]

Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

Flux: A Next-Generation Resource Management Framework for Large HPC Centers.

[BibT_eX]

[DOI]

Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

Adaptive Configuration Selection for Power-Constrained Heterogeneous Systems.

[BibT_eX]

[DOI]

Proceedings of the 43rd International Conference on Parallel Processing, 2014

Exploiting redundancy for cost-effective, time-constrained execution of HPC applications on amazon EC2.

[BibT_eX]

[DOI]

Aniruddha Marathe

Rachel Harris

Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Modeling the Impact of Reduced Memory Bandwidth on HPC Applications.

[BibT_eX]

[DOI]

Ananta Tiwari

Anthony Gamst

Michael A. Laurenzano

Laura Carrington

Proceedings of the Euro-Par 2014 Parallel Processing, 2014

Memory Usage Optimizations for Online Event Analysis.

[BibT_eX]

[DOI]

Proceedings of the Solving Software Challenges for Exascale, 2014

2013

Strategies for Energy-Efficient Resource Management of Hybrid Programming Models.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2013

Parallelizing heavyweight debugging tools with mpiecho.

[BibT_eX]

[DOI]

Parallel Comput., 2013

LIBI: A framework for bootstrapping extreme scale software systems.

[BibT_eX]

[DOI]

Matthew P. LeGendre

Parallel Comput., 2013

A study of application-level recovery methods for transient network faults.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2013

Enabling fair pricing on HPC systems with node sharing.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2013

Overcoming extreme-scale reproducibility challenges through a unified, targeted, and multilevel toolset.

[BibT_eX]

[DOI]

Zvonimir Rakamaric

Proceedings of the 1st International Workshop on Software Engineering for High Performance Computing in Computational Science and Engineering, 2013

Runtime MPI collective checking with tree-based overlay networks.

[BibT_eX]

[DOI]

Alexandre E. Eichenberger

Proceedings of the 20th European MPI Users's Group Meeting, 2013

Performance Analysis Techniques for the Exascale Co-Design Process.

[BibT_eX]

[DOI]

Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

OMPT: An OpenMP Tools Application Programming Interface for Performance Analysis.

[BibT_eX]

[DOI]

John M. Mellor-Crummey

Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application.

[BibT_eX]

[DOI]

Ian Karlin

Abhinav Bhatele

Jeff Keasler

Bradford L. Chamberlain

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Systematic Reduction of Data Movement in Algebraic Multigrid Solvers.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Efficient and Scalable Retrieval Techniques for Global File Properties.

[BibT_eX]

[DOI]

Michael J. Brim

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Exploring hardware overprovisioning in power-constrained, high performance computing.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Supercomputing, 2013

Intralayer Communication for Tree-Based Overlay Networks.

[BibT_eX]

[DOI]

Proceedings of the 42nd International Conference on Parallel Processing, 2013

A comparative study of high-performance computing on the cloud.

[BibT_eX]

[DOI]

Aniruddha Marathe

Rachel Harris

Xin Yuan

Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013

Alignment-Based Metrics for Trace Comparison.

[BibT_eX]

[DOI]

Matthias Weber

Holger Brunst

Proceedings of the Euro-Par 2013 Parallel Processing, 2013

2012

Visualizing Network Traffic to Understand the Performance of Massively Parallel Simulations.

[BibT_eX]

[DOI]

IEEE Trans. Vis. Comput. Graph., 2012

What scientific applications can benefit from hardware transactional memory?

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Characterizing and mitigating work time inflation in task parallel programs.

[BibT_eX]

[DOI]

Stephen Olivier

Jan F. Prins

Proceedings of the SC Conference on High Performance Computing Networking, 2012

MPI runtime error detection with MUST: advances in deadlock detection.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Performance Modeling of Algebraic Multigrid on Blue Gene/Q: Lessons Learned.

[BibT_eX]

[DOI]

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Mapping applications with collectives over sub-communicators on torus networks.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Novel views of performance data to analyze large-scale adaptive applications.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

MPI Runtime Error Detection with MUST: Advanced Error Reports.

[BibT_eX]

[DOI]

Proceedings of the Tools for High Performance Computing 2012, 2012

The myrmics memory allocator: hierarchical, message-passing allocation for global address spaces.

[BibT_eX]

[DOI]

Spyros Lyberis

Polyvios Pratikakis

Proceedings of the International Symposium on Memory Management, 2012

Beyond DVFS: A First Look at Performance under a Hardware-Enforced Power Bound.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

GTI: A Generic Tools Infrastructure for Event-Based Tools in Parallel Systems.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Scalable Critical-Path Based Performance Analysis.

[BibT_eX]

[DOI]

David Böhme

Felix Wolf

Markus Geimer

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Quantifying the effectiveness of load balance algorithms.

[BibT_eX]

[DOI]

Olga Pearce

Nancy M. Amato

Proceedings of the International Conference on Supercomputing, 2012

Fault resilience of the algebraic multi-grid solver.

[BibT_eX]

[DOI]

Marc Casas-Guix

Karthikeyan Sankaralingam

Proceedings of the International Conference on Supercomputing, 2012

Mechanisms and Evaluation of Cross-Layer Fault-Tolerance for Supercomputing.

[BibT_eX]

[DOI]

Chen-Han Ho

Marc de Kruijf

Proceedings of the 41st International Conference on Parallel Processing, 2012

Modeling the Performance of an Algebraic Multigrid Cycle Using Hybrid MPI/OpenMP.

[BibT_eX]

[DOI]

Proceedings of the 41st International Conference on Parallel Processing, 2012

2011

Checkpointing.

[BibT_eX]

[DOI]

Proceedings of the Encyclopedia of Parallel Computing, 2011

Formal analysis of MPI-based parallel programs.

[BibT_eX]

[DOI]

Commun. ACM, 2011

Large scale debugging of parallel tasks with AutomaDeD.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2011

Order Preserving Event Aggregation in TBONs.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2011

Creating a Tool Set for Optimizing Topology-Aware Node Mappings.

[BibT_eX]

[DOI]

Proceedings of the Tools for High Performance Computing 2011, 2011

Reconciling Sampling and Direct Instrumentation for Unintrusive Call-Path Profiling of MPI Programs.

[BibT_eX]

[DOI]

Zoltán Szebenyi

Felix Wolf

Brian J. N. Wylie

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Exploiting Data Similarity to Reduce Memory Footprints.

[BibT_eX]

[DOI]

Susmit Biswas

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Challenges of Scaling Algebraic Multigrid Across Modern Multicore Architectures.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Modeling the performance of an algebraic multigrid cycle on HPC platforms.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011

Interpreting Performance Data across Intuitive Domains.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Processing, 2011

Practical performance prediction under Dynamic Voltage Frequency Scaling.

[BibT_eX]

[DOI]

Proceedings of the 2011 International Green Computing Conference and Workshops, 2011

Scalable memory registration for high performance networks using helper threads.

[BibT_eX]

[DOI]

Proceedings of the 8th Conference on Computing Frontiers, 2011

Large Scale Verification of MPI Programs Using Lamport Clocks with Lazy Update.

[BibT_eX]

[DOI]

Anh Vo

Robert M. Kirby

Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010

Transforming MPI source code based on communication patterns.

[BibT_eX]

[DOI]

Robert Preissl

Future Gener. Comput. Syst., 2010

On the Performance of an Algebraic Multigrid Solver on Multicore Clusters.

[BibT_eX]

[DOI]

Allison H. Baker

Ulrike Meier Yang

Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

A Scalable and Distributed Dynamic Formal Verifier for MPI Programs.

[BibT_eX]

[DOI]

Anh Vo

Sriram Aananthakrishnan

Proceedings of the Conference on High Performance Computing Networking, 2010

ScalaTrace: Tracing, Analysis and Modeling of HPC Codes at Scale.

[BibT_eX]

[DOI]

Frank Mueller

Xing Wu

Proceedings of the Applied Parallel and Scientific Computing, 2010

Hybrid MPI/OpenMP power-aware computing.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Power-aware MPI task aggregation prediction for high-end computing systems.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Using focused regression for accurate time-constrained scaling of scientific applications.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Clustering performance data efficiently at massive scales.

[BibT_eX]

[DOI]

Robert J. Fowler

Daniel A. Reed

Proceedings of the 24th International Conference on Supercomputing, 2010

Exploitation of Dynamic Communication Patterns through Static Analysis.

[BibT_eX]

[DOI]

Robert Preissl

Proceedings of the 39th International Conference on Parallel Processing, 2010

Comparing Scalability Prediction Strategies on an SMP of CMPs.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

AutomaDeD: Automata-based debugging for dissimilar parallel tasks.

[BibT_eX]

[DOI]

Saurabh Bagchi

Proceedings of the 2010 IEEE/IFIP International Conference on Dependable Systems and Networks, 2010

10181 Executive Summary - Program Development for Extreme-Scale Computing.

[BibT_eX]

[DOI]

Proceedings of the Program Development for Extreme-Scale Computing, 02.05. - 07.05.2010, 2010

10181 Abstracts Collection - Program Development for Extreme-Scale Computing.

[BibT_eX]

[DOI]

Proceedings of the Program Development for Extreme-Scale Computing, 02.05. - 07.05.2010, 2010

Scaling Algebraic Multigrid Solvers: On the Road to Exascale.

[BibT_eX]

[DOI]

Proceedings of the Competence in High Performance Computing 2010, 2010

2009

ScalaTrace: Scalable compression and replay of communication traces for high-performance computing.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2009

Scalable temporal order analysis for large scale debugging.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

8th International Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments.

[BibT_eX]

[DOI]

Michael Bader

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

MUST: A Scalable Approach to Runtime Error Detection in MPI Programs.

[BibT_eX]

[DOI]

Proceedings of the Tools for High Performance Computing 2009, 2009

PSMalloc: content based memory management for MPI applications.

[BibT_eX]

[DOI]

Proceedings of the 10th workshop on MEmory performance, 2009

Machine learning based online performance prediction for runtime parallelization and task scheduling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2009

Adagio: making DVS practical for complex HPC applications.

[BibT_eX]

[DOI]

Vincent W. Freeh

Tyler K. Bletsch

Proceedings of the 23rd international conference on Supercomputing, 2009

A graph based approach for MPI deadlock detection.

[BibT_eX]

[DOI]

Proceedings of the 23rd international conference on Supercomputing, 2009

2008

Efficient architectural design space exploration via predictive modeling.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2008

Open | SpeedShop: An open source infrastructure for parallel performance analysis.

[BibT_eX]

[DOI]

Sci. Program., 2008

BlueGene/L applications: Parallelism On a Massive Scale.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2008

Lessons learned at 208K: towards debugging millions of cores.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Scalable load-balance measurement for SPMD codes.

[BibT_eX]

[DOI]

Robert J. Fowler

Daniel A. Reed

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

7th International Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments: New Directions and Work-in-Progress (ParSim 2008).

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

On the Performance of Transparent MPI Piggyback Messages.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Preserving time in large-scale communication traces.

[BibT_eX]

[DOI]

Prasun Ratn

Frank Mueller

Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

A regression-based approach to scalability prediction.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Detecting Patterns in MPI Communication Traces.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Parallel Processing, 2008

Overcoming Scalability Challenges for Tool Daemon Launching.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Parallel Processing, 2008

Using MPI Communication Patterns to Guide Source Code Transformations.

[BibT_eX]

[DOI]

Robert Preissl

Proceedings of the Computational Science, 2008

Topic 2: Performance Prediction and Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2008, 2008

Prediction models for multi-dimensional power-performance optimization on many cores.

[BibT_eX]

[DOI]

Matthew Curtis-Maury

Ankur Shah

Filip Blagojevic

Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, 2008

2007

Dynamic Binary Instrumentation and Data Aggregation on Large Scale Systems.

[BibT_eX]

[DOI]

Steven Y. Ko

Int. J. Parallel Program., 2007

Predicting parallel application performance via machine learning approaches.

[BibT_eX]

[DOI]

Karan Singh

Rich Caruana

Concurr. Comput. Pract. Exp., 2007

PNMPI tools: a whole lot greater than the sum of their parts.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

Bounding energy consumption in large-scale MPI programs.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

6th International Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments New Directions and Work-in-Progress ParSim 2007.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Methods of inference and learning for performance modeling of parallel applications.

[BibT_eX]

[DOI]

Benjamin C. Lee

David M. Brooks

Karan Singh

Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2007

Benchmarking the Stack Trace Analysis Tool for BlueGene/L.

[BibT_eX]

Proceedings of the Parallel Computing: Architectures, 2007

Scalable Compression and Replay of Communication Traces in Massively P arallel E nvironments.

[BibT_eX]

[DOI]

Michael Noeth

Frank Mueller

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Stack Trace Analysis for Large Scale Debugging.

[BibT_eX]

[DOI]

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Practical Differential Profiling.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2007, 2007

Identifying energy-efficient concurrency levels using machine learning.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

2006

Poster reception - Scalable compression and replay of communication traces in massively parallel environments.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Gordon Bell finalists I - Large-scale electronic structure calculations of high-Z metals on the BlueGene/L platform.

[BibT_eX]

[DOI]

François Gygi

Erik W. Draeger

Christoph W. Ueberhuber

Juergen Lorenz

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Poster reception - Patterns in parallel programs: toward high-level understanding of large-scale traces.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

5th International Special Session on Current Trends in Numerical Simulation for Parallel Engineering Environments.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Improving distributed memory applications testing by message perturbation.

[BibT_eX]

[DOI]

Richard W. Vuduc

Andreas Sæbjørnsen

Proceedings of the 4th Workshop on Parallel and Distributed Systems: Testing, 2006

Dynamic program phase detection in distributed shared-memory multiprocessors.

[BibT_eX]

[DOI]

José F. Martínez

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

A Flexible and Dynamic Infrastructure for MPI Tool Interoperability.

[BibT_eX]

[DOI]

Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006

Exploring Unexpected Behavior in MPI.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing and Communications, 2006

Efficiently exploring architectural design spaces via predictive modeling.

[BibT_eX]

[DOI]

Rich Caruana

Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, 2006

2005

Scalable dynamic binary instrumentation for Blue Gene/L.

[BibT_eX]

[DOI]

Andrew Bernat

Steven Y. Ko

SIGARCH Comput. Archit. News, 2005

Simulation as a tool for optimizing memory accesses on NUMA machines.

[BibT_eX]

[DOI]

Perform. Evaluation, 2005

Monitoring cache behavior on parallel SMP architectures and related programming tools.

[BibT_eX]

[DOI]

Ralph Müller-Pfefferkorn

Future Gener. Comput. Syst., 2005

4th International Special Session on: Current Trends in Numerical Simulation for Parallel Engineering Environments ParSim 2005.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Improving the computational intensity of unstructured mesh applications.

[BibT_eX]

[DOI]

Brian S. White

Brian Miller

Proceedings of the 19th Annual International Conference on Supercomputing, 2005

DynTG: A Tool for Interactive, Dynamic Instrumentation.

[BibT_eX]

[DOI]

John May

John C. Gyllenhaal

Proceedings of the Computational Science, 2005

An Approach to Performance Prediction for Parallel Applications.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

Extracting Critical Path Graphs from MPI Applications.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Cluster Computing (CLUSTER 2005), September 26, 2005

Owl: next generation system monitoring.

[BibT_eX]

[DOI]

Proceedings of the Second Conference on Computing Frontiers, 2005

2004

SIMT/OMP: A Toolset to Study and Exploit Memory Locality of OpenMP Applications on NUMA Architectures.

[BibT_eX]

[DOI]

Proceedings of the Shared Memory Parallel Programming with OpenMP, 2004

Implementation and Evaluation of a Scalable Application-Level Checkpoint-Recovery Scheme for MPI Programs.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004

Current Trends in Numerical Simulation for Parallel Engineering Environments. ParSim 2004.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Application-level checkpointing for shared memory programs.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on Architectural Support for Programming Languages and Operating Systems, 2004

SimSnap: Fast-Forwarding via Native Execution and Application-Level Checkpointing.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Workshop on Interaction between Compilers and Computer Architecture (INTERACT-8 2004), 2004

2003

Pathways of Relevance: Exploring Inflows of Knowledge into Subunits of Multinational Corporations.

[BibT_eX]

[DOI]

Organ. Sci., 2003

ARS: an adaptive runtime system for locality optimization.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2003

SMiLE: an integrated, multi-paradigm software infrastructure for SCI-basedclusters.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2003

Interactive Locality Optimization on NUMA Architectures.

[BibT_eX]

[DOI]

Proceedings of the Proceedings ACM 2003 Symposium on Software Visualization, 2003

Identifying and Exploiting Spatial Regularity in Data Memory References.

[BibT_eX]

[DOI]

Tushar Mohan

Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

Special Session of EuroPVM/MPI 2003: Current Trends in Numerical Simulation for Parallel Engineering Environments - ParSim 2003.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface,10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29, 2003

A Framework for Portable Shared Memory Programming.

[BibT_eX]

[DOI]

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

CAD Grid: Corporate-Wide Resource Sharing for Parameter Studies.

[BibT_eX]

[DOI]

Ed Wheelhouse

Proceedings of the Euro-Par 2003. Parallel Processing, 2003

A Simulation Tool for Evaluating Shared Memory Systems.

[BibT_eX]

[DOI]

Proceedings of the Proceedings 36th Annual Simulation Symposium (ANSS-36 2003), Orlando, Florida, USA, March 30, 2003

2002

Memory access behavior analysis of NUMA-based shared memory programs.

[BibT_eX]

[DOI]

Sci. Program., 2002

A Comprehensive Electric Field Simulation Environment on Top of SCI.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 9th European PVM/MPI Users' Group Meeting, Linz, Austria, September 29, 2002

Current Trends in Numerical Simulation for Parallel Engineering Environments.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 9th European PVM/MPI Users' Group Meeting, Linz, Austria, September 29, 2002

Notes on Nondeterminism in Message Passing Programs.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 9th European PVM/MPI Users' Group Meeting, Linz, Austria, September 29, 2002

Performance Analysis for Teraflop Computers: A Distributed Automatic Approach.

[BibT_eX]

[DOI]

Proceedings of the 10th Euromicro Workshop on Parallel, 2002

Boosting the Performance of Electromagnetic Simulations on a PC-Cluster.

[BibT_eX]

[DOI]

Proceedings of the 2002 International Conference on Parallel Computing in Electrical Engineering (PARELEC 2002), 2002

A proposal for a new hardware cache monitoring architecture.

[BibT_eX]

[DOI]

Proceedings of The Workshop on Memory Systems Performance (MSP 2002), 2002

Improving Data Locality Using Dynamic Page Migration Based on Memory Access Histograms.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2002, 2002

Using Semantic Information to Guide Efficient Parallel I/O on Clusters.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing (HPDC-11 2002), 2002

SMiLE: An Integrated, Multi-Paradigm Software Infrastructure for SCI-Based Clusters.

[BibT_eX]

[DOI]

Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2002), 2002

Overcoming the Problems Associated with the Existence of Too Many DSM APIs.

[BibT_eX]

[DOI]

Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2002), 2002

2001

Shared memory programming on NUMA-based clusters using a general and open hybrid hardware, software Approach.

[BibT_eX]

[DOI]

PhD thesis, 2001

Parallel Volume Rendering based on Isosurface Extraction using Commodity Clusters.

[BibT_eX]

Proceedings of the IASTED International Conference on Visualization, 2001

SCI-Based LINUX PC-Clusters as a Platform for Electromagnetic Field Calculations.

[BibT_eX]

[DOI]

Proceedings of the Parallel Computing Technologies, 2001

Visualizing the Memory Access Behavior of Shared Memory Applications on NUMA Architectures.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2001, 2001

Meeting the Computational Demands of Nuclear Medical Imaging Using Commodity Clusters.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2001, 2001

2000

Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS 2000).

[BibT_eX]

[DOI]

Proceedings of the Parallel and Distributed Processing, 2000

Using the SMiLE Monitoring Infrastructure to Detect and Lower the Inefficiency of Parallel Applications.

[BibT_eX]

[DOI]

Proceedings of the High-Performance Computing and Networking, 8th International Conference, 2000

NEPHEW: Applying a Toolset for the Efficient Deployment of a Medical Image Application on SCI-Based Clusters.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

Multilayer Online-Monitoring for Hybrid DSM Systems on Top of PC Clusters with a SMiLE.

[BibT_eX]

[DOI]

Jörg Trinitis

Proceedings of the Computer Performance Evaluation: Modelling Techniques and Tools, 2000

Multithreaded Programming of PC Clusters.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques (PACT'00), 2000

1999

True Shared Memory Programming on SCI-Based Clusters.

[BibT_eX]

[DOI]

Proceedings of the SCI: Scalable Coherent Interface, 1999

SCI-VM: A Flexible Base for Transparent Shared Memory Programming Models on Clusters of PCs.

[BibT_eX]

[DOI]

Proceedings of the Parallel and Distributed Processing, 1999

Supporting Shared Memory and Message Passing on Clusters of PCs with a SMiLE.

[BibT_eX]

[DOI]

Markus Leberecht

Proceedings of the Network-Based Parallel Computing: Communication, 1999

Optimizing Data Locality for SCI-Based PC-Clusters with the SmiLE Monitoring Approach.

[BibT_eX]

[DOI]

Markus Leberecht

Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999

1998

SISCI-Pthreads, SMP-like programming on an SCI-cluster.

[BibT_eX]

[DOI]