Sascha Hunold

Orcid: 0000-0002-5280-3855

Affiliations:
  • TU Wien, Austria


According to our database1, Sascha Hunold authored at least 73 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations.
CoRR, 2024

2023
Using Mixed-Radix Decomposition to Enumerate Computational Resources of Deeply Hierarchical Architectures.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Verifying Performance Guidelines for MPI Collectives at Scale.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Synchronizing MPI Processes in Space and Time.
Proceedings of the 30th European MPI Users' Group Meeting, 2023

Exploring Mapping Strategies for Co-allocated HPC Applications.
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023

Algorithm Selection of MPI Collectives Considering System Utilization.
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023

Uniform Algorithms for Reduce-scatter and (most) other Collectives for MPI.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022
OMPICollTune: Autotuning MPI Collectives by Incremental Online Learning.
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022

mpisee: MPI Profiling for Communication and Communicator Structure.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

An Overhead Analysis of MPI Profiling and Tracing Tools.
Proceedings of the PERMAVOST@HPDC 2022: Proceedings of the 2nd Workshop on Performance EngineeRing, 2022

A Quantitative Analysis of OpenMP Task Runtime Systems.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2022

2021
MPI collective communication through a single set of interfaces: A case for orthogonality.
Parallel Comput., 2021

MicroBench Maker: Reproduce, Reuse, Improve.
Proceedings of the 2021 International Workshop on Performance Modeling, 2021

Teaching Complex Scheduling Algorithms.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

2020
Scheduling.jl - Collaborative and Reproducible Scheduling Research with Julia.
CoRR, 2020

Collectives and Communicators: A Case for Orthogonality: (Or: How to get rid of MPI neighbor and enhance Cartesian collectives).
Proceedings of the EuroMPI/USA '20: 27th European MPI Users' Group Meeting, 2020

Benchmarking Julia's Communication Performance: Is Julia HPC ready or Full HPC?
Proceedings of the 2020 IEEE/ACM Performance Modeling, 2020

Decomposing MPI Collectives for Exploiting Multi-lane Communication.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

Efficient Process-to-Node Mapping Algorithms for Stencil Computations.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

Predicting MPI Collective Communication Performance Using Machine Learning.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019
LigandScout Remote: A New User-Friendly Interface for HPC and Cloud Resources.
J. Chem. Inf. Model., 2019

Cartesian Collective Communication.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
Algorithm Selection of MPI Collectives Using Machine Learning Techniques.
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

Autotuning MPI Collectives using Performance Guidelines.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2018

Hierarchical Clock Synchronization in MPI.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017
Scheduling Independent Moldable Tasks on Multi-Cores with GPUs.
IEEE Trans. Parallel Distributed Syst., 2017

On expected and observed communication performance with MPI derived datatypes.
Parallel Comput., 2017

Tuning MPI Collectives by Verifying Performance Guidelines.
CoRR, 2017

Introduction to REPPAR Workshop.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Predicting the Energy-Consumption of MPI Applications at Scale Using Only a Single Node.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016
Reproducible MPI Benchmarking is Still Not as Easy as You Think.
IEEE Trans. Parallel Distributed Syst., 2016

Message-Combining Algorithms for Isomorphic, Sparse Collective Communication.
CoRR, 2016

PGMPI: Automatically Verifying Self-Consistent MPI Performance Guidelines.
CoRR, 2016

MPI Derived Datatypes: Performance Expectations and Status Quo.
CoRR, 2016

Automatic Verification of Self-consistent MPI Performance Guidelines.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

2015
MPI Benchmarking Revisited: Experimental Design and Reproducibility.
CoRR, 2015

A Survey on Reproducibility in Parallel Computing.
CoRR, 2015

One step toward bridging the gap between theory and practice in moldable task scheduling with precedence constraints.
Concurr. Comput. Pract. Exp., 2015

Isomorphic, Sparse MPI-like Collective Communication Operations for Parallel Stencil Computations.
Proceedings of the 22nd European MPI Users' Group Meeting, 2015

On the Impact of Synchronizing Clocks and Processes on Benchmarking MPI Collectives.
Proceedings of the 22nd European MPI Users' Group Meeting, 2015

2014
Fair scheduling of bag-of-tasks applications using distributed Lagrangian optimization.
J. Parallel Distributed Comput., 2014

Reproducible MPI Micro-Benchmarking Isn't As Easy As You Think.
Proceedings of the 21st European MPI Users' Group Meeting, 2014

Implementing a classic: zero-copy all-to-all communication with mpi datatypes.
Proceedings of the 2014 International Conference on Supercomputing, 2014

2013
On the State and Importance of Reproducible Experimental Research in Parallel Computing.
CoRR, 2013

Scheduling Moldable Tasks with Precedence Constraints and Arbitrary Speedup Functions on Multiprocessors.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

2011
From Simulation to Experiment: A Case Study on Multiprocessor Task Scheduling.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Evolutionary Scheduling of Parallel Tasks Graphs onto Homogeneous Clusters.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

2010
Jedule: A Tool for Visualizing Schedules of Parallel Applications.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Combining Object-Oriented Design and SOA with Remote Objects over Web Services.
Proceedings of the 8th IEEE European Conference on Web Services (ECOWS 2010), 2010

Low-Cost Tuning of Two-Step Algorithms for Scheduling Mixed-Parallel Applications onto Homogeneous Clusters.
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

BPEL Remote Objects: Integrating BPEL Processes into Object-Oriented Applications.
Proceedings of the 2010 IEEE International Conference on Services Computing, 2010

2009
Evaluation der Leistungsfähigkeit von gemischt-parallelen Programmen in homogenen und heterogenen Umgebungen unter Berücksichtigung effizienter Schedulingstrategien.
PhD thesis, 2009

Workshop Modellgetriebene Softwarearchitektur - Evolution, Integration und Migration (MSEIM2009).
Proceedings of the Software Engineering 2009, 2009

Modellgetriebene Softwarearchitektur - Evolution, Integration und Migration (MSEIM 2009).
Proceedings of the Software Engineering 2009: Fachtagung des GI-Fachbereichs Softwaretechnik 02.-06.03. 2009 in Kaiserslautern, 2009

Softwaremodernisierung durch werkzeugunterstütztes Verschieben von Codeblöcken.
Proceedings of the Software Engineering 2009, 2009

Reducing the Class Coupling of Legacy Code by a Metrics-Based Relocation of Class Members.
Proceedings of the Advances in Software Engineering Techniques, 2009

Load Balancing Concurrent BPEL Processes by Dynamic Selection of Web Service Endpoints.
Proceedings of the ICPPW 2009, 2009

Pattern-Based Refactoring of Legacy Software Systems.
Proceedings of the Enterprise Information Systems, 11th International Conference, 2009

2008
Combining building blocks for parallel multi-level matrix multiplication.
Parallel Comput., 2008

Inkrementelle Transformation einer monolithischen Geschäftssoftware.
Proceedings of the Software Engineering 2008, 2008

Workshop Modellgetriebene Softwarearchitektur - Evolution, Integration und Migration (MSEIM 2008).
Proceedings of the Software Engineering 2008, 2008

Workshop Modellgetriebene Softwarearchitektur -Evolution, Integration und Migration.
Proceedings of the Software Engineering 2008. Fachtagung des GI-Fachbereichs Softwaretechnik, 2008

Transformation of Legacy Software into Client/Server Applications through Pattern-Based Rearchitecturing.
Proceedings of the 32nd Annual IEEE International Computer Software and Applications Conference, 2008

Redistribution aware two-step scheduling for mixed-parallel applications.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

Scheduling Dynamic Workflows onto Clusters of Clusters using Postponing.
Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

2007
Sequential and parallel implementation of a constraint-based algorithm for searching protein structures.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Dynamic scheduling of multi-processor tasks on clusters of clusters.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

2006
Design and Evaluation of a Parallel Data Redistribution Component for TGrid.
Proceedings of the Parallel and Distributed Processing and Applications, 2006

TGrid - Grid runtime support for hierarchically structured task-parallel programs.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

2005
Reducing the Overhead of Intra-Node Communication in Clusters of SMPs.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

Automatic Tuning of PDGEMM Towards Optimal Performance.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

2004
Multilevel hierarchical matrix multiplication on clusters.
Proceedings of the 18th Annual International Conference on Supercomputing, 2004

Hierarchical Matrix-Matrix Multiplication Based on Multiprocessor Tasks.
Proceedings of the Computational Science, 2004


  Loading...