Gudula Rünger

Orcid: 0000-0002-5364-2088

Affiliations:
  • Chemnitz University of Technology, Germany


According to our database1, Gudula Rünger authored at least 225 papers between 1989 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Analyzing Data Reordering of a combined MPI and AVX execution of a Jacobi Method.
Proceedings of the 31st Euromicro International Conference on Parallel, 2023

Performance and Energy Evaluation for Solving a Schrödinger-Poisson System on Multicore Processors.
Proceedings of the Computer Performance Engineering and Stochastic Modelling, 2023

Parallel Programming - for Multicore and Cluster Systems, Third Edition
Springer, ISBN: 978-3-031-28923-1, 2023

2022
Message from the PDSEC-22 Workshop Chairs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

2021
A performance- and energy-oriented extended tuning process for time-step-based scientific applications.
J. Supercomput., 2021

Performance and energy consumption of a Gram-Schmidt process for vector orthogonalization on a processor integrated GPU.
Sustain. Comput. Informatics Syst., 2021

Modeling the effect of application-specific program transformations on energy and performance improvements of parallel ODE solvers.
J. Comput. Sci., 2021

A Workflow-Based Support for the Automatic Creation and Selection of Energy-Efficient Task-Schedules on DVFS Processors.
Proceedings of Sixth International Congress on Information and Communication Technology, 2021

2020
Performance and energy consumption of the SIMD Gram-Schmidt process for vector orthogonalization.
J. Supercomput., 2020

The search-based scheduling algorithm HP* for parallel tasks on heterogeneous platforms.
Concurr. Comput. Pract. Exp., 2020

A Parameter Selection Process by Data Analysis for Tuning Multi-threaded Time-Stepping Algorithms.
Proceedings of the 2020 Seventh International Conference on Software Defined Systems, 2020

Performance and efficiency investigations of SIMD programs of Coulomb solvers on multi-and many-core systems with vector units.
Proceedings of the 28th Euromicro International Conference on Parallel, 2020

Workshop 13: PDSEC Parallel and Distributed Scientific and Engineering Computing.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Model-based optimization of the energy efficiency of multi-threaded applications.
Sustain. Comput. Informatics Syst., 2019

A scheduling selection process for energy-efficient task execution on DVFS processors.
Concurr. Comput. Pract. Exp., 2019

Multiprocessor Task Programming and Flexible Load Balancing for Time-Stepping Methods on Heterogeneous Cloud Infrastructures.
Proceedings of the 2019 IEEE SmartWorld, 2019

Enabling Scalability, Adaptivity, and Resilience in Cloud Applications by Software-defined M-Task-based Programming.
Proceedings of the 6th International Conference on Software Defined Systems, 2019

DVFS RK: Performance and Energy Modeling of Frequency-Scaled Multithreaded Runge-Kutta Methods.
Proceedings of the 27th Euromicro International Conference on Parallel, 2019

A Web-Based Support for the Management and Evaluation of Measurement Data from Stress-Strain and Continuous-Cooling-Transformation Experiments.
Proceedings of the Information Systems Architecture and Technology: Proceedings of 40th Anniversary International Conference on Information Systems Architecture and Technology - ISAT 2019, 2019

Introduction to PDSEC-19.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

On the Energy Consumption and Accuracy of Multithreaded Embedded Runge-Kutta Methods.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

Performance and Energy Evaluation of Parallel Particle Simulation Algorithms for Different Input Particle Data.
Proceedings of the Position Papers of the 2019 Federated Conference on Computer Science and Information Systems, 2019

Search-Based Scheduling for Parallel Tasks on Heterogeneous Platforms.
Proceedings of the Euro-Par 2019: Parallel Processing Workshops, 2019


2018
Performance and energy metrics for multi-threaded applications on DVFS processors.
Sustain. Comput. Informatics Syst., 2018

Tuning linear algebra for energy efficiency on multicore machines by adapting the ATLAS library.
Future Gener. Comput. Syst., 2018

Flexible all-to-all data redistribution methods for grid-based particle codes.
Concurr. Comput. Pract. Exp., 2018

Energy and Performance Analysis of Parallel Particle Solvers from the ScaFaCoS Library.
Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering, 2018

Exploring Self-Adaptivity Towards Performance and Energy for Time-Stepping Methods.
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

How do Loop Transformations Affect the Energy Consumption of Multi-Threaded Runge-Kutta Methods?
Proceedings of the 26th Euromicro International Conference on Parallel, 2018

A Hybrid CPU/GPU Implementation of Computationally Intensive Particle Simulations Using OpenCL.
Proceedings of the 17th International Symposium on Parallel and Distributed Computing, 2018

Introduction to PDSEC 2018 and Keynotes.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Energy and Performance Improvement of Parallel ODE Solvers by Application-Specific Program Transformations.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Symbolic Matrix Multiplication for Multithreaded Sparse GEMM Utilizing Sparse Matrix Formats.
Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

Examining Energy Efficiency of Vectorization Techniques Using a Gaussian Elimination.
Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

Analysis and Modeling of Resource Contention Effects based on Benchmark Applications.
Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

On the Autotuning Potential of Time-stepping methods from Scientific Computing.
Proceedings of the 2018 Federated Conference on Computer Science and Information Systems, 2018

On the energy consumption of Load/Store AVX instructions.
Proceedings of the 2018 Federated Conference on Computer Science and Information Systems, 2018

2017
Integrating Generic FEM Simulations into Complex Simulation Applications.
Scalable Comput. Pract. Exp., 2017

Comparison of Time and Energy Oriented Scheduling for Task-Based Programs.
Proceedings of the Parallel Processing and Applied Mathematics, 2017

Towards New Metrics for Appraising Performance and Energy Efficiency of Parallel Scientific Programs.
Proceedings of the 2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, 2017

Tuning Energy Effort and Execution Time of Application Software.
Proceedings of the Information Systems Architecture and Technology: Proceedings of 38th International Conference on Information Systems Architecture and Technology - ISAT 2017, 2017

Introduction to PDSEC Workshop.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Resource Contention Aware Execution of Multiprocessor Tasks on Heterogeneous Platforms.
Proceedings of the Euro-Par 2017: Parallel Processing Workshops, 2017

2016
Water-Level scheduling for parallel tasks in compute-intensive application components.
J. Supercomput., 2016

HeteroPar 2014, APCIE 2014, and TASUS 2014 Special Issue.
Concurr. Comput. Pract. Exp., 2016

PDSEC Introduction and Committees.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Reducing the Power Consumption of Matrix Multiplications by Vectorization.
Proceedings of the 2016 IEEE Intl Conference on Computational Science and Engineering, 2016

Transparent Redirection of File-Based Data Accesses for Distributed Scientific Applications.
Proceedings of the 2016 IEEE Intl Conference on Computational Science and Engineering, 2016

2015
Applications for ultrascale computing.
Supercomput. Front. Innov., 2015

Energy-efficient Algorithms for Ultrascale Systems.
Supercomput. Front. Innov., 2015

Sustainability through flexibility: Building complex simulation programs for distributed computing systems.
Simul. Model. Pract. Theory, 2015

Modeling and analyzing the energy consumption of fork-join-based task parallel programs.
Concurr. Comput. Pract. Exp., 2015

PDSEC Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Towards energy-efficient linear algebra with an ATLAS library tuned for energy consumption.
Proceedings of the 2015 International Conference on High Performance Computing & Simulation, 2015

2014
Energy measurement, modeling, and prediction for processors with frequency scaling.
J. Supercomput., 2014

An execution time and energy model for an energy-aware execution of a conjugate gradient method with CPU/GPU collaboration.
J. Parallel Distributed Comput., 2014

Energy measurement and prediction for multi-threaded programs.
Proceedings of the 2014 Spring Simulation Multiconference, 2014

PDSEC Introduction and Committees.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013
Programming support and scheduling for communicating parallel tasks.
J. Parallel Distributed Comput., 2013

In-place algorithms for the symmetric all-to-all exchange with MPI.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

Execution Schemes for the NPB-MZ Benchmarks on Hybrid Architectures: A Comparative Study.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

PDSEC Introduction.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Efficient Data Redistribution Methods for Coupled Parallel Particle Codes.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

Dynamic Distribution of Workload Between CPU and GPU for a Parallel Conjugate Gradient Method in an Adaptive FEM.
Proceedings of the International Conference on Computational Science, 2013

Layer-Based Scheduling of Parallel Tasks for Heterogeneous Cluster Platforms.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013

High-Resolution Power Profiling of GPU Functions Using Low-Resolution Measurement.
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

Parallel Programming - for Multicore and Cluster Systems; 2nd Edition.
Springer, ISBN: 978-3-642-37800-3, 2013

2012
Combined scheduling and mapping for scalable computing with parallel tasks.
Sci. Program., 2012

SEParAT: scheduling support environment for parallel application task graphs.
Clust. Comput., 2012

An execution environment for flexible task-oriented software on multicore systems.
Concurr. Eng. Res. Appl., 2012

Analytical modeling and simulation of the energy consumption of independent tasks.
Proceedings of the Winter Simulation Conference, 2012

Interaction List Compression in Large Parallel Particle Simulations on Multicore Systems.
Proceedings of the 20th Euromicro International Conference on Parallel, 2012

Energy-Aware Execution of Fork-Join-Based Task Parallelism.
Proceedings of the 20th IEEE International Symposium on Modeling, 2012

PDSEC Introduction.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Prioritization of Product Requirements using the Analytic Hierarchy Process.
Proceedings of the ICEIS 2012 - Proceedings of the 14th International Conference on Enterprise Information Systems, Volume 2, Wroclaw, Poland, 28 June, 2012

Automatic Tuning of the Fast Multipole Method Based on Integrated Performance Prediction.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

Array-Based Reduction Operations for a Parallel Adaptive FEM.
Proceedings of the Facing the Multicore-Challenge, 2012

Towards an Energy Model for Modular Parallel Scientific Applications.
Proceedings of the 2012 IEEE International Conference on Green Computing and Communications, 2012

Parallele Programmierung, 3. Auflage.
eXamen.press, Springer, ISBN: 978-3-642-13603-0, 2012

2011
Beschleunigung physikalischer Simulationen durch Grafikprozessoren (Accelerating Physical Simulations Using Graphics Processing Units).
it Inf. Technol., 2011

Optimizing layer-based scheduling algorithms for parallel tasks with dependencies.
Concurr. Comput. Pract. Exp., 2011

Modeling the energy consumption for concurrent executions of parallel tasks.
Proceedings of the 2011 Spring Simulation Multi-conference, 2011

Component-based programming techniques for coarse-grained parallelism.
Proceedings of the 2011 Spring Simulation Multi-conference, 2011

Scheduling Support for Communicating Parallel Tasks.
Proceedings of the Languages and Compilers for Parallel Computing, 2011

Semi-dynamic Scheduling of Parallel Tasks for Heterogeneous Clusters.
Proceedings of the 10th International Symposium on Parallel and Distributed Computing, 2011

PDSEC Introduction.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

A Partitioning Algorithm for Parallel Sorting on Distributed Memory Systems.
Proceedings of the 13th IEEE International Conference on High Performance Computing & Communication, 2011

Introduction.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010
Fast recursive matrix multiplication for multi-core architectures.
Proceedings of the International Conference on Computational Science, 2010

An In-Place Algorithm for Irregular All-to-All Communication with Limited Memory.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

Simulating anomalous diffusion on graphics processing units.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Message from the PDSEC-10 workshop chairs.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Task-block identification and movement for layer-based scheduling algorithms.
Proceedings of the 2010 International Conference on High Performance Computing & Simulation, 2010

Adaptive Execution of Software Systems on Parallel Multicore Architectures.
Proceedings of the ICEIS 2010 - Proceedings of the 12th International Conference on Enterprise Information Systems, Volume 3, ISAS, Funchal, Madeira, Portugal, June 8, 2010

Software Architectures for Flexible Task-Oriented Program Execution on Multicore Systems.
Proceedings of the Complex Systems Design & Management, 2010

Flexible Workflows for an Energy-Oriented Product Development Process.
Proceedings of the INFORMATIK 2010 - Business Process and Service Science - Proceedings of ISSS and BPSC, September 27, 2010

Parallel Programming - for Multicore and Cluster Systems.
Springer, ISBN: 978-3-642-04817-3, 2010

2009
Softwaremodernisierung durch werkzeugunterstütztes Verschieben von Codeblöcken.
Proceedings of the Software Engineering 2009, 2009

Scalable computing with parallel tasks.
Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers, 2009

Fine-Grained Data Distribution Operations for Particle Codes.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Optimization of Layer-based Scheduling Algorithms for Mixed Parallel Applications with Precedence Constraints Using Move-blocks.
Proceedings of the 17th Euromicro International Conference on Parallel, 2009

Parallelization Strategies for ODE Solvers on Multicore Cluster Systems.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Message from the PDSEC-09 workshop chairs.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Reducing the Class Coupling of Legacy Code by a Metrics-Based Relocation of Class Members.
Proceedings of the Advances in Software Engineering Techniques, 2009

Pattern-Based Refactoring of Legacy Software Systems.
Proceedings of the Enterprise Information Systems, 11th International Conference, 2009

Parallelization Strategies for Mixed Regular-Irregular Applications on Multicore-Systems.
Proceedings of the Advanced Parallel Processing Technologies, 8th International Symposium, 2009

2008
Combining building blocks for parallel multi-level matrix multiplication.
Parallel Comput., 2008

An adaptive extension library for improving collective communication operations.
Concurr. Comput. Pract. Exp., 2008

Inkrementelle Transformation einer monolithischen Geschäftssoftware.
Proceedings of the Software Engineering 2008, 2008

MPI Reduction Operations for Sparse Floating-point Data.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

A Transformation Framework for Communicating Multiprocessor-Tasks.
Proceedings of the 16th Euromicro International Conference on Parallel, 2008

Performance effects of gram-schmidt orthogonalization on multi-core infiniband clusters.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Cache optimization for mixed regular and irregular computations.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Towards an adaptive task pool implementation.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Mapping Algorithms for Multiprocessor Tasks on Multi-Core Clusters.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

Models for Parallel Workflow Processing on Multi-Core Architectures.
Proceedings of the ICEIS 2008, 2008

Transformation of Legacy Software into Client/Server Applications through Pattern-Based Rearchitecturing.
Proceedings of the 32nd Annual IEEE International Computer Software and Applications Conference, 2008

2007
Modellgetriebene Transformation von Legacy Business-Software.
Softwaretechnik-Trends, 2007

Mixed task and data parallel executions in general linear methods.
Sci. Program., 2007

Performance Measurements and Analysis of the BlueGene/L MPI Implementation.
Proceedings of the Parallel Computing: Architectures, 2007

Layer-Based Scheduling Algorithms for Multiprocessor-Tasks with Precedence Constraints.
Proceedings of the Parallel Computing: Architectures, 2007

Communicating Multiprocessor-Tasks.
Proceedings of the Languages and Compilers for Parallel Computing, 2007

Incremental Transformation of Business Software.
Proceedings of the Enterprise Information Systems, 9th International Conference, 2007

Transformation of legacy business software into client-server architectures.
Proceedings of the ICEIS 2007, 2007

A Scheduling Toolkit for Multiprocessor-Task Programming with Dependencies.
Proceedings of the Euro-Par 2007, 2007

Library Support for Parallel Sorting in Scientific Computations.
Proceedings of the Euro-Par 2007, 2007

Dynamic scheduling of multi-processor tasks on clusters of clusters.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Parallele Programmierung, 2. Auflage.
eXamen.press, Springer, ISBN: 978-3-540-46549-2, 2007

2006
A Data re-distribution Library for Multi-processor Task Programming.
Int. J. Found. Comput. Sci., 2006

Task Pool Teams: a hybrid programming environment for irregular algorithms on SMP clusters.
Concurr. Comput. Pract. Exp., 2006

Optimizing MPI collective communication by orthogonal structures.
Clust. Comput., 2006

Design and Evaluation of a Parallel Data Redistribution Component for TGrid.
Proceedings of the Parallel and Distributed Processing and Applications, 2006

Combining Measures for Temporal and Spatial Locality.
Proceedings of the Frontiers of High Performance Computing and Networking, 2006

Anticipated distributed task scheduling for grid environments.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

A decomposition approach for optimizing the performance of MPI libraries.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Task Pool Teams Implementation of the Master Equation Approach for Random Sierpinski Carpets.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Topic 8: Distributed Systems and Algorithms.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

TGrid - Grid runtime support for hierarchically structured task-parallel programs.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

A Component Based Software Architecture for E-Government Applications.
Proceedings of the The First International Conference on Availability, 2006

2005
Tlib - a library to support programming with hierarchical multi-processor tasks.
J. Parallel Distributed Comput., 2005

Load imbalance aspects in atmosphere simulations.
Int. J. Comput. Sci. Eng., 2005

Modular construction of model partitioning processes for parallel logic simulation.
Int. J. Comput. Sci. Eng., 2005

Adaptive Selection of Communication Methods to Optimize Collective MPI Operations.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

M-Task-Programming for Heterogeneous Systems and Grid Environments.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Comparison of Different Parallel Modified Gram-Schmidt Algorithms.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005


2004
Improving locality for ODE solvers by program transformations.
Sci. Program., 2004

Derivation of a logarithmic time carry lookahead addition circuit.
J. Funct. Program., 2004

Program-Based Locality Measures For Scientific Computing.
Int. J. Found. Comput. Sci., 2004

Group-SPMD programming with orthogonal processor groups.
Concurr. Comput. Pract. Exp., 2004

Parallel Algorithms for the Determination of Lyapunov Characteristics of Large Nonlinear Dynamical Systems.
Proceedings of the Applied Parallel Computing, 2004

Performance Analysis for Parallel Adaptive FEM on SMP Clusters.
Proceedings of the Applied Parallel Computing, 2004

Functional Realization of Coordination Environments for Mixed Parallelism.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

A Source Code Analyzer for Performance Prediction.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Multilevel hierarchical matrix multiplication on clusters.
Proceedings of the 18th Annual International Conference on Supercomputing, 2004

Hierarchical Matrix-Matrix Multiplication Based on Multiprocessor Tasks.
Proceedings of the Computational Science, 2004

An Adaptive, 3-Dimensional, Hexahedral Finite Element Implementation for Distributed Memory.
Proceedings of the Computational Science, 2004

Execution Schemes for Parallel Adams Methods.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

A Data Management and Communication Layer for Adaptive, Hexahedral FEM.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Improving the execution time of global communication operations.
Proceedings of the First Conference on Computing Frontiers, 2004

Dynamic Loop Scheduling with Processor Groups.
Proceedings of the ISCA 17th International Conference on Parallel and Distributed Computing Systems, 2004

2003
A Communication API for Implementing Irregular Algorithms on SMP Clusters.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface,10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29, 2003

A Comparative Study of MPI Implementations on a Cluster of SMP Workstations.
Proceedings of the Parallel Computing: Software Technology, 2003

On Compiler Support for Mixed Task and Data Parallelism.
Proceedings of the Parallel Computing: Software Technology, 2003

A Distributed Hierarchical Programming Model for Heterogeneous Cluster of SMPs.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Task Pool Teams for Implementing Irregular Algorithms on Clusters of SMPs.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

2002
Library support for hierarchical multi-processor tasks.
Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

Workshop Introduction.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Selecting Data Distributions for Unbounded Loops.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Pipelining for Locality Improvement in RK Methods.
Proceedings of the Euro-Par 2002, 2002

2001
Library support for orthogonal processor groups.
Proceedings of the Thirteenth Annual ACM Symposium on Parallel Algorithms and Architectures, 2001

ORT: a communication library for orthogonal processor groups.
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001

A Hierarchical Computation Model for Distributed Shared-Memory Machines.
Proceedings of the Ninth Euromicro Workshop on Parallel and Distributed Processing, 2001

Cyclic Reduction on Distributed Shared Memory Machines.
Proceedings of the Ninth Euromicro Workshop on Parallel and Distributed Processing, 2001

Optimizing locality for ODE solvers.
Proceedings of the 15th international conference on Supercomputing, 2001

Orthogonal Processor Groups for Message-Passing Programs.
Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

2000
A Transformation Approach to Derive Efficient Parallel Implementations.
IEEE Trans. Software Eng., 2000

Deriving Array Distributions by Optimization Techniques.
J. Supercomput., 2000

Abstract Parallel Machines.
Comput. Artif. Intell., 2000

A Side-Effect-Free Hierarchical Radiosity Algorithm.
Proceedings of the Applied Computing 2000, 2000

Combining Thread Programming with Message Passing for Atmosphere Simulation.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

Set Operations for Orthogonal Processor Groups.
Proceedings of the Languages and Compilers for Parallel Computing, 2000

Cost Hierarchies for Abstract Parallel Machines.
Proceedings of the Languages and Compilers for Parallel Computing, 2000

Modelling the Runtime of Scientific Programs on Parallel Computers.
Proceedings of the 2000 International Workshop on Parallel Processing, 2000

Parallele und verteilte Programmierung.
Springer, ISBN: 978-3-540-66009-5, 2000

1999
Compiler support for task scheduling in hierarchical execution models.
J. Syst. Archit., 1999

Scalability of Sparse Cholesky Factorization.
Int. J. High Speed Comput., 1999

Diagonal-Implicitly Iterated Runge-Kutta Methods on Distributed Memory Machines.
Int. J. High Speed Comput., 1999

Parallel execution of embedded and iterated Runge-Kutta methods.
Concurr. Pract. Exp., 1999

Matrix Computations Behind the Hierarchical Radiosity Method.
Proceedings of the 1999 ACM Symposium on Applied Computing, 1999

A Coordination Language for Mixed Task and and Data Parallel Programs.
Proceedings of the 1999 ACM Symposium on Applied Computing, 1999

Scheduling of Data Parallel Modules for Scientific Computing.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

Parallel cloud modeling.
Proceedings of the Parallel Computing: Fundamentals & Applications, 1999

1998
A Shared-Memory Implementation of the Hierarchical Radiosity Method.
Theor. Comput. Sci., 1998

Execution behavior analysis and performance prediction for a shared-memory implementation of an irregular particle simulation method.
Simul. Pract. Theory, 1998

1997
Load balancing schemes for extrapolation methods.
Concurr. Pract. Exp., 1997

Integrating library modules into special purpose parallel algorithms.
Proceedings of the International Symposium on Software Engineering for Parallel and Distributed Systems, 1997

Parallel Simulation of Flows in Sewer Network Systems.
Proceedings of the Parallel Computing: Fundamentals, 1997

Parallel Execution of Embedded Runge-Kutta Methods.
Proceedings of the Parallel Computing: Fundamentals, 1997

Modeling the Communication Behavior of the Intel Paragon.
Proceedings of the MASCOTS 1997, 1997

Scalability of Parallel Sparse Cholesky Factorization.
Proceedings of the Euro-Par '97 Parallel Processing, 1997

A Methodology for Deriving Parallel Programs with a Family of Parallel Abstract Machines.
Proceedings of the Euro-Par '97 Parallel Processing, 1997

1996
Deriving structured parallel implementations for numerical methods.
Microprocess. Microprogramming, 1996

Parallel Implementations of Iterated Runge-Kutta Methods.
Int. J. High Perform. Comput. Appl., 1996

Scheduling of multiprocessor tasks for numerical applications.
Proceedings of the Eighth IEEE Symposium on Parallel and Distributed Processing, 1996

Shared-Memory Implementation of an Irregular Particle Simulation Method.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

Comparing Task and Data Parallel Execution Schemes for the DIIRK Method.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

Scalability and Granularity Issues of the Hierarchical Radiosity Method.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

The compiler TwoL for the design of parallel implementations.
Proceedings of the Fifth International Conference on Parallel Architectures and Compilation Techniques, 1996

1995
2DT-FP: A parallel functional programming language on two-dimensional data.
Int. J. Parallel Program., 1995

Iterated Runge-Kutta methods on distributed memory multiprocessors.
Proceedings of the 3rd Euromicro Workshop on Parallel and Distributed Processing (PDP '95), 1995

An Object Oriented Implementation of Distributed Graph-Based Computations.
Proceedings of the Parallel Computing: State-of-the-Art and Perspectives, 1995

Performance predictions for parallel diagonal-implicitly iterated Runge-Kutta methods.
Proceedings of the Ninth Workshop on Parallel and Distributed Simulation, 1995

An application specific parallel programming paradigm.
Proceedings of the High-Performance Computing and Networking, 1995

Parallel solution of a Schrödinger-Poisson system.
Proceedings of the High-Performance Computing and Networking, 1995

Formal Specification of Interconnection Networks.
Proceedings of the Functional Programming, Glasgow, UK, 1995, 1995

Optimal Data Distributions for LU Decomposition.
Proceedings of the Euro-Par '95 Parallel Processing, 1995

1994
A Process Oriented Semantics of the PRAM-Language FORK.
Comput. Lang., 1994

Load Balancing for Extraplation Methods on Distributed Memory Multiprocessors.
Proceedings of the PARLE '94: Parallel Architectures and Languages Europe, 1994

A Case Study in Parallel Program Derivation: the Heat Equation Algorithm.
Proceedings of the 1994 Glasgow Workshop on Functional Programming, 1994

Hypercube Implementation and Performance Analysis for Extrapolation Models.
Proceedings of the Parallel Processing: CONPAR 94, 1994

Implementing 2DT on a Multiprocessor.
Proceedings of the Compiler Construction, 5th International Conference, 1994

1993
2DT-FP: An FP Based Programming Language for Efficient Parallel Programming of Multiprocessor Networks.
Proceedings of the PARLE '93, 1993

1989
Über ein Schrödinger-Poisson-System
PhD thesis, 1989


  Loading...