Christoph W. Kessler

J. Syst. Archit., 2021

SkePU 3: Portable High-Level Programming of Heterogeneous Systems and HPC Clusters.

Int. J. Parallel Program., 2021

Temperature-Aware Energy-Optimal Scheduling of Moldable Streaming Tasks onto 2D-Mesh-Based Many-Core CPUs with DVFS.

Proceedings of the Job Scheduling Strategies for Parallel Processing, 2021

Combining Design Space Exploration with Task Scheduling of Moldable Streaming Tasks on Reconfigurable Platforms.

Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2021

2020

Hybrid CPU-GPU execution support in the skeleton programming framework SkePU.

Tomas Öhberg

J. Supercomput., 2020

Static Scheduling of Moldable Streaming Tasks With Task Fusion for Parallel Systems With DVFS.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Programming languages for data-Intensive HPC applications: A systematic mapping study.

Parallel Comput., 2020

Leveraging access mode declarations in a model for memory consistency in heterogeneous systems.

Ludovic Henrio

J. Log. Algebraic Methods Program., 2020

Guest Editor's Note: High-Level Parallel Programming 2019.

Int. J. Parallel Program., 2020

Portable exploitation of parallel and heterogeneous HPC architectures in neural simulation using SkePU.

Proceedings of the SCOPES '20: 23rd International Workshop on Software and Compilers for Embedded Systems, 2020

Voltage Island-Aware Energy-Efficient Scheduling of Parallel Streaming Tasks on Many-Core CPUs.

Proceedings of the 28th Euromicro International Conference on Parallel, 2020

Maximizing Profit in Energy-Efficient Moldable Task Execution with Deadline.

Proceedings of the 28th Euromicro International Conference on Parallel, 2020

Robustness and Energy-elasticity of Crown Schedules for Sets of Parallelizable Tasks on Many-core Systems with DVFS.

Matteo Alessandro Francavilla

Proceedings of the 28th Euromicro International Conference on Parallel, 2020

2019

Companion data of a Systematic Mapping Study of Programming Languages for Data-Intensive HPC Applications.

Hugo F. M. C. Martiniano

Dataset, October, 2019

Companion data of a Systematic Mapping Study of Programming Languages for Data-Intensive HPC Applications.

Hugo F. M. C. Martiniano

Dataset, May, 2019

Parallelization of Hierarchical Matrix Algorithms for Electromagnetic Scattering Problems.

Elisabeth Larsson

Afshin Zafari

Marco Righero

Proceedings of the High-Performance Modelling and Simulation for Big Data Applications, 2019

Extending smart containers for data locality-aware skeleton programming.

Concurr. Comput. Pract. Exp., 2019

Global optimization of operand transfer fusion in heterogeneous computing.

Proceedings of the 22nd International Workshop on Software and Compilers for Embedded Systems, 2019

Multi-Variant User Functions for Platform-Aware Skeleton Programming.

Proceedings of the Parallel Computing: Technology Trends, 2019

Scheduling Moldable Parallel Streaming Tasks on Heterogeneous Platforms with Frequency Scaling.

Proceedings of the 27th European Signal Processing Conference, 2019

Adaptive Crown Scheduling for Streaming Tasks on Many-Core Systems with Discrete DVFS.

Proceedings of the Euro-Par 2019: Parallel Processing Workshops, 2019

Co-Optimizing Core Allocation, Mapping and DVFS in Streaming Programs with Moldable Tasks for Energy Efficient Execution on Manycore Architectures.

Proceedings of the 19th International Conference on Application of Concurrency to System Design, 2019

2018

SkePU 2: Flexible and Type-Safe Skeleton Programming for Heterogeneous Parallel Systems.

Int. J. Parallel Program., 2018

EXA2PRO programming environment: architecture and applications.

Dimitrios Soudris

Lazaros Papadopoulos

Athanasios I. Papadopoulos

Dionysios D. Kehagias

Panos Seferlis

Alexander Chatzigeorgiou

Apostolos Ampatzoglou

Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, 2018

Lazy Allocation and Transfer Fusion Optimization for GPU-Based Heterogeneous Systems.

Proceedings of the 26th Euromicro International Conference on Parallel, 2018

Ensuring Memory Consistency in Heterogeneous Systems Based on Access Mode Declarations.

Ludovic Henrio

Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

2017

Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: Programming Productivity, Performance, and Energy Consumption.

Proceedings of the 2017 Workshop on Adaptive Resource Management and Scheduling for Cloud Computing, 2017

Asymmetric Crown Scheduling.

Manfred Torggler

Proceedings of the 25th Euromicro International Conference on Parallel, 2017

VectorPU: A Generic and Efficient Data-container and Component Model for Transparent Data Transfer on GPU-based Heterogeneous Systems.

Proceedings of the 8th Workshop and 6th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and Design Tools and Architectures for Multicore Embedded Computing Platforms, 2017

2016

Smart Containers and Skeleton Programming for GPU-Based Systems.

Int. J. Parallel Program., 2016

Energy-Optimized Static Scheduling for Many-Cores with Task Parallelization, DVFS and Core Consolidation.

Proceedings of the 19th International Workshop on Software and Compilers for Embedded Systems, 2016

An Extensible Platform Description Language Supporting Retargetable Toolchains and Adaptive Execution.

Proceedings of the 19th International Workshop on Software and Compilers for Embedded Systems, 2016

Efficient Execution of SkePU Skeleton Programs on the Low-Power Multicore Processor Myriad2.

Sebastian Thorarensen

Proceedings of the 24th Euromicro International Conference on Parallel, 2016

2015

Performance-aware composition framework for GPU-based systems.

J. Supercomput., 2015

MeterPU: A Generic Measurement Abstraction API Enabling Energy-Tuned Skeleton Backend Selection.

Proceedings of the 2015 IEEE TrustCom/BigDataSE/ISPA, 2015

Fast Crown Scheduling Heuristics for Energy-Efficient Mapping and Scaling of Moldable Streaming Tasks on Many-Core Systems.

Proceedings of the 18th International Workshop on Software and Compilers for Embedded Systems, 2015

Portable Parallelization of the EDGE CFD Application for GPU-based Systems using the SkePU Skeleton Programming Library.

Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Improving Energy-Efficiency of Static Schedules by Core Consolidation and Switching Off Unused Cores.

Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Optimized variant-selection code generation for loops on heterogeneous multicore systems.

Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Mimer and Schedeval: Tools for Comparing Static Schedulers for Streaming Applications on Manycore Architectures.

Johan Janzen

Proceedings of the 44th International Conference on Parallel Processing Workshops, 2015

XPDL: Extensible Platform Description Language to Support Energy Modeling and Optimization.

Proceedings of the 44th International Conference on Parallel Processing Workshops, 2015

2014

Fast Crown Scheduling Heuristics for Energy-Efficient Mapping and Scaling of Moldable Streaming Tasks on Manycore Systems.

ACM Trans. Archit. Code Optim., 2014

NUMA Computing with Hardware and Software Co-Support on Configurable Emulated Shared Memory Architectures.

Int. J. Netw. Comput., 2014

Optimized Composition: Generating Efficient Code for Heterogeneous Systems from Multi-Variant Components, Skeletons and Containers.

CoRR, 2014

The PEPPHER composition tool: performance-aware composition for GPU-based systems.

Computing, 2014

Pruning Strategies in Adaptive Off-Line Tuning for Optimized Composition of Components on Heterogeneous Systems.

Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

Global Optimization of Execution Mode Selection for the Reconfigurable PRAM-NUMA Multicore Architecture REPLICA.

Proceedings of the Second International Symposium on Computing and Networking, 2014

Optimized Selection of Runtime Mode for the Reconfigurable PRAM-NUMA Architecture REPLICA Using Machine-Learning.

Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

A Quantitative Comparison of PRAM based Emulated Shared Memory Architectures to Current Multicore CPUs and GPUs.

Proceedings of the ARCS 2014, 2014

2013

Compiling for VLIW DSPs.

Proceedings of the Handbook of Signal Processing Systems, 2013

Extensible Recognition of Algorithmic Patterns in DSP Programs for Automatic Parallelization.

Amin Shafiee Sarvestani

Int. J. Parallel Program., 2013

Crown scheduling: Energy-efficient resource allocation, mapping and discrete frequency scaling for collections of malleable streaming tasks.

Proceedings of the 2013 23rd International Workshop on Power and Timing Modeling, 2013

Hardware and Software Support for NUMA Computing on Configurable Emulated Shared Memory Architectures.

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

A Framework for Performance-Aware Composition of Applications for GPU-Based Systems.

Proceedings of the 42nd International Conference on Parallel Processing, 2013

Adaptive Implementation Selection in the SkePU Skeleton Programming Library.

Proceedings of the Advanced Parallel Processing Technologies, 2013

2012

Integrated Code Generation for Loops.

ACM Trans. Embed. Comput. Syst., 2012

Engineering Parallel Sorting for the Intel SCC.

Proceedings of the International Conference on Computational Science, 2012

Executing PRAM Programs on GPUs.

Jurgen Brenner

Proceedings of the International Conference on Computational Science, 2012

Optimized On-Chip-Pipelining for Memory-Intensive Computations on Multi-Core Processors with Explicit Memory Hierarchy.

Rikard Hultén

J. Univers. Comput. Sci., 2012

Optimized composition of performance-aware parallel components.

Welf Löwe

Concurr. Comput. Pract. Exp., 2012

Adaptive Off-Line Tuning for Optimized Composition of Components for Heterogeneous Many-Core Systems.

Proceedings of the High Performance Computing for Computational Science, 2012

Poster: Leveraging PEPPHER Technology for Performance Portable Supercomputing.

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Leveraging PEPPHER Technology for Performance Portable Supercomputing.

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

The PEPPHER Composition Tool: Performance-Aware Dynamic Composition of Applications for GPU-Based Systems.

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Modelling Power Consumption of the Intel SCC.

Patrick Cichowski

Proceedings of the 6th Many-core Applications Research Community (MARC) Symposium, 2012

Design of the Language Replica for Hybrid PRAM-NUMA Many-core Architectures.

Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Programmability and performance portability aspects of heterogeneous multi-/manycore systems.

Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

Flexible Scheduling and Thread Allocation for Synchronous Parallel Tasks.

Proceedings of the ARCS 2012 Workshops, February 28 - March 2, 2012, Munich, Germany

Programming the Cell Processor.

Fundamentals of Multicore Software Development, 2012

2011

PEPPHER: Efficient and Productive Usage of Hybrid Computing Systems.

IEEE Micro, 2011

Programmiertechniken für den Cell-Prozessor (Programming Techniques for the Cell Processor).

it Inf. Technol., 2011

Comparing Machine Learning Approaches for Context-Aware Composition.

Antonina Danylenko

Welf Löwe

Proceedings of the Software Composition - 10th International Conference, 2011

Balancing CPU Load for Irregular MPI Applications.

Mudassar Majeed

Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

Flexible Runtime Support for Efficient Skeleton Programming on Heterogeneous GPU-based Systems.

Samuel Thibault

Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous many-core Architectures.

Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

Investigation of main memory bandwidth on Intel Single-Chip Cloud Computer.

Proceedings of the 3rd Many-core Applications Research Community (MARC) Symposium, 2011

Auto-tuning SkePU: a multi-backend skeleton programming framework for multi-GPU systems.

Johan Enmyren

Proceedings of the 4th International Workshop on Multicore Software Engineering, 2011

Case Study of Efficient Parallel Memory Access Programming for the Embedded Heterogeneous Multicore DSP Architecture ePUMA.

Proceedings of the International Conference on Complex, 2011

2010

Theory and Algorithms for Parallel Computation.

Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Optimized On-Chip-Pipelined Mergesort on the Cell/B.E.

Rikard Hultén

Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Program Composition and Optimization: An Introduction.

Proceedings of the Program Composition and Optimization: Autotuning, Scheduling, Metaprogramming and Beyond, 09.05., 2010

10191 Executive Summary - Program Composition and Optimization : Autotuning, Scheduling, Metaprogramming and Beyond.

Proceedings of the Program Composition and Optimization: Autotuning, Scheduling, Metaprogramming and Beyond, 09.05., 2010

10191 Abstracts Collection - Program Composition and Optimization : Autotuning, Scheduling, Metaprogramming and Beyond.

Proceedings of the Program Composition and Optimization: Autotuning, Scheduling, Metaprogramming and Beyond, 09.05., 2010

Platform-independent modeling of explicitly parallel programs.

Wladimir Schamai

Peter Fritzson

Proceedings of the ARCS '10, 2010

Compiling for VLIW DSPs.

Proceedings of the Handbook of Signal Processing Systems, 2010

2009

Message from the PDSEC-09 workshop chairs.

Laurence Tianruo Yang

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Integrated Modulo Scheduling for Clustered VLIW Architectures.

Proceedings of the High Performance Embedded Architectures and Compilers, 2009

2008

Automatic parallelization of simulation code for equation-based models with software pipelining and measurements on three platforms.

SIGARCH Comput. Archit. News, 2008

Optimized on-chip pipelining of memory-intensive computations on the cell BE.

SIGARCH Comput. Archit. News, 2008

Profile-Guided Composition.

Proceedings of the Software Composition - 7th International Symposium, 2008

Optimal vs. heuristic integrated code generation for clustered VLIW architectures.

Oskar Skoog

Proceedings of the 11th International Workshop on Software and Compilers for Embedded Systems, 2008

BlockLib: a skeleton library for cell broadband engine.

Markus Ålind

Proceedings of the 1st International Workshop on Multicore Software Engineering, 2008

Optimized Pipelined Parallel Merge Sort on the Cell BE.

Proceedings of the Euro-Par 2008 Workshops, 2008

Hybrid Parallel Sort on the Cell Processor.

Proceedings of the 9th Workshop on Parallel Systems and Algorithms (PASA) held at the 21st Conference on the Architecture of Computing Systems (ARCS), 2008

2007

Classification and generation of schedules for VLIW processors.

Concurr. Comput. Pract. Exp., 2007

A Survey of Reasoning in Parallelization.

Proceedings of the 8th ACIS International Conference on Software Engineering, 2007

A Framework for Performance-Aware Composition of Explicitly Parallel Components.

Welf Löwe

Proceedings of the Parallel Computing: Architectures, 2007

A Formal Framework for Automated Round-Trip Software Engineering in Static Aspect Weaving and Transformations.

Proceedings of the 29th International Conference on Software Engineering (ICSE 2007), 2007

2006

Optimal integrated code generation for VLIW architectures.

Concurr. Comput. Pract. Exp., 2006

NestStepModelica - Mathematical Modeling and Bulk-Synchronous Parallel Simulation.

Peter Fritzson

Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Automated Round-trip Software Engineering in Aspect Weaving Systems.

Peter Bunus

Proceedings of the 21st IEEE/ACM International Conference on Automated Software Engineering (ASE 2006), 2006

Crosscutting Concerns in Parallelization by Invasive Software Composition and Aspect Weaving.

Proceedings of the 39th Hawaii International Conference on Systems Science (HICSS-39 2006), 2006

Optimal Integrated VLIW Code Generation with Integer Linear Programming.

Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Load balancing of irregular parallel divide-and-conquer algorithms in group-SPMD programming environments.

Proceedings of the ARCS 2006, 2006

2005

05101 Abstracts Collection - Scheduling for Parallel Architectures: Theory, Applications, Challenges.

Proceedings of the Scheduling for Parallel Architectures: Theory, Applications, Challenges, 2005

05101 Executive Summary - Scheduling for Parallel Architectures: Theory, Applications, Challenges.

Proceedings of the Scheduling for Parallel Architectures: Theory, Applications, Challenges, 2005

Parallelisation of Sequential Programs by Invasive Composition and Aspect Weaving.

Proceedings of the Advanced Parallel Processing Technologies, 6th International Workshop, 2005

2004

Managing distributed shared arrays in a bulk-synchronous parallel programming environment.

Concurr. Comput. Pract. Exp., 2004

A practical access to the theory of parallel algorithms.

Proceedings of the 35th SIGCSE Technical Symposium on Computer Science Education, 2004

Towards a Bulk-Synchronous Distributed Shared Memory Programming Environment for Grids.

Håkan Mattsson

Proceedings of the Applied Parallel Computing, 2004

Topic 10: Parallel Programming: Models, Methods and Programming Languages.

Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Exploiting Symmetries for Optimal Integrated Code Generation.

Proceedings of the International Conference on Embedded Systems and Applications, 2004

2002

Optimal integrated code generation for clustered VLIW architectures.

Proceedings of the 2002 Joint Conference on Languages, 2002

Mid-term course evaluations with muddy cards.

Simin Nadjm-Tehrani

Proceedings of the 7th Annual SIGCSE Conference on Innovation and Technology in Computer Science Education, 2002

A dialog between authors and teachers.

Proceedings of the 7th Annual SIGCSE Conference on Innovation and Technology in Computer Science Education, 2002

2001

A Dynamic Programming Approach to Optimal Integrated Code Generation.

Proceedings of the 2001 ACM SIGPLAN Workshop on Optimization of Middleware and Distributed Systems, 2001

Practical PRAM programming.

Jesper Larsson Träff

Wiley series on parallel and distributed computing, Wiley, 2001

2000

NestStep: Nested Parallelism and Virtual Shared Memory for the BSP Model.

J. Supercomput., 2000

Two program comprehension tools for automatic parallelization.

Beniamino Di Martino

IEEE Concurr., 2000

1999

The SPARAMAT Approach to Automatic Comprehension of Sparse Matrix Computations

Craig Smith

Universität Trier, Mathematik/Informatik, Forschungsbericht, 1999

NestStep: Nested Parallelism and Virtual Memory for the BSP Model.

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

The SPARAMAT Approach to Automatic Comprehension of Sparse Matrix Computations.

Craig Smith

Proceedings of the 7th International Workshop on Program Comprehension (IWPC '99), May 5-7, 1999

ForkLight: A Control-Synchronous Parallel Programming Language.

Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999

1997

Two Program Comprehension Tools for Automatic Parallelization: A Comparative Study

Beniamino Di Martino

Universität Trier, Mathematik/Informatik, Forschungsbericht, 1997

Practical PRAM Programming with Fork95 - A Tutorial

Universität Trier, Mathematik/Informatik, Forschungsbericht, 1997

The Fork95 parallel programming language: Design, implementation, application.

Int. J. Parallel Program., 1997

Language and library support for practical PRAM programming.

Jesper Larsson Träff

Proceedings of the Fifth Euromicro Workshop on Parallel and Distributed Processing (PDP '97), 1997

Applicability of Program Comprehension to Sparse Matrix Computations.

Proceedings of the Euro-Par '97 Parallel Processing, 1997

Language Support for Synchronous Parallel Critical Sections.

Proceedings of the 1997 Advances in Parallel and Distributed Computing Conference (APDC '97), 1997

1996

Pattern-Driven Automatic Parallelization.

Sci. Program., 1996

A Library of Basic PRAM Algorithms and its Implementation in FORK.

Jesper Larsson Träff

Proceedings of the 8th Annual ACM Symposium on Parallel Algorithms and Architectures, 1996

Scheduling Expression DAGs for Minimal Register Need.

Proceedings of the Programming Languages: Implementations, 1996

Program comprehension engines for automatic parallelization: a comparative study.

Beniamino Di Martino

Proceedings of the Software Engineering for Parallel and Distributed Systems, 1996

Parallel Fourier-Motzkin Elimination.

Proceedings of the Euro-Par '96 Parallel Processing, 1996

1995

Integrating Synchronous and Asynchronous Paradigms: The Fork95 Parallel Programming Language

Universität Trier, Mathematik/Informatik, Forschungsbericht, 1995

Generating Optimal Contiguous Evaluations for Expression DAGs.

Comput. Lang., 1995

Pattern-driven automatic program transformation and parallelization.

Proceedings of the 3rd Euromicro Workshop on Parallel and Distributed Processing (PDP '95), 1995

Optimal Contiguous Expression DAG Evaluations.

Proceedings of the Fundamentals of Computation Theory, 10th International Symposium, 1995

1994

Integrating Scalable Parallel Libraries and Automatically Parallelizing Compilers

Universität Trier, Mathematik/Informatik, Forschungsbericht, 1994

Automatische Parallelisierung numerischer Programme durch Mustererkennung.

PhD thesis, 1994

Knowledge-Based Automatic Parallelization by Pattern Recognition.

Proceedings of the Automatic Parallelization: New Approaches to Code Generation, 1994

1993

Efficient Register Allocation for Large Basic Blocks.

Proceedings of the Programming Language Implementation and Logic Programming, 1993

Automatic Parallelization by Pattern-Matching.

Wolfgang J. Paul

Proceedings of the Parallel Computation, 1993

1991

A Randomized Heuristic Approach to Register Allocation

Wolfgang J. Paul

Proceedings of the Programming Language Implementation and Logic Programming, 1991

Scheduling Vector Straight Line Code on Vector Processors.

Wolfgang J. Paul