Murray Cole

According to our database1, Murray Cole authored at least 65 papers between 1988 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Revealing Compiler Heuristics Through Automated Discovery and Optimization.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2024

2021
Device Hopping: Transparent Mid-Kernel Runtime Switching for Heterogeneous Systems.
ACM Trans. Archit. Code Optim., 2021

Modernizing parallel code with pattern analysis.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

HyFM: function merging for free.
Proceedings of the LCTES '21: 22nd ACM SIGPLAN/SIGBED International Conference on Languages, 2021

CoSPARSE: A Software and Hardware Reconfigurable SpMV Framework for Graph Analytics.
Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

Loop Parallelization using Dynamic Commutativity Analysis.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021

2020
Enforcing Deadlines for Skeleton-based Parallel Programming.
Proceedings of the IEEE Real-Time and Embedded Technology and Applications Symposium, 2020

Effective function merging in the SSA form.
Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2020

A Hybrid Approach to Parallel Pattern Discovery in C++.
Proceedings of the 28th Euromicro International Conference on Parallel, 2020

HETSIM: Simulating Large-Scale Heterogeneous Systems using a Trace-driven, Synchronization and Dependency-Aware Framework.
Proceedings of the IEEE International Symposium on Workload Characterization, 2020

DelayRepay: delayed execution for kernel fusion in Python.
Proceedings of the DLS 2020: Proceedings of the 16th ACM SIGPLAN International Symposium on Dynamic Languages, 2020

Vectorization-aware loop unrolling with seed forwarding.
Proceedings of the CC '20: 29th International Conference on Compiler Construction, 2020

Transmuter: Bridging the Efficiency Gap using Memory and Dataflow Reconfiguration.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020

Parallelizing Parallel Programs: A Dynamic Pattern Analysis for Modernization of Legacy Parallel Code.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020

2019
Function Merging by Sequence Alignment.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

2018
NUMA Optimizations for Algorithmic Skeletons.
Proceedings of the Euro-Par 2018: Parallel Processing, 2018

Towards a compiler analysis for parallel algorithmic skeletons.
Proceedings of the 27th International Conference on Compiler Construction, 2018

2015
Helium: a transparent inter-kernel optimizer for OpenCL.
Proceedings of the 8th Workshop on General Purpose Processing using GPUs, 2015

LIRA: Adaptive Contention-Aware Thread Placement for Parallel Runtime Systems.
Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers, 2015

2014
Automatic Skeleton-Driven Memory Affinity for Transactional Worklist Applications.
Int. J. Parallel Program., 2014

Autotuning Wavefront Applications for Multicore Multi-GPU Hybrid Architectures.
Proceedings of the 2014 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2014

2013
PARTANS: An autotuning framework for stencil computation on multi-GPU systems.
ACM Trans. Archit. Code Optim., 2013

Distributed computing practice for large-scale science and engineering applications.
Concurr. Comput. Pract. Exp., 2013

MaSiF: Machine learning guided auto-tuning of parallel skeletons.
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

2012
Autotuning Skeleton-Driven Optimizations for Transactional Worklist Applications.
IEEE Trans. Parallel Distributed Syst., 2012

2011
Parallel Skeletons.
Proceedings of the Encyclopedia of Parallel Computing, 2011

A machine learning-based approach for thread mapping on transactional memory applications.
Proceedings of the 18th International Conference on High Performance Computing, 2011

2010
Adaptive statistical scheduling of divisible workloads in heterogeneous systems.
J. Sched., 2010

Adaptive structured parallelism for distributed heterogeneous architectures: a methodological approach with pipelines and farms.
Concurr. Comput. Pract. Exp., 2010

2009
Characterising Effective Resource Analyses for Parallel and Distributed Coordination.
Proceedings of the Foundational and Practical Aspects of Resource Analysis, 2009

2008
An adaptive parallel pipeline pattern for grids.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Abstractions for Distributed Systems (DPA 2008).
Proceedings of the Euro-Par 2008 Workshops, 2008

2007
A Structural Approach for Modelling Performance of Systems Using Skeletons.
Proceedings of the Fifth Workshop on Quantitative Aspects of Programming Languages, 2007

Adaptive structured parallelism for computational grids.
Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2007

Scheduling DAGs on Grids with Copying and Migration.
Proceedings of the Parallel Processing and Applied Mathematics, 2007

Reactive grid scheduling of DAG applications.
Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Networks, 2007

Reliable DAG scheduling on grids with rewinding and migration.
Proceedings of the 1st International ICST Conference on Networks for Grid Applications, 2007

2006
Editorial.
Parallel Comput., 2006

Towards Fully Adaptive Pipeline Parallelism for Heterogeneous Distributed Environments.
Proceedings of the Parallel and Distributed Processing and Applications, 2006

Combining Measurement and Stochastic Modelling to Enhance Scheduling Decisions for a Parallel Mean Value Analysis Algorithm.
Proceedings of the Computational Science, 2006

2005
Evaluating the performance of pipeline-structured parallel programs with skeletons and process algebra.
Scalable Comput. Pract. Exp., 2005

Scheduling Skeleton-Based Grid Applications Using PEPA and NWS.
Comput. J., 2005

Integrating MPI-Skeletons with Web Services.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Using eSkel to Implement the Multiple Baseline Stereo Application.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Two Fundamental Concepts in Skeletal Parallel Programming.
Proceedings of the Computational Science, 2005

Flexible Skeletal Programming with eSkel.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

Enhancing the effective utilisation of grid clusters by exploiting on-line performability analysis.
Proceedings of the 5th International Symposium on Cluster Computing and the Grid (CCGrid 2005), 2005

2004
Bringing skeletons out of the closet: a pragmatic manifesto for skeletal parallel programming.
Parallel Comput., 2004

Evaluating the Performance of Skeleton-Based High Level Parallel Programs.
Proceedings of the Computational Science, 2004

Why Structured Parallel Programming Matters.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

2002
The Integration of Task and Data Parallel Skeletons.
Parallel Process. Lett., 2002

Automated Cost Analysis of a Parallel Maximum Segment Sum Program Derivation.
Parallel Process. Lett., 2002

Static performance prediction of skeletal parallel programs.
Parallel Algorithms Appl., 2002

2001
Coordinating Heterogeneous Parallel Systems with Skeletons and Activity Graphs.
J. Syst. Integr., 2001

2000
Activity Graphs: A Model-Independent Intermediate Layer for Skeletal Coordination.
Proceedings of the Applied Computing 2000, 2000

1999
BSP-based Cost Analysis of Skeletal Programs.
Proceedings of the Selected papers from the 1st Scottish Functional Programming Workshop (SFP99), 1999

Algorithmic Skeletons.
Proceedings of the Research Directions in Parallel Functional Programming, 1999

1998
Languages.
Proceedings of the Euro-Par '98 Parallel Processing, 1998

1997
Recursive 3D Mesh Indexing with Improved Locality.
Proceedings of the High-Performance Computing and Networking, 1997

A Monadic Calculus for Parallel Costing of a Functional Language of Arrays.
Proceedings of the Euro-Par '97 Parallel Processing, 1997

On Dividing and Conquering Independently.
Proceedings of the Euro-Par '97 Parallel Processing, 1997

1995
Parallel Programming with List Homomorphisms.
Parallel Process. Lett., 1995

Implementing the hierarchical PRAM on the 2D mesh: analyses and experiments.
Proceedings of the Seventh IEEE Symposium on Parallel and Distributed Processing, 1995

1993
Parallel Programming, List Homomorphisms and the Maximum Segment Sum Problem.
Proceedings of the Parallel Computing: Trends and Applications, 1993

1988
Algorithmic skeletons : a structured approach to the management of parallel computation.
PhD thesis, 1988


  Loading...