Michael F. P. O'Boyle

Proceedings of the MAPS@PLDI 2022: 6th ACM SIGPLAN International Symposium on Machine Programming, 2022

Investigating magic numbers: improving the inlining heuristic in the Glasgow Haskell Compiler.

[BibT_eX]

[DOI]

Celeste Hollenbeck

Michel Steuwer

Proceedings of the Haskell '22: 15th ACM SIGPLAN International Haskell Symposium, Ljubljana, Slovenia, September 15, 2022

F3M: Fast Focused Function Merging.

[BibT_eX]

[DOI]

Pavlos Petoumenos

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022

Loop Rolling for Code Size Reduction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022

2021

Learning C to x86 Translation: An Experiment in Neural Compilation.

[BibT_eX]

[DOI]

Jordi Armengol-Estapé

CoRR, 2021

SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator.

[BibT_eX]

[DOI]

Subhankar Pal

Aporva Amarnath

Siying Feng

Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations.

[BibT_eX]

[DOI]

Proceedings of the 38th International Conference on Machine Learning, 2021

Prodigy: Improving the Memory Latency of Data-Indirect Irregular Workloads Using Hardware-Software Co-Design.

[BibT_eX]

[DOI]

Christos Vasiladiotis

Scott A. Mahlke

Trevor N. Mudge

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

New Regular Expressions on Old Accelerators.

[BibT_eX]

[DOI]

Jackson Woodruff

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

CoSPARSE: A Software and Hardware Reconfigurable SpMV Framework for Graph Analytics.

[BibT_eX]

[DOI]

Chaitali Chakrabarti

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

Neural architecture search as program transformation exploration.

[BibT_eX]

[DOI]

Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

Program Lifting using Gray-Box Behavior.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2020

Deep Data Flow Analysis.

[BibT_eX]

[DOI]

CoRR, 2020

Retrofitting Symbolic Holes to LLVM IR.

[BibT_eX]

[DOI]

CoRR, 2020

TASO: Time and Space Optimization for Memory-Constrained DNN Inference.

[BibT_eX]

[DOI]

Andrew Anderson

Valentin Radu

David Gregg

Proceedings of the 32nd IEEE International Symposium on Computer Architecture and High Performance Computing, 2020

Automatic generation of specialized direct convolutions for mobile GPUs.

[BibT_eX]

[DOI]

Proceedings of the GPGPU@PPoPP '20: 13th Annual Workshop on General Purpose Processing using Graphics Processing Unit colocated with 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels.

[BibT_eX]

[DOI]

Massimiliano Patacchiola

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

M3: Semantic API Migrations.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020

HETSIM: Simulating Large-Scale Heterogeneous Systems using a Trace-driven, Synchronization and Dependency-Aware Framework.

[BibT_eX]

[DOI]

Trevor N. Mudge

Proceedings of the IEEE International Symposium on Workload Characterization, 2020

BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget.

[BibT_eX]

[DOI]

Gavin Gray

Proceedings of the 8th International Conference on Learning Representations, 2020

Modeling black-box components with probabilistic synthesis.

[BibT_eX]

[DOI]

Jackson Woodruff

Proceedings of the GPCE '20: Proceedings of the 19th ACM SIGPLAN International Conference on Generative Programming: Concepts and Experiences, 2020

DelayRepay: delayed execution for kernel fusion in Python.

[BibT_eX]

[DOI]

Proceedings of the DLS 2020: Proceedings of the 16th ACM SIGPLAN International Symposium on Dynamic Languages, 2020

Automatically harnessing sparse acceleration.

[BibT_eX]

[DOI]

Proceedings of the CC '20: 29th International Conference on Compiler Construction, 2020

Optimizing Grouped Convolutions on Edge Devices.

[BibT_eX]

[DOI]

Proceedings of the 31st IEEE International Conference on Application-specific Systems, 2020

Transmuter: Bridging the Efficiency Gap using Memory and Dataflow Reconfiguration.

[BibT_eX]

[DOI]

Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020

2019

Augmenting Type Signatures for Program Synthesis.

[BibT_eX]

[DOI]

CoRR, 2019

BlockSwap: Fisher-guided Block Substitution for Network Compression.

[BibT_eX]

[DOI]

CoRR, 2019

Full-System Simulation of Mobile CPU/GPU Platforms.

[BibT_eX]

[DOI]

Bruno Bodin

Henrik Uhrenholt

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2019

Performance Aware Convolutional Neural Network Channel Pruning for Embedded GPUs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Workload Characterization, 2019

SLAMBench 3.0: Systematic Automated Reproducible Evaluation of SLAM Systems for Robot Vision Challenges and Scene Understanding.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Robotics and Automation, 2019

POSTER: Space and Time Optimal DNN Primitive Selection with Integer Linear Programming.

[BibT_eX]

[DOI]

Andrew Anderson

Valentin Radu

David Gregg

Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

Specialization Opportunities in Graphical Workloads.

[BibT_eX]

[DOI]

Lewis Crawford

Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

Type-Directed Program Synthesis and Constraint Generation for Library Portability.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018

Machine Learning in Compiler Optimization.

[BibT_eX]

[DOI]

Proc. IEEE, 2018

Navigating the Landscape for Real-Time Localization and Mapping for Robotics and Virtual and Augmented Reality.

[BibT_eX]

[DOI]

Proc. IEEE, 2018

HAKD: Hardware Aware Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2018

Pruning neural networks: is it time to nip it in the bud?

[BibT_eX]

[DOI]

CoRR, 2018

Navigating the Landscape for Real-time Localisation and Mapping for Robotics and Virtual and Augmented Reality.

[BibT_eX]

[DOI]

CoRR, 2018

Machine Learning in Compiler Optimisation.

[BibT_eX]

[DOI]

CoRR, 2018

MaxPair: Enhance OpenCL Concurrent Kernel Execution by Weighted Maximum Matching.

[BibT_eX]

[DOI]

Christian Fensch

Proceedings of the 11th Workshop on General Purpose Processing using GPUs, 2018

A Cross-platform Evaluation of Graphics Shader Compiler Optimization.

[BibT_eX]

[DOI]

Lewis Crawford

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2018

Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2018

Automatic Parameter Tuning of Motion Planning Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Characterising Across-Stack Optimisations for Deep Convolutional Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Symposium on Workload Characterization, 2018

SLAMBench2: Multi-Objective Head-to-Head Benchmarking for Visual SLAM.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

CAnDL: a domain specific language for compiler analysis.

[BibT_eX]

[DOI]

Lewis Crawford

Proceedings of the 27th International Conference on Compiler Construction, 2018

Automatic Matching of Legacy Code to Heterogeneous APIs: An Idiomatic Approach.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017

Merge or Separate?: Multi-job Scheduling for OpenCL Kernels on CPU/GPU Platforms.

[BibT_eX]

[DOI]

Proceedings of the General Purpose GPUs, 2017

Discovery and exploitation of general reductions: a constraint based approach.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Symposium on Code Generation and Optimization, 2017

2016

Selecting Heterogeneous Cores for Diversity.

[BibT_eX]

[DOI]

Erik Tomusk

ACM Trans. Archit. Code Optim., 2016

Four Metrics to Evaluate Heterogeneous Multicores.

[BibT_eX]

[DOI]

Erik Tomusk

ACM Trans. Archit. Code Optim., 2016

Diplomat: Mapping of Multi-kernel Applications Using a Static Dataflow Abstraction.

[BibT_eX]

[DOI]

Bruno Bodin

Luigi Nardi

Paul H. J. Kelly

Proceedings of the 24th IEEE International Symposium on Modeling, 2016

Portable and transparent software managed scheduling on accelerators for fair resource sharing.

[BibT_eX]

[DOI]

Christos Margiolas

Proceedings of the 2016 International Symposium on Code Generation and Optimization, 2016

Integrating Algorithmic Parameters into Benchmarking and Design Space Exploration in 3D Scene Understanding.

[BibT_eX]

[DOI]

Govind Sreekar Shenoy

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015

Celebrating diversity: a mixture of experts approach for runtime mapping in dynamic environments.

[BibT_eX]

[DOI]

Murali Krishna Emani

Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015

PALMOS: A Transparent, Multi-tasking Acceleration Layer for Parallel Heterogeneous Systems.

[BibT_eX]

[DOI]

Christos Margiolas

Proceedings of the 29th ACM on International Conference on Supercomputing, 2015

Introducing SLAMBench, a performance and accuracy benchmarking methodology for SLAM.

[BibT_eX]

[DOI]

Graham D. Riley

Nigel P. Topham

Stephen B. Furber

Proceedings of the IEEE International Conference on Robotics and Automation, 2015

2014

Integrating profile-driven parallelism detection and machine-learning-based mapping.

[BibT_eX]

[DOI]

Georgios Tournavitis

ACM Trans. Archit. Code Optim., 2014

Automatic and Portable Mapping of Data Parallel Programs to OpenCL for GPU-Based Heterogeneous Systems.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2014

Automatic feature generation for machine learning-based optimising compilation.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2014

Partitioning data-parallel programs for heterogeneous MPSoCs: time and energy design space exploration.

[BibT_eX]

[DOI]

Kiran Chandramohan

Proceedings of the SIGPLAN/SIGBED Conference on Languages, 2014

Change Detection Based Parallelism Mapping: Exploiting Offline Models and Online Adaptation.

[BibT_eX]

[DOI]

Murali Krishna Emani

Proceedings of the Languages and Compilers for Parallel Computing, 2014

Smart multi-task scheduling for OpenCL programs on CPU/GPU heterogeneous platforms.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on High Performance Computing, 2014

Portable and Transparent Host-Device Communication Optimization for GPGPU Environments.

[BibT_eX]

[DOI]

Christos Margiolas

Proceedings of the 12th Annual IEEE/ACM International Symposium on Code Generation and Optimization, 2014

Exploitation of GPUs for the Parallelisation of Probably Parallel Legacy Code.

[BibT_eX]

[DOI]

Daniel Christopher Powell

Proceedings of the Compiler Construction - 23rd International Conference, 2014

A compiler framework for automatically mapping data parallel programs to heterogeneous MPSoCs.

[BibT_eX]

[DOI]

Kiran Chandramohan

Proceedings of the 2014 International Conference on Compilers, 2014

Exploiting GPU Hardware Saturation for Fast Compiler Optimization.

[BibT_eX]

[DOI]

Alberto Magni

Proceedings of the Seventh Workshop on General Purpose Processing Using GPUs, 2014

Measuring flexibility in single-ISA heterogeneous processors.

[BibT_eX]

[DOI]

Erik Tomusk

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

Automatic optimization of thread-coarsening for graphics processors.

[BibT_eX]

[DOI]

Alberto Magni

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

Using machine learning to partition streaming programs.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2013

A large-scale cross-architecture evaluation of thread-coarsening.

[BibT_eX]

[DOI]

Alberto Magni

Proceedings of the International Conference for High Performance Computing, 2013

OpenCL Task Partitioning in the Presence of GPU Contention.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2013

Portable mapping of data parallel programs to OpenCL for heterogeneous systems.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, 2013

Smart, adaptive mapping of parallelism in the presence of external workload.

[BibT_eX]

[DOI]

Murali Krishna Emani

Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, 2013

General chairs' welcome message.

[BibT_eX]

[DOI]

Christian Fensch

Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques, 2013

2012

Exploring and Predicting the Effects of Microarchitectural Parameters and Compiler Optimizations on Performance and Energy.

[BibT_eX]

[DOI]

ACM Trans. Embed. Comput. Syst., 2012

2011

Compiler Directed Issue Queue Energy Reduction.

[BibT_eX]

[DOI]

Trans. High Perform. Embed. Archit. Compil., 2011

An Empirical Architecture-Centric Approach to Microarchitectural Design Space Exploration.

[BibT_eX]

[DOI]

Christopher K. I. Williams

IEEE Trans. Computers, 2011

Milepost GCC: Machine Learning Enabled Self-tuning Compiler.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2011

A workload-aware mapping approach for data-parallel programs.

[BibT_eX]

[DOI]

Proceedings of the High Performance Embedded Architectures and Compilers, 2011

A Static Task Partitioning Approach for Heterogeneous Systems Using OpenCL.

[BibT_eX]

[DOI]

Proceedings of the Compiler Construction - 20th International Conference, 2011

2010

A Predictive Model for Dynamic Microarchitectural Adaptivity Control.

[BibT_eX]

[DOI]

Proceedings of the 43rd Annual IEEE/ACM International Symposium on Microarchitecture, 2010

Partitioning streaming parallelism for multi-cores: a machine learning based approach.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, 2010

2009

Energy-efficient register caching with compiler assistance.

[BibT_eX]

[DOI]

Oguz Ergin

ACM Trans. Archit. Code Optim., 2009

Exploring the limits of early register release: Exploiting compiler analysis.

[BibT_eX]

[DOI]

Oguz Ergin

ACM Trans. Archit. Code Optim., 2009

Obituary: Peter Knijnenburg (1961-2007).

[BibT_eX]

[DOI]

Henk J. Sips

Concurr. Comput. Pract. Exp., 2009

Mapping parallelism to multi-cores: a machine learning based approach.

[BibT_eX]

[DOI]

Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based mapping.

[BibT_eX]

[DOI]

Georgios Tournavitis

Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2009

Portable compiler optimisation across embedded programs and microarchitectures using machine learning.

[BibT_eX]

[DOI]

Proceedings of the 42st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-42 2009), 2009

Raced profiles: efficient selection of competing compiler optimizations.

[BibT_eX]

[DOI]

Bruce Worton

Proceedings of the 2009 ACM SIGPLAN/SIGBED conference on Languages, 2009

Reducing Training Time in a One-Shot Machine Learning-Based Compiler.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2009

Rapid early-stage microarchitecture design using predictive models.

[BibT_eX]

[DOI]

Proceedings of the 27th International Conference on Computer Design, 2009

Automatic Feature Generation for Machine Learning Based Optimizing Compilation.

[BibT_eX]

[DOI]

Proceedings of the CGO 2009, 2009

2008

Instruction Cache Energy Saving Through Compiler Way-Placement.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation and Test in Europe, 2008

Exploring and predicting the architecture/optimising compiler co-design space.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Compilers, 2008

2007

Introduction to Part 2.

[BibT_eX]

[DOI]

Marcelo Cintra

Trans. High Perform. Embed. Archit. Compil., 2007

Quick and Practical Run-Time Evaluation of Multiple Program Optimizations.

[BibT_eX]

[DOI]

Albert Cohen

Trans. High Perform. Embed. Archit. Compil., 2007

High-Performance Embedded Architecture and Compilation Roadmap.

[BibT_eX]

[DOI]

Dionisios N. Pnevmatikatos

Trans. High Perform. Embed. Archit. Compil., 2007

Microarchitectural Design Space Exploration Using an Architecture-Centric Approach.

[BibT_eX]

[DOI]

Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-40 2007), 2007

MiDataSets: Creating the Conditions for a More Realistic Evaluation of Iterative Optimization.

[BibT_eX]

[DOI]

Proceedings of the High Performance Embedded Architectures and Compilers, 2007

Topic 4 High-Performance Architectures and Compilers.

[BibT_eX]

[DOI]

José González

Lucian N. Vintan

Proceedings of the Euro-Par 2007, 2007

Rapidly Selecting Good Compiler Optimizations using Performance Counters.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Symposium on Code Generation and Optimization (CGO 2007), 2007

Fast compiler optimisation evaluation using code-feature based performance prediction.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Computing Frontiers, 2007

2006

Method-specific dynamic compilation using logistic regression.

[BibT_eX]

[DOI]

Proceedings of the 21th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2006

Predictive search distributions.

[BibT_eX]

[DOI]

Christopher K. I. Williams

Felix V. Agakov

Proceedings of the Machine Learning, 2006

Using Machine Learning to Focus Iterative Optimization.

[BibT_eX]

[DOI]

Christopher K. I. Williams

Marc Toussaint

Proceedings of the Fourth IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2006), 2006

Hybrid Optimizations: Which Optimization Algorithm to Use?.

[BibT_eX]

[DOI]

J. Eliot B. Moss

Proceedings of the Compiler Construction, 15th International Conference, 2006

Iterative Collective Loop Fusion.

[BibT_eX]

[DOI]

Thomas J. Ashby

Proceedings of the Compiler Construction, 15th International Conference, 2006

Automatic performance model construction for the fast software exploration of new hardware designs.

[BibT_eX]

[DOI]

Proceedings of the 2006 International Conference on Compilers, 2006

2005

A Complete Compiler Approach to Auto-Parallelizing C Programs for Multi-DSP Systems.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2005

IATAC: a smart predictor to turn-off L2 cache lines.

[BibT_eX]

[DOI]

Xavier Vera

ACM Trans. Archit. Code Optim., 2005

Automatic Tuning of Inlining Heuristics.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Probabilistic source-level optimisation of embedded programs.

[BibT_eX]

[DOI]

Proceedings of the 2005 ACM SIGPLAN/SIGBED Conference on Languages, 2005

Software Directed Issue Queue Power Reduction.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on High-Performance Computer Architecture (HPCA-11 2005), 2005

A Practical Method for Quickly Evaluating Program Optimizations.

[BibT_eX]

[DOI]

Albert Cohen

Proceedings of the High Performance Embedded Architectures and Compilers, 2005

Topic 4 - Compilers for High Performance.

[BibT_eX]

[DOI]

Albert Cohen

Martin Griebl

José Moreira

Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

Compiler Directed Early Register Release.

[BibT_eX]

[DOI]

Oguz Ergin

Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT 2005), 2005

2004

The effect of cache models on iterative compilation for combined tiling and unrolling.

[BibT_eX]

[DOI]

Kyle A. Gallivan

Concurr. Comput. Pract. Exp., 2004

A fast and accurate method for determining a lower bound on execution time.

[BibT_eX]

[DOI]

G. Watts

Concurr. Comput. Pract. Exp., 2004

Adaptive Java optimisation using instance-based learning.

[BibT_eX]

[DOI]

Shun Long

Proceedings of the 18th Annual International Conference on Supercomputing, 2004

Topic 4: Compilers for High Performance.

[BibT_eX]

[DOI]

Hans P. Zima

Siegfried Benkner

Beniamino Di Martino

Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Cross Component Optimisation in a High Level Category-Based Language.

[BibT_eX]

[DOI]

Thomas J. Ashby

Anthony D. Kennedy

Proceedings of the Euro-Par 2004 Parallel Processing, 2004

2003

Combined Selection of Tile Sizes and Unroll Factors Using Iterative Compilation.

[BibT_eX]

[DOI]

J. Supercomput., 2003

Array recovery and high-level transformations for DSP applications.

[BibT_eX]

[DOI]

ACM Trans. Embed. Comput. Syst., 2003

Towards general and exact distributed invalidation.

[BibT_eX]

[DOI]

Elena A. Stöhr

J. Parallel Distributed Comput., 2003

Topic Introduction.

[BibT_eX]

[DOI]

Michael Gerndt

Chau-Wen Tseng

Markus Schordan

Proceedings of the Euro-Par 2003. Parallel Processing, 2003

Compiler parallelization of C programs for multi-core DSPs with multiple address spaces.

[BibT_eX]

[DOI]

Proceedings of the 1st IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis, 2003

Combining Program Recovery, Auto-Parallelisation and Locality Analysis for C Programs on Multi-Processor Embedded Systems.

[BibT_eX]

[DOI]

Proceedings of the 12th International Conference on Parallel Architectures and Compilation Techniques (PACT 2003), 27 September, 2003

2002

Compile Time Barrier Synchronization Minimization.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2002

Integrating Loop and Data Transformations for Global Optimization.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2002

Iterative Compilation.

[BibT_eX]

[DOI]

Proceedings of the Embedded Processor Design Challenges: Systems, Architectures, Modeling, and Simulation, 2002

Evaluating Iterative Compilation.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 15th Workshop, 2002

2001

Topic 04: Compilers for High Performance.

[BibT_eX]

[DOI]

Jens Knoop

Manish Gupta

Keshav Pingali

Proceedings of the Euro-Par 2001: Parallel Processing, 2001

Compiler Transformation of Pointers to Explicit Array Accesses in DSP Applications.

[BibT_eX]

[DOI]

Proceedings of the Compiler Construction, 10th International Conference, 2001

An empirical evaluation of high level transformations for embedded processors.

[BibT_eX]

[DOI]

Proceedings of the 2001 International Conference on Compilers, 2001

2000

Exact Distributed Invalidation.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999

Nonsingular Data Transformations: Definition, Validity, and Applications.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 1999

A Feasibility Study in Iterative Compilation.

[BibT_eX]

[DOI]

Harry A. G. Wijshoff

Proceedings of the High Performance Computing, Second International Symposium, 1999

OCEANS - Optimising Compilers for Embedded Applications.

[BibT_eX]

[DOI]

Paul van der Mark

Andy Nisbet

Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

Efficient Parallelization Using Combined Loop and Data Transformations.

[BibT_eX]

[DOI]

Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999

1998

First Fast Sink: A compiler algorithm for barrier placement optimisation.

[BibT_eX]

[DOI]

Elena A. Stöhr

Future Gener. Comput. Syst., 1998

MARS: A Distributed Memory Approach to Shared Memory Compilation.

[BibT_eX]

[DOI]

Proceedings of the Languages, 1998

OCEANS: Optimising Compilers for Embedded Applications.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par '98 Parallel Processing, 1998

1997

A Graph Based Approach to Barrier Synchronisation Minimisation.

[BibT_eX]

[DOI]

Proceedings of the 11th international conference on Supercomputing, 1997

Non-Singular Data Transformations: Definition, Validity and Applications.

[BibT_eX]

[DOI]

Proceedings of the 11th international conference on Supercomputing, 1997

Barrier Synchronisation Optimisation.

[BibT_eX]

[DOI]

Proceedings of the High-Performance Computing and Networking, 1997

OCEANS: Optimizing Compilers for Embedded Applications.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par '97 Parallel Processing, 1997

1996

Expert Programmer versus Parallelizing Compiler: A Comparative Study of Two Approaches for Distributed Shared Memory.

[BibT_eX]

[DOI]

J. Mark Bull

Sci. Program., 1996

Practical Loop Generation.

[BibT_eX]

[DOI]

Zbigniew Chamski

Proceedings of the 29th Annual Hawaii International Conference on System Sciences (HICSS-29), 1996

Compiler Reduction of Invalidation Traffic in Virtual Shared Memory Systems.

[BibT_eX]

[DOI]

Andy Nisbet

Proceedings of the Euro-Par '96 Parallel Processing, 1996

A compiler algorithm to reduce invalidation latency in virtual shared memory systems.

[BibT_eX]

[DOI]

Andy Nisbet

Proceedings of the Fifth International Conference on Parallel Architectures and Compilation Techniques, 1996

1995

Synchronization Minimization in a SPMD Execution Model.

[BibT_eX]

[DOI]

L. Kervella

J. Parallel Distributed Comput., 1995

A hierarchical locality algorithm for NUMA compilation.

[BibT_eX]

[DOI]

Proceedings of the 3rd Euromicro Workshop on Parallel and Distributed Processing (PDP '95), 1995

A Compiler Strategy for Shared Virtual Memories.

[BibT_eX]

[DOI]

Proceedings of the Languages, 1995

Compiler Reduction of Synchronisation in Shared Virtual Memory Systems.

[BibT_eX]

[DOI]

Proceedings of the 9th international conference on Supercomputing, 1995

1994

A Data Partitioning Algorithm for Distributed Memory Compilation.

[BibT_eX]

[DOI]

Proceedings of the PARLE '94: Parallel Architectures and Languages Europe, 1994

1993

Program and data transformations for efficient execution on distributed memory architectures.

[BibT_eX]

PhD thesis, 1993

1992

A New Program Transformation to Minimise Communication in Distributed Memory Architecture.

[BibT_eX]

[DOI]

G. A. Hedayat

Proceedings of the PARLE '92: Parallel Architectures and Languages Europe, 1992

A transformational approach to compiling Sisal for distributed memory architectures.

[BibT_eX]

[DOI]