Simon J. Pennycook

Orcid: 0000-0003-0237-3823

Affiliations:
  • Intel
  • University of Warwick


According to our database1, Simon J. Pennycook authored at least 33 papers between 2011 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
A Performance-Portable SYCL Implementation of CRK-HACC for Exascale.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Towards Alignment of Parallelism in SYCL and ISO C++.
Proceedings of the 2023 International Workshop on OpenCL, 2023

2022
Untangling Modern Parallel Programming Models.
Proceedings of the IWOCL'22: International Workshop on OpenCL, Bristol, United Kingdom, May 10, 2022

2021
Navigating Performance, Portability, and Productivity.
Comput. Sci. Eng., 2021

A Holistic Systems Approach to Leveraging Heterogeneity.
Proceedings of the IEEE/ACM Programming Environments for Heterogeneous Computing, 2021

Revisiting a Metric for Performance Portability.
Proceedings of the International Workshop on Performance, 2021

Analyzing Reduction Abstraction Capabilities.
Proceedings of the International Workshop on Performance, 2021

Toward a Better Defined SYCL Memory Consistency Model.
Proceedings of the IWOCL'21: International Workshop on OpenCL, Munich Germany, April, 2021, 2021

2020
Interpreting and Visualizing Performance Portability Metrics.
Proceedings of the IEEE/ACM International Workshop on Performance, 2020

Data Parallel C++: Enhancing SYCL Through Extensions for Productivity and Performance.
Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020

2019
Developments in memory management in OpenMP.
Int. J. High Perform. Comput. Netw., 2019

Implications of a metric for performance portability.
Future Gener. Comput. Syst., 2019

2018
CosmoFlow: using deep learning to learn the universe at scale.
Proceedings of the International Conference for High Performance Computing, 2018

Supporting Function Variants in OpenMP.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

2017
High-Performance Code Generation though Fusion and Vectorization.
CoRR, 2017

IXPUG: Experiences on Intel Knights Landing at the One Year Mark.
Proceedings of the High Performance Computing, 2017

2016
Separable projection integrals for higher-order correlators of the cosmic microwave sky: Acceleration by factors exceeding 100.
J. Comput. Phys., 2016

A Metric for Performance Portability.
CoRR, 2016

A Modern Memory Management System for OpenMP.
Proceedings of the Third Workshop on Accelerator Programming Using Directives, 2016

Workstealing and Nested Parallelism in SMP Systems.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

2013
An investigation of the performance portability of OpenCL.
J. Parallel Distributed Comput., 2013

Parallel File System Analysis Through Application I/O Tracing.
Comput. J., 2013

Exploring SIMD for Molecular Dynamics, Using Intel® Xeon® Processors and Intel® Xeon Phi Coprocessors.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Model-Led Optimisation of a Geometric Multigrid Application.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

2012
Evaluating the performance of legacy applications on emerging parallel architectures.
PhD thesis, 2012

On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures.
Comput. J., 2012

Towards the Automated Generation of Hard Disk Models through Physical Geometry Discovery.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Developing Performance-Portable Molecular Dynamics Kernels in OpenCL.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

LDPLFS: Improving I/O Performance without Application Modification.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

2011
Should we worry about memory loss?
SIGMETRICS Perform. Evaluation Rev., 2011

Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark.
SIGMETRICS Perform. Evaluation Rev., 2011

Light-Weight Parallel I/O Analysis at Scale.
Proceedings of the Computer Performance Engineering, 2011

WMTools - Assessing Parallel Application Memory Utilisation at Scale.
Proceedings of the Computer Performance Engineering, 2011


  Loading...