Simon J. Pennycook

According to our database1, Simon J. Pennycook authored at least 21 papers between 2011 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2019
Developments in memory management in OpenMP.
IJHPCN, 2019

Implications of a metric for performance portability.
Future Generation Comp. Syst., 2019

2018
CosmoFlow: using deep learning to learn the universe at scale.
Proceedings of the International Conference for High Performance Computing, 2018

Supporting Function Variants in OpenMP.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

2017
IXPUG: Experiences on Intel Knights Landing at the One Year Mark.
Proceedings of the High Performance Computing, 2017

2016
Separable projection integrals for higher-order correlators of the cosmic microwave sky: Acceleration by factors exceeding 100.
J. Comput. Physics, 2016

A Modern Memory Management System for OpenMP.
Proceedings of the Third Workshop on Accelerator Programming Using Directives, 2016

Workstealing and Nested Parallelism in SMP Systems.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

2013
An investigation of the performance portability of OpenCL.
J. Parallel Distrib. Comput., 2013

Parallel File System Analysis Through Application I/O Tracing.
Comput. J., 2013

Exploring SIMD for Molecular Dynamics, Using Intel® Xeon® Processors and Intel® Xeon Phi Coprocessors.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Model-Led Optimisation of a Geometric Multigrid Application.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

2012
Evaluating the performance of legacy applications on emerging parallel architectures.
PhD thesis, 2012

On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures.
Comput. J., 2012

Towards the Automated Generation of Hard Disk Models through Physical Geometry Discovery.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Developing Performance-Portable Molecular Dynamics Kernels in OpenCL.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

LDPLFS: Improving I/O Performance without Application Modification.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

2011
Should we worry about memory loss?
SIGMETRICS Performance Evaluation Review, 2011

Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark.
SIGMETRICS Performance Evaluation Review, 2011

Light-Weight Parallel I/O Analysis at Scale.
Proceedings of the Computer Performance Engineering, 2011

WMTools - Assessing Parallel Application Memory Utilisation at Scale.
Proceedings of the Computer Performance Engineering, 2011


  Loading...