Christopher S. Daley

Orcid: 0000-0003-3105-0804

Affiliations:
  • Lawrence Berkeley National Laboratory, NERSC center
  • University of Chicago, Flash Center for Computational Science


According to our database1, Christopher S. Daley authored at least 31 papers between 2011 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Evaluating the Potential of Disaggregated Memory Systems for HPC applications.
CoRR, 2023

OpenMP Offload Features and Strategies for High Performance across Architectures and Compilers.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

2022
Flash-X: A multiphysics simulation software instrument.
SoftwareX, 2022


A Methodology for Evaluating Tightly-integrated and Disaggregated Accelerated Architectures.
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022

A Portable Sparse Solver Framework for Large Matrices on Heterogeneous Architectures.
Proceedings of the 29th IEEE International Conference on High Performance Computing, 2022

2021
Nyx: A Massively Parallel AMR Code for Computational Cosmology.
J. Open Source Softw., 2021

GPU Offloading of a Large-Scale Gyrokinetic Particle-in-Cell Fortran Code on Summit: From OpenACC to OpenMP.
Proceedings of the Accelerator Programming Using Directives - 8th International Workshop, 2021

Non-recurring engineering (NRE) best practices: a case study with the NERSC/NVIDIA OpenMP contract.
Proceedings of the International Conference for High Performance Computing, 2021



2020
Performance characterization of scientific workflows for the optimal use of Burst Buffers.
Future Gener. Comput. Syst., 2020

Experiences in porting mini-applications to OpenACC and OpenMP on heterogeneous systems.
Concurr. Comput. Pract. Exp., 2020

Characterizing Scientific Workflows on HPC Systems using Logs.
Proceedings of the IEEE/ACM Workflows in Support of Large-Scale Science, 2020

Enabling Execution of a Legacy CFD Mini Application on Accelerators Using OpenMP.
Proceedings of the High Performance Computing - 35th International Conference, 2020

Performance Assessment of OpenMP Compilers Targeting NVIDIA V100 GPUs.
Proceedings of the Accelerator Programming Using Directives - 7th International Workshop, 2020

A Case Study of Porting HPGMG from CUDA to OpenMP Target Offload.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

2019
Evaluation of Directive-Based GPU Programming Models on a Block Eigensolver with Consideration of Large Sparse Matrices.
Proceedings of the Accelerator Programming Using Directives - 6th International Workshop, 2019

2018
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity.
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

2017
Performance analysis of emerging data analytics and HPC workloads.
Proceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems, 2017

Performance and Energy Usage of Workloads on KNL and Haswell Architectures.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2017

2016
Optimizations in a high-performance conjugate gradient benchmark for IA-based multi- and many-core processors.
Int. J. High Perform. Comput. Appl., 2016

2015
Ongoing verification of a multiphysics community code: FLASH.
Softw. Pract. Exp., 2015

Lessons Learned from Optimizing Science Kernels for Intel's "Knights Corner"' Architecture.
Comput. Sci. Eng., 2015

2014
Evolution of FLASH, a multi-physics scientific simulation code for high-performance computing.
Int. J. High Perform. Comput. Appl., 2014

Cori: A Pre-Exascale Supercomputer for Big Data and HPC Applications.
Proceedings of the Big Data and High Performance Computing, 2014

2013
Pragmatic optimizations for better scientific utilization of large supercomputers.
Int. J. High Perform. Comput. Appl., 2013

Analysis of Cray XC30 Performance Using Trinity-NERSC-8 Benchmarks and Comparison with Cray XE6 and IBM BG/Q.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

Parallel Algorithms for Using Lagrangian Markers in Immersed Boundary Method with Adaptive Mesh Refinement in FLASH.
Proceedings of the 25th International Symposium on Computer Architecture and High Performance Computing, 2013

2012
Optimization of multigrid based elliptic solver for large scale simulations in the FLASH code.
Concurr. Comput. Pract. Exp., 2012

2011
Parallel algorithms for moving Lagrangian data on block structured Eulerian meshes.
Parallel Comput., 2011


  Loading...