Nicholas J. Wright

ORCID: 0000-0002-4914-6997

According to our database, Nicholas J. Wright authored at least 57 papers between 2007 and 2023.

Bibliography

2023
An automated and portable method for selecting an optimal GPU frequency.
Future Gener. Comput. Syst., December, 2023

Evaluating the Potential of Disaggregated Memory Systems for HPC applications.
CoRR, 2023

Not all applications have boring communication patterns: Profiling message matching with BMM.
Concurr. Comput. Pract. Exp., 2023

Power Analysis of NERSC Production Workloads.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

A Performance Model for Estimating the Cost of Scaling to Practical Quantum Advantage.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Performance-Aware Energy-Efficient GPU Frequency Selection using DNN-based Models.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

2022
Understanding the Impact of Input Entropy on FPU, CPU, and GPU Power.
CoRR, 2022

A DPU Solution for Container Overlay Networks.
CoRR, 2022

FPGA-based HPC accelerators: An evaluation on performance and energy efficiency.
Concurr. Comput. Pract. Exp., 2022

A Methodology for Evaluating Tightly-integrated and Disaggregated Accelerated Architectures.
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022

Optimal GPU Frequency Selection using Multi-Objective Approaches for HPC Systems.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2022

2021
Use It or Lose It: Cheap Compute Everywhere.
Proceedings of the Driving Scientific and Engineering Discoveries Through the Integration of Experiment, Big Data, and Modeling and Simulation, 2021

Non-recurring engineering (NRE) best practices: a case study with the NERSC/NVIDIA OpenMP contract.
Proceedings of the International Conference for High Performance Computing, 2021

Architectural Requirements for Deep Learning Workloads in HPC Environments.
Proceedings of the 2021 International Workshop on Performance Modeling, 2021

Understanding power variation and its implications on performance optimization on the Cori supercomputer.
Proceedings of the 2021 International Workshop on Performance Modeling, 2021

New Challenges of Benchmarking All-Flash Storage for HPC.
Proceedings of the 6th IEEE/ACM International Parallel Data Systems Workshop, 2021

Experiences Porting the SU3_Bench Microbenchmark to the Intel Arria 10 and Xilinx Alveo U280 FPGAs.
Proceedings of the IWOCL'21: International Workshop on OpenCL, Munich, Germany, April 2021

2020
Performance characterization of scientific workflows for the optimal use of Burst Buffers.
Future Gener. Comput. Syst., 2020

Characterizing Scientific Workflows on HPC Systems using Logs.
Proceedings of the IEEE/ACM Workflows in Support of Large-Scale Science, 2020

Performance Assessment of OpenMP Compilers Targeting NVIDIA V100 GPUs.
Proceedings of the Accelerator Programming Using Directives - 7th International Workshop, 2020

The Performance and Energy Efficiency Potential of FPGAs in Scientific Computing.
Proceedings of the 2020 IEEE/ACM Performance Modeling, 2020

Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches.
Proceedings of the 2020 IEEE/ACM Performance Modeling, 2020

A Case Study of Porting HPGMG from CUDA to OpenMP Target Offload.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

Uncovering Access, Reuse, and Sharing Characteristics of I/O-Intensive Files on Large-Scale Production HPC Systems.
Proceedings of the 18th USENIX Conference on File and Storage Technologies, 2020

Quantifying the impact of network congestion on application performance and network metrics.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019
A Quantitative Approach to Architecting All-Flash Lustre File Systems.
Proceedings of the High Performance Computing, 2019

Evaluation of Directive-Based GPU Programming Models on a Block Eigensolver with Consideration of Large Sparse Matrices.
Proceedings of the Accelerator Programming Using Directives - 6th International Workshop, 2019

Understanding Data Motion in the Modern HPC Data Center.
Proceedings of the IEEE/ACM Fourth International Parallel Data Systems Workshop, 2019

GPCNeT: designing a benchmark suite for inducing and measuring contention in HPC networks.
Proceedings of the International Conference for High Performance Computing, 2019

A Zoom-in Analysis of I/O Logs to Detect Root Causes of I/O Performance Bottlenecks.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018
A year in the life of a parallel file system.
Proceedings of the International Conference for High Performance Computing, 2018

A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity.
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

IOMiner: Large-Scale Analytics Framework for Gaining Knowledge from I/O Logs.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017
UMAMI: a recipe for generating meaningful metrics through holistic I/O performance analysis.
Proceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems, 2017

Performance analysis of emerging data analytics and HPC workloads.
Proceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems, 2017

Performance and Energy Usage of Workloads on KNL and Haswell Architectures.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2017

Understanding Performance Variability on the Aries Dragonfly Network.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016
Modular HPC I/O Characterization with Darshan.
Proceedings of the 5th Workshop on Extreme-Scale Programming Tools, 2016

2014
Roofline Model Toolkit: A Practical Tool for Architectural and Program Analysis.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

Maximizing Throughput on a Dragonfly Network.
Proceedings of the International Conference for High Performance Computing, 2014

Measurement and interpretation of microbenchmark and application energy use on the Cray XC30.
Proceedings of the 2nd International Workshop on Energy Efficient Supercomputing, 2014

Abstract machine models and proxy architectures for exascale computing.
Proceedings of the 1st International Workshop on Hardware-Software Co-Design for High Performance Computing, 2014

Cori: A Pre-Exascale Supercomputer for Big Data and HPC Applications.
Proceedings of the Big Data and High Performance Computing, 2014

2013
Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

Analysis of Cray XC30 Performance Using Trinity-NERSC-8 Benchmarks and Comparison with Cray XE6 and IBM BG/Q.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

Extreme Data Science at the National Energy Research Scientific Computing (NERSC) Center.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

2012
A preliminary evaluation of the hardware acceleration of the Cray Gemini interconnect for PGAS languages and comparison with MPI.
SIGMETRICS Perform. Evaluation Rev., 2012

Evaluating Interconnect and Virtualization Performance for High Performance Computing.
SIGMETRICS Perform. Evaluation Rev., 2012

2011
Comprehensive Performance Monitoring for GPU Cluster Systems.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010
A programming model performance study using the NAS parallel benchmarks.
Sci. Program., 2010

Parallel I/O performance: From events to ensembles.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Effective Performance Measurement at Petascale Using IPM.
Proceedings of the 16th IEEE International Conference on Parallel and Distributed Systems, 2010

Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud.
Proceedings of the Cloud Computing, Second International Conference, 2010

Effective Holistic Performance Measurement at Petascale Using IPM.
Proceedings of the Competence in High Performance Computing 2010, 2010

2009
Performance Analysis and Workload Characterization with IPM.
Proceedings of the Tools for High Performance Computing 2009, 2009

2008
Modeling and predicting application performance on parallel computers using HPC challenge benchmarks.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2007
WRF nature run.
Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

