Simon D. Hammond

According to our database1, Simon D. Hammond authored at least 67 papers between 2006 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Perspectives on AI Architectures and Co-design for Earth System Predictability.
CoRR, 2023

Enabling power measurement and control on Astra: The first petascale Arm supercomputer.
Concurr. Comput. Pract. Exp., 2023

Evaluation of HPC Workloads Running on Open-Source RISC-V Hardware.
Proceedings of the High Performance Computing, 2023

2022
Understanding Power and Energy Utilization in Large Scale Production Physics Simulation Codes.
CoRR, 2022

Minerva: Rethinking Secure Architectures for the Era of Fabric-Attached Memory Architectures.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

2021
StressBench: A Configurable Full System Network and I/O Benchmark Framework.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

DeACT: Architecture-Aware Virtual Memory Support for Fabric Attached Memory Systems.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

Stealth-Persist: Architectural Support for Persistent Applications in Hybrid Memory Systems.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

2020
Chronicles of astra: challenges and lessons from the first petascale arm supercomputer.
Proceedings of the International Conference for High Performance Computing, 2020

PreFAM: Understanding the Impact of Prefetching in Fabric-Attached Memory Architectures.
Proceedings of the MEMSYS 2020: The International Symposium on Memory Systems, 2020

2019
Scalable generation of graphs for benchmarking HPC community-detection algorithms.
Proceedings of the International Conference for High Performance Computing, 2019

Page migration support for disaggregated non-volatile memories.
Proceedings of the International Symposium on Memory Systems, 2019

Investigating Fairness in Disaggregated Non-Volatile Memories.
Proceedings of the 2019 IEEE Computer Society Annual Symposium on VLSI, 2019

Evaluating the Marvell ThunderX2 Server Processor for HPC Workloads.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

Automatic Generation of Warp-Level Primitives and Atomic Instructions for Fast and Portable Parallel Reduction on GPUs.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

2018
Sparse Matrix-Matrix Multiplication on Multilevel Memory Architectures : Algorithms and Experiments.
CoRR, 2018

Profiling and Debugging Support for the Kokkos Programming Model.
Proceedings of the High Performance Computing, 2018

Exploring Allocation Policies in Disaggregated Non-Volatile Memories.
Proceedings of the Workshop on Memory Centric High Performance Computing, 2018

Evaluating the Intel Skylake Xeon Processor for HPC Workloads.
Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

Optimizing for KNL Usage Modes When Data Doesn't Fit in MCDRAM.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
Optical interconnects for extreme scale computing systems.
Parallel Comput., 2017

Two-level main memory co-design: Multi-threaded algorithmic primitives, analysis, and simulation.
J. Parallel Distributed Comput., 2017

Designing vector-friendly compact BLAS and LAPACK kernels.
Proceedings of the International Conference for High Performance Computing, 2017

Performance analysis for using non-volatile memory DIMMs: opportunities and challenges.
Proceedings of the International Symposium on Memory Systems, 2017

Double Buffering for MCDRAM on Second Generation $$\hbox {Intel}^{\circledR }$$ Xeon Phi $$^{\text {TM}}$$ Processors with OpenMP.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

Fast linear algebra-based triangle counting with KokkosKernels.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on Next-Generation Architectures.
Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

2016
Analyzing allocation behavior for multi-level memory.
Proceedings of the Second International Symposium on Memory Systems, 2016

Multi-Level Memory Policies: What You Add Is More Important Than What You Take Out.
Proceedings of the Second International Symposium on Memory Systems, 2016

End-to-End Modeling and Optimization of Power Consumption in HPC Interconnects.
Proceedings of the 45th International Conference on Parallel Processing Workshops, 2016

(SAI) Stalled, Active and Idle: Characterizing Power and Performance of Large-Scale Dragonfly Networks.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

2015
Design Methodology for Optimizing Optical Interconnection Networks in High Performance Systems.
Proceedings of the High Performance Computing - 30th International Conference, 2015

The Potential and Perils of Multi-Level Memory.
Proceedings of the 2015 International Symposium on Memory Systems, 2015

k-Means Clustering on Two-Level Memory Systems.
Proceedings of the 2015 International Symposium on Memory Systems, 2015

Toward transparent optical networking in exascale computers.
Proceedings of the European Conference on Optical Communication, 2015

2014
An evaluation of MPI message rate on hybrid-core processors.
Int. J. High Perform. Comput. Appl., 2014

Exascale design space exploration and co-design.
Future Gener. Comput. Syst., 2014

SNAP: Strong Scaling High Fidelity Molecular Dynamics Simulations on Leadership-Class Computing Platforms.
Proceedings of the Supercomputing - 29th International Conference, 2014

Abstract machine models and proxy architectures for exascale computing.
Proceedings of the 1st International Workshop on Hardware-Software Co-Design for High Performance Computing, 2014

2013
Reducing the Bulk in the Bulk Synchronous Parallel Model.
Parallel Process. Lett., 2013

An investigation of the performance portability of OpenCL.
J. Parallel Distributed Comput., 2013

Parallel File System Analysis Through Application I/O Tracing.
Comput. J., 2013

Towards Automated Memory Model Generation Via Event Tracing.
Comput. J., 2013

Analysis of Cray XC30 Performance Using Trinity-NERSC-8 Benchmarks and Comparison with Cray XE6 and IBM BG/Q.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

The impact of hybrid-core processors on MPI message rate.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

Application Explorations for Future Interconnects.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

GPU acceleration of Data Assembly in Finite Element Methods and its energy implications.
Proceedings of the 24th International Conference on Application-Specific Systems, 2013

2012
On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures.
Comput. J., 2012

Unprecedented Scalability and Performance of the New NNSA Tri-Lab Linux Capacity Cluster 2.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Navigating an Evolutionary Fast Path to Exascale.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Poster: Assessing the Predictive Capabilities of Mini-applications.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

LDPLFS: Improving I/O Performance without Application Modification.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

On the Role of Co-design in High Performance Computing.
Proceedings of the Transition of HPC Towards Exascale Computing, 2012

2011
Should we worry about memory loss?
SIGMETRICS Perform. Evaluation Rev., 2011

Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark.
SIGMETRICS Perform. Evaluation Rev., 2011

Benchmarking and modelling of POWER7, Westmere, BG/P, and GPUs: an industry case study.
SIGMETRICS Perform. Evaluation Rev., 2011

Predictive analysis of a hydrodynamics application on large-scale CMP clusters.
Comput. Sci. Res. Dev., 2011

Light-Weight Parallel I/O Analysis at Scale.
Proceedings of the Computer Performance Engineering, 2011

WMTools - Assessing Parallel Application Memory Utilisation at Scale.
Proceedings of the Computer Performance Engineering, 2011

2010
To upgrade or not to upgrade? Catamount vs. Cray Linux Environment.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009
Performance prediction and procurement in practice: assessing the suitability of commodity cluster components for wavefront codes.
IET Softw., 2009

WARPP: a toolkit for simulating high-performance parallel scientific codes.
Proceedings of the 2nd International Conference on Simulation Tools and Techniques for Communications, 2009

Predictive analysis and optimisation of pipelined wavefront computations.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Predictive Simulation of HPC Applications.
Proceedings of the IEEE 23rd International Conference on Advanced Information Networking and Applications, 2009

2007
Distributed Broadcast Scheduling in Mobile Ad Hoc Networks with Unknown Topologies.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Predicting the Effect on Performance of Container-Managed Persistence in a Distributed Enterprise Application.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006
Loop Transformations in the Ahead-of-Time Optimization of Java Bytecode.
Proceedings of the Compiler Construction, 15th International Conference, 2006


  Loading...