Ryan E. Grant

Orcid: 0000-0002-0163-3892

Affiliations:
  • Sandia National Laboratories


According to our database1, Ryan E. Grant authored at least 77 papers between 1993 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Design of a portable implementation of partitioned point-to-point communication primitives.
Concurr. Comput. Pract. Exp., 2023

Enabling power measurement and control on Astra: The first petascale Arm supercomputer.
Concurr. Comput. Pract. Exp., 2023

Modeling and Benchmarking the Potential Benefit of Early-Bird Transmission in Fine-Grained Communication.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

A Dynamic Network-Native MPI Partitioned Aggregation Over InfiniBand Verbs.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

A Lightweight Network Traffic Prediction Method for SmartNICs.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022
APE: Metrics for understanding application performance efficiency under power caps.
Sustain. Comput. Informatics Syst., 2022

Special Issue on Hot Interconnects.
IEEE Micro, 2022

"Smarter" NICs for faster molecular dynamics: a case study.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Micro-Benchmarking MPI Partitioned Point-to-Point Communication.
Proceedings of the 51st International Conference on Parallel Processing, 2022

2021
Implementation and evaluation of MPI 4.0 partitioned communication libraries.
Parallel Comput., 2021

Hot Interconnects 27.
IEEE Micro, 2021

RVMA: Remote Virtual Memory Access.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Design of a Portable Implementation of Partitioned Point-to-Point Communication Primitives.
Proceedings of the ICPP Workshops 2021: 50th International Conference on Parallel Processing, 2021

Partitioned Collective Communication.
Proceedings of the Workshop on Exascale MPI, 2021

MiniMod: A Modular Miniapplication Benchmarking Framework for HPC.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Hot Interconnects 26.
IEEE Micro, 2020

Foreword to the Special Issue of the Workshop on Exascale MPI (ExaMPI 2017).
Concurr. Comput. Pract. Exp., 2020

Hardware MPI message matching: Insights into MPI matching behavior to inform design.
Concurr. Comput. Pract. Exp., 2020

Tail queues: A multi-threaded matching architecture.
Concurr. Comput. Pract. Exp., 2020

A survey of MPI usage in the US exascale computing project.
Concurr. Comput. Pract. Exp., 2020

ALAMO: Autonomous Lightweight Allocation, Management, and Optimization.
Proceedings of the Driving Scientific and Engineering Discoveries Through the Convergence of HPC, Big Data and AI, 2020

RaDD Runtimes: Radical and Different Distributed Runtimes with SmartNICs.
Proceedings of the Fourth IEEE/ACM Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware, 2020

Workshop 16: SNACS Scalable Networks for Advanced Computing Systems.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Low-cost MPI Multithreaded Message Matching Benchmarking.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

2019
Small scale to extreme: Methods for characterizing energy efficiency in supercomputing applications.
Sustain. Comput. Informatics Syst., 2019

Using simulation to examine the effect of MPI message matching costs on application performance.
Parallel Comput., 2019

A dynamic, unified design for dedicated message matching engines for collective and point-to-point communications.
Parallel Comput., 2019

Finepoints: Partitioned Multithreaded MPI Communication.
Proceedings of the High Performance Computing - 34th International Conference, 2019

INCA: in-network compute assistance.
Proceedings of the International Conference for High Performance Computing, 2019

MPI tag matching performance on ConnectX and ARM.
Proceedings of the 26th European MPI Users' Group Meeting, 2019

Introduction to SNACS 2019.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

Fuzzy Matching: Hardware Accelerated MPI Communication Middleware.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018
Unraveling Network-Induced Memory Contention: Deeper Insights with Machine Learning.
IEEE Trans. Parallel Distributed Syst., 2018

Characterizing MPI matching via trace-based simulation.
Parallel Comput., 2018

A Comparison of Power Management Mechanisms: P-States vs. Node-Level Power Cap Control.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

A Dedicated Message Matching Mechanism for Collective Communications.
Proceedings of the 47th International Conference on Parallel Processing, 2018

Improving MPI Multi-threaded RMA Communication Performance.
Proceedings of the 47th International Conference on Parallel Processing, 2018

The Case for Semi-Permanent Cache Occupancy: Understanding the Impact of Data Locality on Network Processing.
Proceedings of the 47th International Conference on Parallel Processing, 2018

Measuring Multithreaded Message Matching Misery.
Proceedings of the Euro-Par 2018: Parallel Processing, 2018

2017
sPIN: high-performance streaming processing in the network.
Proceedings of the International Conference for High Performance Computing, 2017

Evaluating energy and power profiling techniques for HPC workloads.
Proceedings of the Eighth International Green and Sustainable Computing Conference, 2017

Enabling Diverse Software Stacks on Supercomputers Using High Performance Virtual Clusters.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

A Tale of Two Systems: Using Containers to Deploy HPC Applications on Supercomputers and Clouds.
Proceedings of the IEEE International Conference on Cloud Computing Technology and Science, 2017

2016
Program optimizations: The interplay between power, performance, and energy.
Parallel Comput., 2016

Hot Interconnects 23.
IEEE Micro, 2016

Standardizing Power Monitoring and Control at Exascale.
Computer, 2016

MPI Sessions: Leveraging Runtime Infrastructure to Increase Scalability of Applications at Exascale.
Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

SHMEM-MT: A Benchmark Suite for Assessing Multi-threaded SHMEM Performance.
Proceedings of the OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, 2016

NiMC: Characterizing and Eliminating Network-Induced Memory Contention.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Overcoming Challenges in Scalable Power Monitoring with the Power API.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

MPI Performance Characterization on InfiniBand with Fine-Grain Multithreaded Communication.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

(SAI) Stalled, Active and Idle: Characterizing Power and Performance of Large-Scale Dragonfly Networks.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

RMA-MT: A Benchmark Suite for Assessing MPI Multi-threaded RMA Performance.
Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

2015
Scalable Network Communication Using Unreliable RDMA.
Proceedings of the Handbook on Data Centers, 2015

Scalable connectionless RDMA over unreliable datagrams.
Parallel Comput., 2015

Overtime: a tool for analyzing performance variation due to network interference.
Proceedings of the 3rd Workshop on Exascale MPI, 2015

Preparing for exascale: modeling MPI for many-core systems using fine-grain queues.
Proceedings of the 3rd Workshop on Exascale MPI, 2015

Toward an evolutionary task parallel integrated MPI + X programming model.
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

Optimizing Explicit Hydrodynamics for Power, Energy, and Performance.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Re-evaluating Network Onload vs. Offload for the Many-Core Era.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
Enabling communication concurrency through flexible MPI endpoints.
Int. J. High Perform. Comput. Appl., 2014

An evaluation of MPI message rate on hybrid-core processors.
Int. J. High Perform. Comput. Appl., 2014

Early experiences co-scheduling work and communication tasks for hybrid MPI+X applications.
Proceedings of the 2014 Workshop on Exascale MPI, 2014

Energy Consumption of Resilience Mechanisms in Large Scale Systems.
Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

Metrics for Evaluating Energy Saving Techniques for Resilient HPC Systems.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013
Evaluating energy savings for checkpoint/restart.
Proceedings of the 1st International Workshop on Energy Efficient Supercomputing, 2013

Protocols for Fully Offloaded Collective Operations on Accelerated Network Adapters.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

2011
RDMA Capable iWARP over Datagrams.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010
A study of hardware assisted IP over InfiniBand and its impact on enterprise data center performance.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2010

iWARP redefined: Scalable connectionless communication over high-speed Ethernet.
Proceedings of the 2010 International Conference on High Performance Computing, 2010

2009
Improving energy efficiency of asymmetric chip multithreaded multiprocessors through reduced OS noise scheduling.
Concurr. Comput. Pract. Exp., 2009

Evaluation of ConnectX Virtual Protocol Interconnect for Data Centers.
Proceedings of the 15th IEEE International Conference on Parallel and Distributed Systems, 2009

2008
An Analysis of QoS Provisioning for Sockets Direct Protocol vs. IPoIB over Modern InfiniBand Networks.
Proceedings of the 37th International Conference on Parallel Processing, 2008

2007
A Comprehensive Analysis of OpenMP Applications on Dual-Core Intel Xeon SMPs.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Improving system efficiency through scheduling and power management.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

2006
Power-performance efficiency of asymmetric multiprocessors for multi-threaded scientific applications.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

1993
NCSA <i>mosaic</i> 1993.
J. Comput. High. Educ., 1993


  Loading...