Stephen W. Poole

Orcid: 0000-0002-4531-7453

Affiliations:
  • Los Alamos National Laboratory, NM, USA
  • Oak Ridge National Laboratory, TN, USA (former)
  • IBM Corporation, Houston, TX, USA (former)


According to our database, Stephen W. Poole authored at least 81 papers between 1988 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
The Future of HPC in Nuclear Security.
IEEE Internet Comput., 2023

DPU-Bench: A Micro-Benchmark Suite to Measure Offload Efficiency Of SmartNICs.
Proceedings of the Practice and Experience in Advanced Research Computing, 2023

Exploring Challenges Associated with Employing SmartNICs as General-Purpose HPC Accelerators.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2023

Battle of the BlueFields: An In-Depth Comparison of the BlueField-2 and BlueField-3 SmartNICs.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2023

Extending OpenSHMEM with Aggregation Support for Improved Message Rate Performance.
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023

2022
RaiderSTREAM: Adapting the STREAM Benchmark to Modern HPC Systems.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2022

Bring the BitCODE - Moving Compute and Data in Distributed Heterogeneous Systems.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2021
UCX Programming Interface for Remote Function Injection and Invocation.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021

SHMEM-ML: Leveraging OpenSHMEM and Apache Arrow for Scalable, Composable Machine Learning.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021

Two-Chains: High Performance Framework for Function Injection and Execution.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
OpenSHMEM I/O Extensions for Fine-Grained Access to Persistent Memory Storage.
Proceedings of the Driving Scientific and Engineering Discoveries Through the Convergence of HPC, Big Data and AI, 2020

HOOVER: Leveraging OpenSHMEM for High Performance, Flexible Streaming Graph Applications.
Proceedings of the 3rd IEEE/ACM Annual Parallel Applications Workshop: Alternatives To MPI+X, 2020

2017
Thoughtful Precision in Mini-Apps.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2015
Utility Functions and Resource Management in an Oversubscribed Heterogeneous Computing Environment.
IEEE Trans. Computers, 2015

Utility maximizing dynamic resource management in an oversubscribed energy-constrained heterogeneous computing system.
Sustain. Comput. Informatics Syst., 2015

Electrical Grid and Supercomputing Centers: An Investigative Analysis of Emerging Opportunities and Challenges.
Inform. Spektrum, 2015

Measuring Server Energy Proportionality.
Proceedings of the 6th ACM/SPEC International Conference on Performance Engineering, Austin, TX, USA, January 31, 2015

2014
Optimizing I/O forwarding techniques for extreme-scale event tracing.
Clust. Comput., 2014

Power signatures of high-performance computing workloads.
Proceedings of the 2nd International Workshop on Energy Efficient Supercomputing, 2014

Extending the OpenSHMEM Memory Model to Support User-Defined Spaces.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Development and Extension of Atomic Memory Operations in OpenSHMEM.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

OpenSHMEM Reference Implementation using UCCS-uGNI Transport Layer.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Fault Tolerance for OpenSHMEM.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Designing a High Performance OpenSHMEM Implementation Using Universal Common Communication Substrate as a Communication Middleware.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Extending the OpenSHMEM Analyzer to Perform Synchronization and Multi-valued Analysis.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

OpenSHMEM Extensions and a Vision for Its Future Direction.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Analyzing the Energy and Power Consumption of Remote Memory Accesses in the OpenSHMEM Model.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Utility Driven Dynamic Resource Management in an Oversubscribed Energy-Constrained Heterogeneous System.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Application Power Signature Analysis.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Energy-aware resource management for computing systems.
Proceedings of the Seventh International Conference on Contemporary Computing, 2014

Power Consumption Due to Data Movement in Distributed Programming Models.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
Modeling and predicting performance of high performance computing applications on hardware accelerators.
Int. J. High Perform. Comput. Appl., 2013

The co-design architecture for exascale systems, a novel approach for scalable designs.
Comput. Sci. Res. Dev., 2013

TUE, a New Energy-Efficiency Metric Applied at ORNL's Jaguar.
Proceedings of the Supercomputing - 28th International Supercomputing Conference, 2013

An Analysis Framework for Investigating the Trade-Offs between System Performance and Energy Consumption in a Heterogeneous Computing Environment.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Revisiting Server Energy Proportionality.
Proceedings of the 42nd International Conference on Parallel Processing, 2013


Exploring energy and performance behaviors of data-intensive scientific workflows on systems with deep memory hierarchies.
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

2012
Energy-Efficient Online Provisioning for HPC Workloads.
Handbook of Energy-Aware and Green Computing - Two Volume Set, 2012

Comparative Study of Runtime Systems for Energy-Aware High-Performance Computing.
Handbook of Energy-Aware and Green Computing - Two Volume Set, 2012

Towards efficient supercomputing: searching for the right efficiency metric.
Proceedings of the Third Joint WOSP/SIPEW International Conference on Performance Engineering, 2012

The Network Adapter: The Missing Link between MPI Applications and Network Performance.
Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012

Experimental analysis of 10Gbps transfers over physical and emulated dedicated connections.
Proceedings of the International Conference on Computing, Networking and Communications, 2012

Cloud computing infrastructure robustness: A game theory approach.
Proceedings of the International Conference on Computing, Networking and Communications, 2012

Enabling event tracing at leadership-class scale through I/O forwarding middleware.
Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing, 2012

2011
OpenSHMEM - Toward a Unified RMA Model.
Proceedings of the Encyclopedia of Parallel Computing, 2011

A mathematical analysis of the R-MAT random graph generator.
Networks, 2011

A technique for moving large data sets over high-performance long distance networks.
Proceedings of the IEEE 27th Symposium on Mass Storage Systems and Technologies, 2011

Power signature analysis of the SPECpower_ssj2008 benchmark.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2011

Time Utility Functions for Modeling and Evaluating Resource Allocations in a Heterogeneous Computing System.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Modeling and predicting application performance on hardware accelerators.
Proceedings of the 2011 IEEE International Symposium on Workload Characterization, 2011

An idiom-finding tool for increasing productivity of accelerators.
Proceedings of the 25th International Conference on Supercomputing, Tucson, AZ, USA, May 31, 2011

Power measurement for high performance computing: State of the art.
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011

Reducing Energy Usage with Memory and Computation-Aware Dynamic Frequency Scaling.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

Diagnosing Anomalous Network Performance with Confidence.
Proceedings of the 11th IEEE/ACM International Symposium on Cluster, 2011

2010
Sparse Matrix-Vector Multiplication on a Reconfigurable Supercomputer with Application.
ACM Trans. Reconfigurable Technol. Syst., 2010

Acceleration of the Smith-Waterman algorithm using single and multiple graphics processors.
J. Comput. Phys., 2010

A New Benchmark For Evaluation Of Graph-Theoretic Algorithms.
CoRR, 2010

Testbed and Experiments for High-Performance Networking.
Proceedings of the Testbeds and Research Infrastructures. Development of Networks and Communities, 2010

Introducing OpenSHMEM: SHMEM for the PGAS community.
Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, 2010

Collecting Sensor Data for High-Performance Computing: A Case-study.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2010

Overlapping computation and communication: Barrier algorithms and ConnectX-2 CORE-Direct capabilities.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Investigating the potential of application-centric aggressive power management for HPC workloads.
Proceedings of the 2010 International Conference on High Performance Computing, 2010

Energy-efficient application-aware online provisioning for virtualized clouds and data centers.
Proceedings of the International Green Computing Conference 2010, 2010

Hardware/software co-design for high performance computing: challenges and opportunities.
Proceedings of the 8th International Conference on Hardware/Software Codesign and System Synthesis, 2010

ConnectX-2 InfiniBand Management Queues: First Investigation of the New Support for Network Offloaded Collective Operations.
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

2009
Coordinating government funding of file system and I/O research through the high end computing university research activity.
ACM SIGOPS Oper. Syst. Rev., 2009

UltraScience Net: High-Performance Network Research Test-Bed.
Int. J. Distributed Sens. Networks, 2009

A Taxonomy of MPI-Oriented Usage Models in Parallelized Scientific Codes.
Proceedings of the 2009 International Conference on Software Engineering Research & Practice, 2009

Performance analysis and projections for Petascale applications on Cray XT series systems.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Performance Characterization of a Hierarchical MPI Implementation on Large-scale Distributed-memory Platforms.
Proceedings of the ICPP 2009, 2009

Impact of Quad-Core Cray XT4 System and Software Stack on Scientific Computation.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008
Wide-area performance profiling of 10GigE and InfiniBand technologies.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

X-SRQ - Improving Scalability and Performance of Multi-core InfiniBand Clusters.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Sparse Matrix-Vector Multiplication on a Reconfigurable Supercomputer.
Proceedings of the 16th IEEE International Symposium on Field-Programmable Custom Computing Machines, 2008

An Implementation of the Conjugate Gradient Algorithm on FPGAs.
Proceedings of the 16th IEEE International Symposium on Field-Programmable Custom Computing Machines, 2008

2007
NPU-Based Image Compositing in a Distributed Visualization System.
IEEE Trans. Vis. Comput. Graph., 2007

2006
PaScal - a new parallel and scalable server IO networking infrastructure for supporting global storage/file systems in large-size Linux clusters.
Proceedings of the 25th IEEE International Performance Computing and Communications Conference, 2006

2002
Granidt: Towards Gigabit Rate Network Intrusion Detection Technology.
Proceedings of the Field-Programmable Logic and Applications, 2002

1991
Wide format floating-point math libraries.
Proceedings of Supercomputing '91, 1991

1988
Block-iterative finite element computations for incompressible flow problems.
Proceedings of the 2nd international conference on Supercomputing, 1988

