Joshua Hursey

According to our database1, Joshua Hursey authored at least 23 papers between 2005 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2020
Scalable, fault-tolerant job step management for high-performance systems.
IBM J. Res. Dev., 2020

Application health monitoring for extreme-scale resiliency using cooperative fault management.
Concurr. Comput. Pract. Exp., 2020

2019
A performance analysis and optimization of PMIx-based HPC software stacks.
Proceedings of the 26th European MPI Users' Group Meeting, 2019

2018
PMIx: Process management for exascale environments.
Parallel Comput., 2018

2017
OnRamp: A web-portal for teaching parallel and distributed computing.
J. Parallel Distributed Comput., 2017

2016
OnRamp to Parallel and Distributed Computing: Web-portal for Teaching Parallel and Distributed Computing.
Proceedings of the 47th ACM Technical Symposium on Computing Science Education, 2016

2015
OnRamp to parallel and distributed computing.
Proceedings of the Workshop on Education for High-Performance Computing, 2015

2014
Netloc: Towards a Comprehensive View of the HPC System Topology.
Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

2013
An evaluation of User-Level Failure Mitigation support in MPI.
Computing, 2013

Advancing application process affinity experimentation: open MPI's LAMA-based affinity interface.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

2012
Analyzing fault aware collective performance in a process fault tolerant MPI.
Parallel Comput., 2012

2011
A Log-Scaling Fault Tolerant Agreement Algorithm for a Fault Tolerant MPI.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Run-Through Stabilization: An MPI Proposal for Process Fault Tolerance.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Building a Fault Tolerant MPI Application: A Ring Communication Example.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Preserving Collective Performance across Process Failure for a Fault Tolerant MPI.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Locality-Aware Parallel Process Mapping for Multi-core HPC Systems.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

2010
Checkpoint/Restart-Enabled Parallel Debugging.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

2009
Interconnect agnostic checkpoint/restart in open MPI.
Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, 2009

2008
Representing unit test data for large scale software development.
Proceedings of the ACM 2008 Symposium on Software Visualization, 2008

2007
An Extensible Framework for Distributed Testing of MPI Implementations.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Why It's Worth the Hassle: The Value of In-Situ Studies When Designing Ubicomp.
Proceedings of the UbiComp 2007: Ubiquitous Computing, 9th International Conference, 2007

2005
A Paradigm for Parallel Matrix Algorithms: .
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005


  Loading...