Jeff R. Hammond

Orcid: 0000-0003-3181-8190

Affiliations:

NVIDIA, Santa Clara, CA, USA
Intel Labs

According to our database, Jeff R. Hammond authored at least 54 papers between 2011 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

Tensor Algebra Processing Primitives (TAPP): Towards a Standard for Tensor Operations.

CoRR, January, 2026

Asynchronous-many-task systems: Challenges and opportunities - Scaling an AMR astrophysics code on exascale machines using Kokkos and HPX.

Int. J. High Perform. Comput. Appl., 2026

2025

Demystifying NCCL: An In-Depth Analysis of GPU Communication Protocols and Algorithms.

Proceedings of the IEEE Symposium on High-Performance Interconnects, 2025

2023

shmem4py: OpenSHMEM for Python.

Dataset, July, 2023

shmem4py: OpenSHMEM for Python.

J. Open Source Softw., 2023

shmem4py: High-Performance One-Sided Communication for Python Applications.

Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

MPI Application Binary Interface Standardization.

Jean-Baptiste Besnard

Jed Brown

Gonzalo Brito Gadeschi

Simon Byrne

Joseph Schuchart

Hui Zhou

Proceedings of the 30th European MPI Users' Group Meeting, 2023

Optimizing Cloud Computing Resource Usage for Hemodynamic Simulation.

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Application Experiences on a GPU-Accelerated Arm-based HPC Testbed.

Proceedings of the HPC Asia 2023 Workshops, 2023

2022

Early Application Experiences on a Modern GPU-Accelerated Arm-based HPC Platform.

CoRR, 2022

Benchmarking Fortran DO CONCURRENT on CPUs and GPUs Using BabelStream.

Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022

2021

Enabling ISO Standard Languages for Complex HPC Workflows.

Proceedings of the Driving Scientific and Engineering Discoveries Through the Integration of Experiment, Big Data, and Modeling and Simulation, 2021

OpenSHMEM over MPI as a Performance Contender: Thorough Analysis and Optimizations.

Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021

2020

Data Parallel C++: Enhancing SYCL Through Extensions for Productivity and Performance.

Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020

2019

Evaluating data parallelism in C++ using the Parallel Research Kernels.

Jeff R. Hammond

Timothy G. Mattson

Proceedings of the International Workshop on OpenCL, 2019

A comparative analysis of Kokkos and SYCL as heterogeneous, parallel programming models for C++ applications.

Jeff R. Hammond

Michael Kinsner

James C. Brodman

Proceedings of the International Workshop on OpenCL, 2019

Software combining to mitigate multithreaded MPI contention.

Kenneth J. Raffenetti

Proceedings of the ACM International Conference on Supercomputing, 2019

2018

Dynamic Adaptable Asynchronous Progress Model for MPI RMA Multiphase Applications.

IEEE Trans. Parallel Distributed Syst., 2018

Lock Contention Management in Multithreaded MPI.

ACM Trans. Parallel Comput., 2018

Visualization of OpenMP* Task Dependencies Using Intel® Advisor - Flow Graph Analyzer.

Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

2017

TTC: A High-Performance Compiler for Tensor Transpositions.

Paul Springer

Jeff R. Hammond

Paolo Bientinesi

ACM Trans. Math. Softw., 2017

Exploring versioned distributed arrays for resilience in scientific applications.

Zachary A. Rubenstein

Int. J. High Perform. Comput. Appl., 2017

Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel® Xeon Phi™ Processor.

Proceedings of the High Performance Computing, 2017

2016

MADNESS: A Multiresolution, Adaptive Numerical Environment for Scientific Simulation.

SIAM J. Sci. Comput., 2016

Scaling up Hartree-Fock calculations on Tianhe-2.

Int. J. High Perform. Comput. Appl., 2016

Comparing Runtime Systems with Exascale Ambitions Using the Parallel Research Kernels.

Rob F. Van der Wijngaart

Proceedings of the High Performance Computing - 31st International Conference, 2016

CAF Events Implementation Using MPI-3 Capabilities.

Alessandro Fanfarillo

Jeff R. Hammond

Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

A Proposal to OpenMP for Addressing the CPU Oversubscription Challenge.

Yonghong Yan

Jeff R. Hammond

Chunhua Liao

Alexandre E. Eichenberger

Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

A Hartree-Fock Application Using UPC++ and the New DArray Library.

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

One-Sided Interface for Matrix Operations Using MPI-3 RMA: A Case Study with Elemental.

Assefaw Hadish Gebremedhin

Barbara M. Chapman

Proceedings of the 45th International Conference on Parallel Processing, 2016

2015

MADNESS: A Multiresolution, Adaptive Numerical Environment for Scientific Simulation.

CoRR, 2015

Improving concurrency and asynchrony in multithreaded MPI applications using software offloading.

Karthikeyan Vaidyanathan

Proceedings of the International Conference for High Performance Computing, 2015

Casper: An Asynchronous Progress Model for MPI RMA on Many-Core Architectures.

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Versioned Distributed Arrays for Resilience in Scientific Applications: Global View Resilience.

Proceedings of the International Conference on Computational Science, 2015

Scaling NWChem with Efficient and Portable Asynchronous Communication in MPI RMA.

Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

2014

A massively parallel tensor contraction framework for coupled-cluster computations.

J. Parallel Distributed Comput., 2014

To INT_MAX... and beyond!: exploring large-count support in MPI.

Jeff R. Hammond

Andreas Schäfer

Robert Latham

Proceedings of the 2014 Workshop on Exascale MPI, 2014

Towards a matrix-oriented strided interface in OpenSHMEM.

Jeff R. Hammond

Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Implementing OpenSHMEM Using MPI-3 One-Sided Communication.

Jeff R. Hammond

Sayan Ghosh

Barbara M. Chapman

Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Anatomy of High-Performance Many-Threaded Matrix Multiplication.

Tyler M. Smith

Robert A. van de Geijn

Mikhail Smelyanskiy

Jeff R. Hammond

Field G. Van Zee

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

WorkQ: A many-core producer/consumer execution model applied to PGAS computations.

Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

2013

Elemental: A New Framework for Distributed Memory Dense Matrix Computations.

Jack Poulson

Bryan Marker

Robert A. van de Geijn

Jeff R. Hammond

Nichols A. Romero

ACM Trans. Math. Softw., 2013

Challenges and methods in large-scale computational chemistry applications.

Jeff R. Hammond

XRDS, 2013

Performance Analysis of the NWChem TCE for Different Communication Patterns.

Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

Cyclops Tensor Framework: Reducing Communication and Eliminating Load Imbalance in Massively Parallel Contractions.

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Performance Analysis of the Lattice Boltzmann Model Beyond Navier-Stokes.

Amanda Peters Randles

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Inspector/executor load balancing algorithms for block-sparse tensor contractions.

Proceedings of the International Conference on Supercomputing, 2013

2012

Performance characterization of global address space applications: a case study with NWChem.

Jeff R. Hammond

Sriram Krishnamoorthy

Sameer Shende

Nichols A. Romero

Allen D. Malony

Concurr. Comput. Pract. Exp., 2012

Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication.

James Dinan

Pavan Balaji

Jeff R. Hammond

Sriram Krishnamoorthy

Vinod Tipparaju

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

ALCF MPI Benchmarks: Understanding Machine-Specific Communication Behavior.

Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

An evaluation of difference and threshold techniques for efficient checkpoints.

Sean Hogan

Jeff R. Hammond

Andrew A. Chien

Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2012

2011

Poster: Passing the three trillion particle limit with an error-controlled fast multipole method.

Ivo Kabadshow

Holger Dachsel

Jeff R. Hammond

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

Poster: High-level, one-sided programming models on MPI: a case study with global arrays and NWChem.

James Dinan

Pavan Balaji

Jeff R. Hammond

Sriram Krishnamoorthy

Vinod Tipparaju

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

Noncollective Communicator Creation in MPI.

James Dinan

Sriram Krishnamoorthy

Proceedings of the Recent Advances in the Message Passing Interface, 2011
