Sameer Kumar

Dataset, October, 2021

UIUC-PPL/charm: v7.0.0-rc2.

[BibT_eX]

[DOI]

Dataset, September, 2021

UIUC-PPL/charm: v7.0.0-rc1.

[BibT_eX]

[DOI]

Dataset, June, 2021

2020

UIUC-PPL/charm: v6.11.0-beta1.

[BibT_eX]

[DOI]

Dataset, October, 2020

UIUC-PPL/charm: Charm++ version 6.10.2.

[BibT_eX]

[DOI]

Dataset, August, 2020

UIUC-PPL/charm: v6.10.1.

[BibT_eX]

[DOI]

Dataset, March, 2020

UIUC-PPL/charm: v6.10.0.

[BibT_eX]

[DOI]

Dataset, February, 2020

2019

UIUC-PPL/charm: v6.10.0-rc2.

[BibT_eX]

[DOI]

Dataset, October, 2019

UIUC-PPL/charm: v6.10.0-rc.

[BibT_eX]

[DOI]

Dataset, September, 2019

UIUC-PPL/charm: v6.10.0-beta1.

[BibT_eX]

[DOI]

Dataset, August, 2019

2018

Efficient Training of Convolutional Neural Nets on Large Distributed Systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017

PowerAI DDL.

[BibT_eX]

[DOI]

CoRR, 2017

MPI Acceleration of Image Classification: Are We Seeing the Resurgence of MPI in Solving Big Data Problems?

[BibT_eX]

[DOI]

Proceedings of the 2017 Workshop on Software Engineering Methods for Parallel and High Performance Applications, 2017

2016

Space Performance Tradeoffs in Compressing MPI Group Data Structures.

[BibT_eX]

[DOI]

Philip Heidelberger

Craig B. Stunkel

Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

Optimization of Message Passing Services on POWER8 InfiniBand Clusters.

[BibT_eX]

[DOI]

Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

2015

UCX: An Open Source Framework for HPC Network APIs and Beyond.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE Annual Symposium on High-Performance Interconnects, 2015

2014

Optimization of MPI collective operations on the IBM Blue Gene/Q supercomputer.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2014

Scalable MPI-3.0 RMA on the Blue Gene/Q Supercomputer.

[BibT_eX]

[DOI]

Michael Blocksome

Proceedings of the 21st European MPI Users' Group Meeting, 2014

2013

IBM Blue Gene/Q system software stack.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2013

Optimization of MPI_Allreduce on the blue Gene/Q supercomputer.

[BibT_eX]

[DOI]

Daniel Faraj

Proceedings of the 20th European MPI Users's Group Meeting, 2013

2012

The IBM Blue Gene/Q Interconnection Fabric.

[BibT_eX]

[DOI]

Jeffrey J. Parker

IEEE Micro, 2012

A divide and conquer strategy for scaling weather simulations with multiple regions of interest.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Looking under the hood of the IBM blue gene/Q network.

[BibT_eX]

[DOI]

Anamitra R. Choudhury

Yogish Sabharwal

Swati Singhal

Jeffrey J. Parker

Proceedings of the SC Conference on High Performance Computing Networking, 2012

PAMI: A Parallel Active Message Interface for the Blue Gene/Q Supercomputer.

[BibT_eX]

[DOI]

Saiful Azmi bin Hj Husain

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Collective algorithms for sub-communicators.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Supercomputing, 2012

Performance Evaluation and Optimization of Nested High Resolution Weather Simulations.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011

Mpi on millions of Cores.

[BibT_eX]

[DOI]

Parallel Process. Lett., 2011

Comparison of neuronal spike exchange methods on a Blue Gene/P supercomputer.

[BibT_eX]

[DOI]

Michael L. Hines

Felix Schürmann

Frontiers Comput. Neurosci., 2011

The IBM Blue Gene/Q interconnection network and message unit.

[BibT_eX]

[DOI]

Jeffrey J. Parker

Proceedings of the Conference on High Performance Computing Networking, 2011

Optimizing MPI Collectives Using Efficient Intra-node Communication Techniques over the Blue Gene/P Supercomputer.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2010

Architecture of the Component Collective Messaging Interface.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2010

Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2010

Optimization of applications with non-blocking neighborhood collectives via multisends on the Blue Gene/P supercomputer.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Minimizing MPI Resource Contention in Multithreaded Multicore Environments.

[BibT_eX]

[DOI]

Bronis R. de Supinski

Rajeev Thakur

Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010

2009

MPI on a Million Processors.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

MPI collective communications on the blue gene/p supercomputer: algorithms and optimizations.

[BibT_eX]

[DOI]

Proceedings of the 23rd international conference on Supercomputing, 2009

Dynamic topology aware load balancing algorithms for molecular dynamics applications.

[BibT_eX]

[DOI]

Abhinav Bhatele

Proceedings of the 23rd international conference on Supercomputing, 2009

MPI Collective Communications on The Blue Gene/P Supercomputer: Algorithms and Optimizations.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE Symposium on High Performance Interconnects, 2009

2008

Scalable molecular dynamics with NAMD on the IBM Blue Gene/L system.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2008

Fine-grained parallelization of the Car - Parrinello ab initio molecular dynamics method on the IBM Blue Gene/L supercomputer.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2008

Architecture of the Component Collective Messaging Interface.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Overcoming scaling challenges in biomolecular simulations across multiple platforms.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Evaluating the effect of replacing CNK with linux on the compute-nodes of blue gene/l.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

The deep computing messaging framework: generalized scalable message passing on the blue gene/P supercomputer.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Optimization of All-to-All Communication on the Blue Gene/L Supercomputer.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Parallel Processing, 2008

2006

Scaling applications to massively parallel machines using Projections performance analysis tool.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2006

Performance evaluation of adaptive MPI.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

Achieving strong scaling with NAMD on Blue Gene/L.

[BibT_eX]

[DOI]

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005

Optimizing Communication for Massively Parallel Processing

[BibT_eX]

[DOI]

PhD thesis, 2005

Improved Point-to-Point and Collective Communication Performance with Output-Queued High-Radix Routers.

[BibT_eX]

[DOI]

Craig B. Stunkel

Proceedings of the High Performance Computing, 2005

2004

Scalable fine-grained parallelization of plane-wave-based ab initio molecular dynamics for large supercomputers.

[BibT_eX]

[DOI]

J. Comput. Chem., 2004

Opportunities and Challenges of Modern Communication Architectures: Case Study with QsNet.

[BibT_eX]

[DOI]

Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Faucets: Efficient Resource Allocation on the Computational Grid.

[BibT_eX]

[DOI]

Proceedings of the 33rd International Conference on Parallel Processing (ICPP 2004), 2004

Scaling All-to-All Multicast on Fat-tree Networks.

[BibT_eX]

[DOI]

Proceedings of the 10th International Conference on Parallel and Distributed Systems, 2004

2003

A Framework for Collective Personalized Communication.

[BibT_eX]

[DOI]

Krishnan Varadarajan

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Scaling Molecular Dynamics to 3000 Processors with Projections: A Performance Analysis Case Study.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2003, 2003

2002

NAMD: biomolecular simulation on thousands of processors.

[BibT_eX]

[DOI]

Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

A Malleable-Job System for Timeshared Parallel Machines.

[BibT_eX]

[DOI]