Sayantan Sur

Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and Benefit.

[BibT_eX]

[DOI]

Ashish Kumar Singh

Sreeram Potluri

Hao Wang

Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

2010

Designing truly one-sided MPI-2 RMA intra-node communication on multi-core systems.

[BibT_eX]

[DOI]

Ping Lai

Comput. Sci. Res. Dev., 2010

Unifying UPC and MPI runtimes: experience with MVAPICH.

[BibT_eX]

[DOI]

Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, 2010

Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Supercomputing, 2010

High Performance Design and Implementation of Nemesis Communication Layer for Two-Sided and One-Sided MPI Semantics in MVAPICH2.

[BibT_eX]

[DOI]

Miao Luo

Sreeram Potluri

Ping Lai

Emilio Pasquale Mancini

Hari Subramoni

Proceedings of the 39th International Conference on Parallel Processing, 2010

Improving Application Performance and Predictability Using Multiple Virtual Lanes in Modern Multi-core InfiniBand Clusters.

[BibT_eX]

[DOI]

Proceedings of the 39th International Conference on Parallel Processing, 2010

Designing Power-Aware Collective Communication Algorithms for InfiniBand Clusters.

[BibT_eX]

[DOI]

Emilio Pasquale Mancini

Proceedings of the 39th International Conference on Parallel Processing, 2010

Design and Evaluation of Generalized Collective Communication Primitives with Overlap Using ConnectX-2 Offload Engine.

[BibT_eX]

[DOI]

Hari Subramoni

Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010

Designing High-End Computing Systems with InfiniBand and High-Speed Ethernet.

[BibT_eX]

[DOI]

Pavan Balaji

Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010

2009

Efficient, portable implementation of asynchronous multi-place programs.

[BibT_eX]

[DOI]

Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

2007

High performance MPI design using unreliable datagram for ultra-scale InfiniBand clusters.

[BibT_eX]

[DOI]

Proceedings of the 21th Annual International Conference on Supercomputing, 2007

Performance Analysis and Evaluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual IEEE Symposium on High-Performance Interconnects, 2007

Zero-copy protocol for MPI using infiniband unreliable datagram.

[BibT_eX]

[DOI]

Matthew J. Koop

Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Lightweight kernel-level primitives for high-performance MPI intra-node communication over multi-core systems.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

2006

MPI and communication - High-performance and scalable MPI over InfiniBand with reduced memory usage: an in-depth performance analysis.

[BibT_eX]

[DOI]

Matthew J. Koop

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

RDMA read based rendezvous protocol for MPI over InfiniBand: design alternatives and benefits.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

Shared receive queue based scalable MPI design for InfiniBand clusters.

[BibT_eX]

[DOI]

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005

High Performance Broadcast Support in La-Mpi Over Quadrics.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2005

Analysis of Design Considerations for Optimizing Multi-Channel MPI over InfiniBand.

[BibT_eX]

[DOI]

Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

LiMIC: Support for High-Performance MPI Intra-node Communication on Linux Cluster.

[BibT_eX]

[DOI]

Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

Can Memory-Less Network Adapters Benefit Next-Generation InfiniBand Systems?.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual IEEE Symposium on High Performance Interconnects (HOTIC 2005), 2005

High Performance RDMA Based All-to-All Broadcast for InfiniBand Clusters.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2005

2004

Efficient and Scalable All-to-All Personalized Exchange for InfiniBand-Based Clusters.

[BibT_eX]

[DOI]

Hyun-Wook Jin