Sayantan Sur

According to our database1, Sayantan Sur authored at least 42 papers between 2004 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2020
Minimizing the usage of hardware counters for collective communication using triggered operations.
Parallel Comput., 2020

2019
Efficient implementation of MPI-3 RMA over openFabrics interfaces.
Parallel Comput., 2019

2017

2016
Design and Implementation of OpenSHMEM Using OFI on the Aries Interconnect.
Proceedings of the OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, 2016

2015
A Brief Introduction to the OpenFabrics Interfaces - A New Network API for Maximizing High Performance Application Efficiency.
Proceedings of the 23rd IEEE Annual Symposium on High-Performance Interconnects, 2015

2014
Early Evaluation of Scalable Fabric Interface for PGAS Programming Models.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

2011
Collective Communication, Network Support For.
Proceedings of the Encyclopedia of Parallel Computing, 2011

InfiniBand.
Proceedings of the Encyclopedia of Parallel Computing, 2011

MVAPICH2-GPU: optimized GPU to GPU communication for InfiniBand clusters.
Comput. Sci. Res. Dev., 2011

High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT.
Comput. Sci. Res. Dev., 2011

Codesign for InfiniBand Clusters.
Computer, 2011

Optimizing MPI One Sided Communication on Multi-core InfiniBand Clusters Using Shared Memory Backed Windows.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Design and Implementation of Key Proposed MPI-3 One-Sided Communication Semantics on InfiniBand.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Memcached Design on High Performance RDMA Capable Interconnects.
Proceedings of the International Conference on Parallel Processing, 2011

Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL.
Proceedings of the IEEE 19th Annual Symposium on High Performance Interconnects, 2011

Multi-threaded UPC runtime with network endpoints: Design alternatives and evaluation on multi-core architectures.
Proceedings of the 18th International Conference on High Performance Computing, 2011

INAM - A Scalable InfiniBand Network Analysis and Monitoring Tool.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

Optimized Non-contiguous MPI Datatype Communication for GPU Clusters: Design, Implementation and Evaluation with MVAPICH2.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Clusters.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and Benefit.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

2010
Designing truly one-sided MPI-2 RMA intra-node communication on multi-core systems.
Comput. Sci. Res. Dev., 2010

Unifying UPC and MPI runtimes: experience with MVAPICH.
Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, 2010

Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application.
Proceedings of the 24th International Conference on Supercomputing, 2010

High Performance Design and Implementation of Nemesis Communication Layer for Two-Sided and One-Sided MPI Semantics in MVAPICH2.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Improving Application Performance and Predictability Using Multiple Virtual Lanes in Modern Multi-core InfiniBand Clusters.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Designing Power-Aware Collective Communication Algorithms for InfiniBand Clusters.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Design and Evaluation of Generalized Collective Communication Primitives with Overlap Using ConnectX-2 Offload Engine.
Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010

Designing High-End Computing Systems with InfiniBand and High-Speed Ethernet.
Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010

2009
Efficient, portable implementation of asynchronous multi-place programs.
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

2007
High performance MPI design using unreliable datagram for ultra-scale InfiniBand clusters.
Proceedings of the 21th Annual International Conference on Supercomputing, 2007

Performance Analysis and Evaluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms.
Proceedings of the 15th Annual IEEE Symposium on High-Performance Interconnects, 2007

Zero-copy protocol for MPI using infiniband unreliable datagram.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Lightweight kernel-level primitives for high-performance MPI intra-node communication over multi-core systems.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

2006
MPI and communication - High-performance and scalable MPI over InfiniBand with reduced memory usage: an in-depth performance analysis.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

RDMA read based rendezvous protocol for MPI over InfiniBand: design alternatives and benefits.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

Shared receive queue based scalable MPI design for InfiniBand clusters.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005
High Performance Broadcast Support in La-Mpi Over Quadrics.
Int. J. High Perform. Comput. Appl., 2005

Analysis of Design Considerations for Optimizing Multi-Channel MPI over InfiniBand.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

LiMIC: Support for High-Performance MPI Intra-node Communication on Linux Cluster.
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

Can Memory-Less Network Adapters Benefit Next-Generation InfiniBand Systems?.
Proceedings of the 13th Annual IEEE Symposium on High Performance Interconnects (HOTIC 2005), 2005

High Performance RDMA Based All-to-All Broadcast for InfiniBand Clusters.
Proceedings of the High Performance Computing, 2005

2004
Efficient and Scalable All-to-All Personalized Exchange for InfiniBand-Based Clusters.
Proceedings of the 33rd International Conference on Parallel Processing (ICPP 2004), 2004


  Loading...