Mohammadreza Bayatpour

According to our database1, Mohammadreza Bayatpour authored at least 21 papers between 2016 and 2021.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2021
The MVAPICH project: Transforming research into high-performance MPI library for HPC community.
J. Comput. Sci., 2021

BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs.
Proceedings of the High Performance Computing - 36th International Conference, 2021

Layout-aware Hardware-assisted Designs for Derived Data Types in MPI.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

Large-Message Nonblocking MPI_Iallgather and MPI Ibcast Offload via BlueField-2 DPU.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

Towards Architecture-aware Hierarchical Communication Trees on Modern HPC Systems.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

2020
FALCON-X: Zero-copy MPI derived datatype processing on modern CPU and GPU architectures.
J. Parallel Distributed Comput., 2020

Communication-Aware Hardware-Assisted MPI Overlap Engine.
Proceedings of the High Performance Computing - 35th International Conference, 2020

A hierarchical and load-aware design for large message neighborhood collectives.
Proceedings of the International Conference for High Performance Computing, 2020

Scalable MPI Collectives using SHARP: Large Scale Performance Evaluation on the TACC Frontera System.
Proceedings of the Workshop on Exascale MPI, 2020

Performance Characterization of Network Mechanisms for Non-Contiguous Data Transfers in MPI.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Machine-agnostic and Communication-aware Designs for MPI on Emerging Architectures.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Design and Characterization of InfiniBand Hardware Tag Matching in MPI.
Proceedings of the 20th IEEE/ACM International Symposium on Cluster, 2020

2019
Efficient design for MPI asynchronous progress without dedicated resources.
Parallel Comput., 2019

FALCON: Efficient Designs for Zero-Copy MPI Datatype Processing on Emerging Architectures.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Design and Characterization of Shared Address Space MPI Collectives on Modern Architectures.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018
Cooperative rendezvous protocols for improved performance and overlap.
Proceedings of the International Conference for High Performance Computing, 2018

Efficient Asynchronous Communication Progress for MPI without Dedicated Resources.
Proceedings of the 25th European MPI Users' Group Meeting, 2018

Designing Efficient Shared Address Space Reduction Collectives for Multi-/Many-cores.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

SALaR: Scalable and Adaptive Designs for Large Message Reduction Collectives.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017
Scalable reduction collectives with data partitioning-based multi-leader design.
Proceedings of the International Conference for High Performance Computing, 2017

2016
Adaptive and Dynamic Design for MPI Tag Matching.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016


  Loading...