Kawthar Shafie Khorassani

Orcid: 0000-0001-5856-5483

Affiliations:
  • Ohio State University, Columbus, USA


According to our database1, Kawthar Shafie Khorassani authored at least 16 papers between 2019 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
High Performance MPI over the Slingshot Interconnect.
J. Comput. Sci. Technol., February, 2023

Network-Assisted Noncontiguous Transfers for GPU-Aware MPI Libraries.
IEEE Micro, 2023

MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Designing and Optimizing GPU-aware Nonblocking MPI Neighborhood Collective Communication for PETSc<sup>*</sup>.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Implementing and Optimizing a GPU-aware MPI Library for Intel GPUs: Early Experiences.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023

2022
High Performance MPI over the Slingshot Interconnect: Early Experiences.
Proceedings of the PEARC '22: Practice and Experience in Advanced Research Computing, Boston, MA, USA, July 10, 2022

Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters.
Proceedings of the High Performance Computing - 37th International Conference, 2022

Highly Efficient Alltoall and Alltoallv Communication Algorithms for GPU Systems.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Network Assisted Non-Contiguous Transfers for GPU-Aware MPI Libraries.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2022

2021
Designing a ROCm-Aware MPI Library for AMD GPUs: Early Experiences.
Proceedings of the High Performance Computing - 36th International Conference, 2021

Adaptive and Hierarchical Large Message All-to-all Communication Algorithms for Large-scale Dense GPU Systems.
Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021

2020
NV-group: link-efficient reduction for distributed deep learning on modern dense GPU systems.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

Dynamic Kernel Fusion for Bulk Non-contiguous Data Transfer on GPU Clusters.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019
Performance Evaluation of MPI Libraries on GPU-Enabled OpenPOWER Architectures: Early Experiences.
Proceedings of the High Performance Computing, 2019

OMB-UM: Design, Implementation, and Evaluation of CUDA Unified Memory Aware MPI Benchmarks.
Proceedings of the 2019 IEEE/ACM Performance Modeling, 2019

High-Performance Adaptive MPI Derived Datatype Communication for Modern Multi-GPU Systems.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019


  Loading...