Kaiming Ouyang

Orcid: 0000-0002-4775-1835

According to our database, Kaiming Ouyang authored at least 14 papers between 2017 and 2023.

Bibliography

2023
Accelerating MPI Collectives with Process-in-Process-based Multi-object Techniques.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

PiP-MColl: Process-in-Process-based Multi-object MPI Collectives.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

KF K-means: A High Performance K-means Implementation using Kernel Fusion.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
Exploring Interprocess Techniques for High-Performance MPI Communication.
PhD thesis, 2022

On the Difference Between Shared Memory and Shared Address Space in HPC Communication.
Proceedings of the Supercomputing Frontiers - 7th Asian Conference, 2022

2021
FT-CNN: Algorithm-Based Fault Tolerance for Convolutional Neural Networks.
IEEE Trans. Parallel Distributed Syst., 2021

Daps: A Dynamic Asynchronous Progress Stealing Model for MPI Communication.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Algorithm-Based Fault Tolerance for Convolutional Neural Networks.
CoRR, 2020

CAB-MPI: exploring interprocess work-stealing towards balanced MPI communication.
Proceedings of the International Conference for High Performance Computing, 2020

2019
FT-iSort: efficient fault tolerance for introsort.
Proceedings of the International Conference for High Performance Computing, 2019

TSM2: optimizing tall-and-skinny matrix-matrix multiplication on GPUs.
Proceedings of the ACM International Conference on Supercomputing, 2019

2018
Fault tolerant one-sided matrix decompositions on heterogeneous systems with GPUs.
Proceedings of the International Conference for High Performance Computing, 2018

2017
Correcting soft errors online in fast fourier transform.
Proceedings of the International Conference for High Performance Computing, 2017

Silent Data Corruption Resilient Two-sided Matrix Factorizations.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

