Carl Pearson

Orcid: 0000-0001-6481-970X

According to our database, Carl Pearson authored at least 22 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Interconnect Bandwidth Heterogeneity on AMD MI250x and Infinity Fabric.
CoRR, 2023

Latency and Bandwidth Microbenchmarks of US Department of Energy Systems in the June 2023 Top 500 List.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Latency and Bandwidth Microbenchmarks of Six US Department of Energy Systems in the Top500.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022
Machine Learning for CUDA+MPI Design Rules.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

2021
Movement and placement of non-contiguous data in distributed GPU computing.
PhD thesis, 2021

TEMPI: An Interposed MPI Library with a Canonical Representation of CUDA-aware Datatypes.
Proceedings of the HPDC '21: The 30th International Symposium on High-Performance Parallel and Distributed Computing, 2021

2020
Fast CUDA-Aware MPI Datatypes without Platform Support.
CoRR, 2020

Efficient Inference on GPUs for the Sparse Deep Neural Network Graph Challenge 2020.
CoRR, 2020

Node-Aware Stencil Communication for Heterogeneous Supercomputers.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

At-Scale Sparse Deep Neural Network Inference With Efficient GPU Implementation.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

2019
Evaluating Characteristics of CUDA Communication Primitives on High-Bandwidth Interconnects.
Proceedings of the 2019 ACM/SPEC International Conference on Performance Engineering, 2019

Update on Triangle Counting on GPU.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

Accelerating Sparse Deep Neural Networks on FPGAs.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

Update on k-truss Decomposition on GPU.
Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

2018
SCOPE: C3SR Systems Characterization and Benchmarking Framework.
CoRR, 2018

NUMA-Aware Data-Transfer Measurements for Power/NVLink Multi-GPU Systems.
Proceedings of the High Performance Computing, 2018

A Fast and Massively-Parallel Inverse Solver for Multiple-Scattering Tomographic Image Reconstruction.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Collaborative (CPU + GPU) Algorithms for Triangle Counting and Truss Decomposition.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

2017
RAI: A Scalable Project Submission System for Parallel Programming Courses.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Rebooting the Data Access Hierarchy of Computing Systems.
Proceedings of the IEEE International Conference on Rebooting Computing, 2017

2016
WebGPU: A Scalable Online Development Platform for GPU Programming Courses.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2014
Adaptive Cache Bypass and Insertion for Many-core Accelerators.
Proceedings of the 2nd International Workshop on Many-core Embedded Systems, 2014

