Sreepathi Pai

ORCID: 0000-0002-3691-7238

According to our database, Sreepathi Pai authored at least 23 papers between 2012 and 2023.

Bibliography

2023
Asynchronous Automata Processing on GPUs.
Proc. ACM Meas. Anal. Comput. Syst., March, 2023

2021
Efficient Execution of Graph Algorithms on CPU with SIMD Extensions.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021

2020
Groute: Asynchronous Multi-GPU Programming Model with Applications to Large-scale Graph Processing.
ACM Trans. Parallel Comput., 2020

Horus: A Modular GPU Emulator Framework.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2020

Why GPUs are Slow at Executing NFAs and How to Make them Faster.
Proceedings of ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

2019
A Hybrid Graph Coloring Algorithm for GPUs.
CoRR, 2019

Statistical caching for near memory management.
Proceedings of the International Symposium on Memory Systems, 2019

Performance Evaluation of OpenCL Standard Support (and Beyond).
Proceedings of the International Workshop on OpenCL, 2019

One Size Doesn't Fit All: Quantifying Performance Portability of Graph Applications on GPUs.
Proceedings of the IEEE International Symposium on Workload Characterization, 2019

2018
Locality analysis through static parallel sampling.
Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2018

Architectural Support for Efficient Large-Scale Automata Processing.
Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

2017
Bounded exhaustive test-input generation on GPUs.
Proc. ACM Program. Lang., 2017

Groute: An Asynchronous Multi-GPU Programming Model for Irregular Computations.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

Parallel triangle counting and k-truss identification using graph-centric methods.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

Controlled Kernel Launch for Dynamic Parallelism in GPUs.
Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

2016
Adaptive Work-Efficient Connected Components on the GPU.
CoRR, 2016

Lowering IrGL to CUDA.
CoRR, 2016

A compiler for throughput optimization of graph algorithms on GPUs.
Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, 2016

Synchronization Trade-Offs in GPU Implementations of Graph Algorithms.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

2015
Stochastic gradient descent on GPUs.
Proceedings of the 8th Workshop on General Purpose Processing using GPUs, 2015

2014
Preemptive thread block scheduling with online structural runtime prediction for concurrent GPGPU kernels.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2014

2013
Improving GPGPU concurrency with elastic kernels.
Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems, 2013

2012
Fast and efficient automatic memory management for GPUs using compiler-assisted runtime coherence scheme.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012
