Piyush Sao

ORCID: 0000-0002-9432-5855

According to our database, Piyush Sao authored at least 29 papers between 2013 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

Ghosts of Softmax: Complex Singularities That Limit Safe Step Sizes in Cross-Entropy.

Piyush Sao

CoRR, March, 2026

Fast Evaluation of Truncated Neumann Series by Low-Product Radix Kernels.

Piyush Sao

CoRR, February, 2026

What Trace Powers Reveal About Log-Determinants: Closed-Form Estimators, Certificates, and Failure Modes.

Piyush Sao

CoRR, January, 2026

2025

Fast Active-Set Thresholding Method for Nonnegative Least Squares.

Proceedings of the IEEE International Conference on Big Data, 2025

2024

Interface for Sparse Linear Algebra Operations.

CoRR, 2024

Accelerated Constrained Sparse Tensor Factorization on Massively Parallel Architectures.

Proceedings of the 53rd International Conference on Parallel Processing, 2024

PANDORA: A Parallel Dendrogram Construction Algorithm for Single Linkage Clustering on GPU.

Piyush Sao

Andrey Prokopenko

Damien Lebrun-Grandié

Proceedings of the 53rd International Conference on Parallel Processing, 2024

2023

Newly Released Capabilities in the Distributed-Memory SuperLU Sparse Direct Solver.

ACM Trans. Math. Softw., March, 2023

Brief Announcement: Communication Optimal Sparse LU Factorization for Planar Matrices.

Piyush Sao

Xiaoye Sherry Li

Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, 2023

Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters.

Proceedings of the International Conference for High Performance Computing, 2023

Optimizing Communication in 2D Grid-Based MPI Applications at Exascale.

Proceedings of the 30th European MPI Users' Group Meeting, 2023

2022

Exaflops Biomedical Knowledge Graph Analytics.

Proceedings of the SC22: International Conference for High Performance Computing, 2022

A single-tree algorithm to compute the Euclidean minimum spanning tree on GPUs.

Andrey Prokopenko

Piyush Sao

Damien Lebrun-Grandié

Proceedings of the 51st International Conference on Parallel Processing, 2022

2021

Sparse Binary Matrix-Vector Multiplication on Neuromorphic Computers.

Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

Scalable All-pairs Shortest Paths for Huge Graphs on Multi-GPU Clusters.

Proceedings of the HPDC '21: The 30th International Symposium on High-Performance Parallel and Distributed Computing, 2021

2020

Traversing Large Graphs on GPUs with Unified Memory.

Proc. VLDB Endow., 2020

Scalable knowledge graph analytics at 136 petaflop/s.

Ramakrishnan Kannan

Piyush Sao

Hao Lu

Drahomira Herrmannova

Proceedings of the International Conference for High Performance Computing, 2020

A supernodal all-pairs shortest path algorithm.

Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

2019

A communication-avoiding 3D algorithm for sparse LU factorization on heterogeneous systems.

Piyush Sao

Xiaoye S. Li

Richard W. Vuduc

J. Parallel Distributed Comput., 2019

Self-stabilizing Connected Components.

Proceedings of the 9th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, 2019

Multifrontal Non-negative Matrix Factorization.

Piyush Sao

Ramakrishnan Kannan

Proceedings of the Parallel Processing and Applied Mathematics, 2019

A communication-avoiding 3D sparse triangular solver.

Proceedings of the ACM International Conference on Supercomputing, 2019

2018

Scalable and resilient sparse linear solvers.

Piyush Sao

PhD thesis, 2018

A Communication-Avoiding 3D LU Factorization Algorithm for Sparse Matrices.

Piyush Sao

Xiaoye Sherry Li

Richard W. Vuduc

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

2016

A Self-Correcting Connected Components Algorithm.

Proceedings of the ACM Workshop on Fault-Tolerance for HPC at Extreme Scale, 2016

2015

A Sparse Direct Solver for Distributed Memory Xeon Phi-Accelerated Systems.

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014

A distributed kernel summation framework for general-dimension machine learning.

Stat. Anal. Data Min., 2014

A Distributed CPU-GPU Sparse Direct Solver.

Piyush Sao

Richard W. Vuduc

Xiaoye Sherry Li

Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013

Self-stabilizing iterative solvers.

Piyush Sao

Richard W. Vuduc

Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2013
