Saeed Maleki

According to our database1, Saeed Maleki authored at least 18 papers between 2011 and 2021.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2021
Synthesizing optimal collective algorithms.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

2020
Scaling Distributed Training with Adaptive Summation.
CoRR, 2020

2019
Distributed Word2Vec using Graph Analytics Frameworks.
CoRR, 2019

CHET: an optimizing compiler for fully-homomorphic neural-network inferencing.
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019

2018
An empirical study of the effect of source-level loop transformations on compiler stability.
Proc. ACM Program. Lang., 2018

CHET: Compiler and Runtime for Homomorphic Evaluation of Tensor Programs.
CoRR, 2018

Semantics-Preserving Parallelization of Stochastic Gradient Descent.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

2017
Parallel Stochastic Gradient Descent with Sound Combiners.
CoRR, 2017

LORE: A loop repository for the evaluation of compilers.
Proceedings of the 2017 IEEE International Symposium on Workload Characterization, 2017

2016
Low-Rank Methods for Parallelizing Dynamic Programming Algorithms.
ACM Trans. Parallel Comput., 2016

Efficient parallelization using rank convergence in dynamic programming algorithms.
Commun. ACM, 2016

DSMR: a shared and distributed memory algorithm for single-source shortest path problem.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

DSMR: A Parallel Algorithm for Single-Source Shortest Path Problem.
Proceedings of the 2016 International Conference on Supercomputing, 2016

Parallelizing WFST speech decoders.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2014
Parallelizing dynamic programming through rank convergence.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Tiled Linear Algebra a System for Parallel Graph Algorithms.
Proceedings of the Languages and Compilers for Parallel Computing, 2014

2012
Performance Portability with the Chapel Language.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011
An Evaluation of Vectorizing Compilers.
Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011


  Loading...