Sriram Krishnamoorthy
According to our database^{1}, Sriram Krishnamoorthy authored at least 146 papers between 2003 and 2021.
Online presence: orcid.org, hpc.pnl.gov
Bibliography
2021
PaKman: A Scalable Algorithm for Generating Genomic Contigs on Distributed Memory Machines.
IEEE Trans. Parallel Distributed Syst., 2021
CoRR, 2021
2020
ACM Trans. Archit. Code Optim., 2020
FPDetect: Efficient Reasoning About Stencil Programs Using Selective Direct Evaluation.
ACM Trans. Archit. Code Optim., 2020
FailAmp: Relativization Transformation for Soft Error Detection in Structured Address Generation.
ACM Trans. Archit. Code Optim., 2020
Analytical Modeling and Design of Gallium Oxide Schottky Barrier Diodes Beyond Unipolar Figure of Merit Using High-k Dielectric Superjunction Structures.
CoRR, 2020
Design of a β-Ga₂O₃ Schottky Barrier Diode With p-type III-Nitride Guard Ring for Enhanced Breakdown.
CoRR, 2020
An Abstraction-guided Approach to Scalable and Rigorous Floating-Point Error Analysis.
CoRR, 2020
Density matrix quantum circuit simulation via the BSP machine on modern GPU clusters.
Proceedings of the International Conference for High Performance Computing, 2020
Scalable heterogeneous execution of a coupled-cluster model with perturbative triples.
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the International Conference for High Performance Computing, 2020
2019
ACM Trans. Parallel Comput., 2019
CoRR, 2019
Proceedings of the International Conference for High Performance Computing, 2019
Proceedings of the 6th ACM SIGPLAN International Workshop on Libraries, 2019
NoC-enabled software/hardware co-design framework for accelerating k-mer counting.
Proceedings of the 13th IEEE/ACM International Symposium on Networks-on-Chip, 2019
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019
BonVoision: leveraging spatial data smoothness for recovery from memory soft errors.
Proceedings of the ACM International Conference on Supercomputing, 2019
Performance Models for Data Transfers: A Case Study with Molecular Chemistry Kernels.
Proceedings of the 48th International Conference on Parallel Processing, 2019
Ground-Truth Prediction to Accelerate Soft-Error Impact Analysis for Iterative Methods.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019
Mapping Arbitrarily Sparse Two-Body Interactions on One-Dimensional Quantum Circuits.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019
Towards Predicting the Impact of Roll-Forward Failure Recovery for HPC Applications.
Proceedings of the 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2019
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019
Proceedings of the Algorithms for Computational Biology - 6th International Conference, 2019
2018
IEEE Trans. Parallel Distributed Syst., 2018
ACM Trans. Archit. Code Optim., 2018
Exploring the capabilities of support vector machines in detecting silent data corruptions.
Sustain. Comput. Informatics Syst., 2018
Proc. ACM Program. Lang., 2018
Proceedings of the 2nd IEEE/ACM International Workshop on Software Correctness for HPC Applications, 2018
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018
Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2018
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
Proceedings of the 32nd International Conference on Supercomputing, 2018
Characterizing the Impact of Soft Errors Affecting Floating-point ALUs using RTL-level Fault Injection.
Proceedings of the 47th International Conference on Parallel Processing, 2018
Quantification, Tradeoff Analysis, and Optimal Checkpoint Placement for Reliability and Availability.
Proceedings of the 25th IEEE International Conference on High Performance Computing, 2018
Proceedings of the 25th IEEE International Conference on High Performance Computing, 2018
Proceedings of the 2018 International Symposium on Code Generation and Optimization, 2018
Proceedings of the 15th ACM International Conference on Computing Frontiers, 2018
Comparative analysis of soft-error detection strategies: a case study with iterative methods.
Proceedings of the 15th ACM International Conference on Computing Frontiers, 2018
Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018
2017
CoRR, 2017
Automatic Risk-based Selective Redundancy for Fault-tolerant Task-parallel HPC Applications.
Proceedings of the Third International Workshop on Extreme Scale Programming Models and Middleware, 2017
Exploiting Vector and Multicore Parallelism for Recursive, Data- and Task-Parallel Programs.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017
Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017
Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2017
Proceedings of the Languages and Compilers for Parallel Computing, 2017
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017
Proceedings of the 46th International Conference on Parallel Processing, 2017
Proceedings of the 24th IEEE International Conference on High Performance Computing, 2017
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
2016
ACM Trans. Archit. Code Optim., 2016
ACM Trans. Archit. Code Optim., 2016
Work stealing for GPU-accelerated parallel programs in a global address space framework.
Concurr. Comput. Pract. Exp., 2016
A domain-specific compiler for a parallel multiresolution adaptive numerical simulation environment.
Proceedings of the International Conference for High Performance Computing, 2016
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016
PolyCheck: dynamic verification of iteration space transformations on affine programs.
Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 2016
Proceedings of the 37th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016
Proceedings of the 45th International Conference on Parallel Processing, 2016
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016
Proceedings of the 23rd IEEE International Conference on High Performance Computing, 2016
Proceedings of the 25th International Conference on Compiler Construction, 2016
2015
Global transformations for legacy parallel applications via structural analysis and rewriting.
Parallel Comput., 2015
A work stealing based approach for enabling scalable optimal sequence homology detection.
J. Parallel Distributed Comput., 2015
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015
2014
Introduction to the JPDC Special Issue on Domain-Specific Languages and High-Level Frameworks for High-Performance Computing.
J. Parallel Distributed Comput., 2014
Int. J. High Perform. Comput. Appl., 2014
Proceedings of the International Conference for High Performance Computing, 2014
Proceedings of the International Conference for High Performance Computing, 2014
Proceedings of the International Conference for High Performance Computing, 2014
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2014
Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014
Proceedings of the 43rd International Conference on Parallel Processing, 2014
Scalable replay with partial-order dependencies for message-logging fault tolerance.
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014
2013
A scalable infrastructure for the performance analysis of passive target synchronization.
Parallel Comput., 2013
Int. J. Parallel Program., 2013
Clust. Comput., 2013
A framework for load balancing of tensor contraction expressions via dynamic task partitioning.
Proceedings of the International Conference for High Performance Computing, 2013
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2013
Proceedings of the International Conference on Supercomputing, 2013
2012
Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions.
J. Parallel Distributed Comput., 2012
Performance characterization of global address space applications: a case study with NWChem.
Concurr. Comput. Pract. Exp., 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations through Resource Sharing Barriers.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
Proceedings of the International Conference on Supercomputing, 2012
On the Use of Term Rewriting for Performance Optimization of Legacy HPC Applications.
Proceedings of the 41st International Conference on Parallel Processing, 2012
Work stealing and persistence-based load balancers for iterative overdecomposed applications.
Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing, 2012
Proceedings of the 19th International Conference on High Performance Computing, 2012
Global Futures: A Multithreaded Execution Model for Global Arrays-based Applications.
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
2011
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011
Scalable implementations of accurate excited-state coupled cluster theories: application of high-level methods to porphyrin-based systems.
Proceedings of the Conference on High Performance Computing Networking, 2011
Poster: High-level, one-sided programming models on MPI: a case study with global arrays and NWChem.
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011
Proceedings of the Recent Advances in the Message Passing Interface, 2011
Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011
A Redundant Communication Approach to Scalable Fault Tolerance in PGAS Programming Models.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011
Tolerating correlated failures for generalized Cartesian distributions via bipartite matching.
Proceedings of the 8th Conference on Computing Frontiers, 2011
Practical Loop Transformations for Tensor Contraction Expressions on Multilevel Memory Hierarchies.
Proceedings of the Compiler Construction - 20th International Conference, 2011
Parameterized Microbenchmarking: An Autotuning Approach for Complex Applications.
Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011
2010
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010
Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010
2009
An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications.
IEEE Trans. Parallel Distributed Syst., 2009
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009
Proceedings of the 23rd international conference on Supercomputing, 2009
Scalable transparent checkpoint-restart of global address space applications on virtual machines over infiniband.
Proceedings of the 6th Conference on Computing Frontiers, 2009
Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors.
Proceedings of the PACT 2009, 2009
2008
Global trees: a framework for linked data structures on distributed memory parallel systems.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008
Automatic data movement and computation mapping for multilevel parallel architectures with explicitly managed memories.
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008
Proceedings of the 2008 International Conference on Parallel Processing, 2008
Proceedings of the 2008 International Conference on Parallel Processing, 2008
Proceedings of the Computational Science, 2008
Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model.
Proceedings of the Compiler Construction, 17th International Conference, 2008
2007
Concurr. Comput. Pract. Exp., 2007
Proceedings of the ACM SIGPLAN 2007 Conference on Programming Language Design and Implementation, 2007
A global address space framework for locality aware scheduling of block-sparse computations.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007
2006
J. Supercomput., 2006
Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver.
J. Parallel Distributed Comput., 2006
Data management and query - Hypergraph partitioning for automatic memory hierarchy management.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006
Blue Gene system software - Design and implementation of a one-sided communication interface for the IBM eServer Blue Gene® supercomputer.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006
An approach to locality-conscious load balancing and transparent memory hierarchy management with a global-address-space parallel programming model.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
An extensible global address space framework with decoupled task and data abstractions.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications.
Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006
Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations.
Proceedings of the Computational Science, 2006
Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O.
Proceedings of the 15th IEEE International Symposium on High Performance Distributed Computing, 2006
Locality Conscious Processor Allocation and Scheduling for Mixed Parallel Applications.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006
Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques (PACT 2006), 2006
2005
Synthesis of High-Performance Parallel Programs for a Class of ab Initio Quantum Chemistry Models.
Proc. IEEE, 2005
Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions.
Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005
Cache Miss Characterization and Data Locality Optimization for Imperfectly Nested Loops on Shared Memory Multiprocessors.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005
Proceedings of the High Performance Computing, 2005
2004
Int. J. High Perform. Comput. Netw., 2004
Proceedings of the Languages and Compilers for High Performance Computing, 2004
Proceedings of the High Performance Computing, 2004
2003
Proceedings of the High Performance Computing - HiPC 2003, 10th International Conference, 2003
Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003