Padma Raghavan

Proceedings of the 37th ACM Symposium on Parallelism in Algorithms and Architectures, 2025

2024

Multi-resource scheduling of moldable workflows.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., February, 2024

To Protect or Not To Protect: Probability-Aware Selective Protection for Sparse Iterative Solvers.

[BibT_eX]

[DOI]

Proceedings of the 36th IEEE International Symposium on Computer Architecture and High Performance Computing, 2024

2023

Dynamic Selective Protection of Sparse Iterative Solvers via ML Prediction of Soft Error Impacts.

[BibT_eX]

[DOI]

Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

2022

Resilient Scheduling of Moldable Parallel Jobs to Cope With Silent Errors.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2022

2021

Resilient Scheduling Heuristics for Rigid Parallel Jobs.

[BibT_eX]

[DOI]

Int. J. Netw. Comput., 2021

Multi-Resource List Scheduling of Moldable Parallel Jobs under Precedence Constraints.

[BibT_eX]

[DOI]

Lucas Perotin

Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

2020

Selective Protection for Sparse Iterative Solvers to Reduce the Resilience Overhead.

[BibT_eX]

[DOI]

Proceedings of the 32nd IEEE International Symposium on Computer Architecture and High Performance Computing, 2020

Reservation and Checkpointing Strategies for Stochastic Jobs.

[BibT_eX]

[DOI]

Ana Gainaru

Brice Goglin

Valentin Honoré

Guillaume Pallez Aupy

Yves Robert

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Design and Comparison of Resilient Scheduling Heuristics for Parallel Jobs.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Resilient Scheduling of Moldable Jobs on Failure-Prone Platforms.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019

A New Framework for Evaluating Straggler Detection Mechanisms in MapReduce.

[BibT_eX]

[DOI]

ACM Trans. Model. Perform. Evaluation Comput. Syst., 2019

On-the-fly scheduling versus reservation-based scheduling for unpredictable workflows.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2019

Reservation Strategies for Stochastic Jobs.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Speculative Scheduling for Stochastic HPC Applications.

[BibT_eX]

[DOI]

Proceedings of the 48th International Conference on Parallel Processing, 2019

2018

Coping with silent and fail-stop errors at scale by combining replication and checkpointing.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2018

Co-scheduling Amdahl applications on cache-partitioned systems.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2018

Realizing the potential of data science.

[BibT_eX]

[DOI]

Francine Berman

Rob A. Rutenbar

Brent Hailpern

Henrik I. Christensen

Commun. ACM, 2018

A Scalability and Sensitivity Study of Parallel Geometric Algorithms for Graph Partitioning.

[BibT_eX]

[DOI]

Shad Kirmani

Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

Scheduling Parallel Tasks under Multiple Resources: List Scheduling vs. Pack Scheduling.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

2017

An embedded sectioning scheme for multiprocessor topology-aware mapping of irregular applications.

[BibT_eX]

[DOI]

Shad Kirmani

JeongHyung Park

Int. J. High Perform. Comput. Appl., 2017

Co-Scheduling Algorithms for Cache-Partitioned Systems.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Identifying the Right Replication Level to Detect and Correct Silent Errors at Scale.

[BibT_eX]

[DOI]

Proceedings of the ACM Workshop on Fault-Tolerance for HPC at Extreme Scale, 2017

2016

Co-scheduling algorithms for high-throughput workload execution.

[BibT_eX]

[DOI]

J. Sched., 2016

Research and Education in Computational Science and Engineering.

[BibT_eX]

[DOI]

CoRR, 2016

Locality-Aware Laplacian Mesh Smoothing.

[BibT_eX]

[DOI]

Guillaume Aupy

JeongHyung Park

Proceedings of the 45th International Conference on Parallel Processing, 2016

2015

STS-k: a multilevel sparse triangular solution scheme for NUMA multicores.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Phase Detection with Hidden Markov Models for DVFS on Many-Core Processors.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE International Conference on Distributed Computing Systems, 2015

2014

Hybrid Sparse Linear Solutions with Substituted Factorization.

[BibT_eX]

[DOI]

Joshua Dennis Booth

Proceedings of the High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30, 2014

A multilevel compressed sparse row format for efficient sparse computations on multicore processors.

[BibT_eX]

[DOI]

Humayun Kabir

Joshua Dennis Booth

Proceedings of the 21st International Conference on High Performance Computing, 2014

2013

Special Issue: Selected Papers from Super Computing 2012.

[BibT_eX]

[DOI]

Jeffrey S. Vetter

Sci. Program., 2013

Speedup-Aware Co-Schedules for Efficient Workload Management.

[BibT_eX]

[DOI]

Youngtae Youn

Parallel Process. Lett., 2013

Scalable parallel graph partitioning.

[BibT_eX]

[DOI]

Shad Kirmani

Proceedings of the International Conference for High Performance Computing, 2013

Interference Resolver in Shared Storage Systems to Provide Fairness to I/O Intensive Applications.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

2012

Similarity Graph Neighborhoods for Enhanced Supervised Classification.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computational Science, 2012

NUMA-aware graph mining techniques for performance and energy efficiency.

[BibT_eX]

[DOI]

Michael R. Frasca

Kamesh Madduri

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Fault tolerant preconditioned conjugate gradient for sparse linear system solution.

[BibT_eX]

[DOI]

Sowmyalatha Srinivasmurthy

Proceedings of the International Conference on Supercomputing, 2012

Adapting Sparse Triangular Solution to GPUs.

[BibT_eX]

[DOI]

Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

Phase Partitioning Methods for I/O Cache Optimization.

[BibT_eX]

[DOI]

Michael R. Frasca

Proceedings of the 41st International Conference on Parallel Processing, 2012

2011

Can models of scientific software-hardware interactions be predictive?

[BibT_eX]

[DOI]

Michael R. Frasca

Proceedings of the International Conference on Computational Science, 2011

A Multilevel Cholesky Conjugate Gradients Hybrid Solver for Linear Systems with Multiple Right-hand Sides.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computational Science, 2011

Exploiting dense substructures for fast sparse matrix vector multiplication.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2011

Virtual I/O caching: dynamic storage cache management for concurrent workloads.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2011

Characterizing the impact of soft errors on iterative methods in scientific computing.

[BibT_eX]

[DOI]

Sowmyalatha Srinivasmurthy

Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011

2010

Parallel Hybrid Preconditioning: Incomplete Factorization with Selective Sparse Approximate Inversion.

[BibT_eX]

[DOI]

SIAM J. Sci. Comput., 2010

PFFTC: An improved fast Fourier transform for the IBM cell broadband engine.

[BibT_eX]

[DOI]

Andrew Shaffer

Bruce Einfalt

Proceedings of the International Conference on Computational Science, 2010

Characterizing sparse preconditioner performance for the support vector machine kernel.

[BibT_eX]

[DOI]

Kelly Fermoyle

Sai Prashanth Muralidhara

Proceedings of the International Conference on Computational Science, 2010

Intra-application shared cache partitioning for multithreaded applications.

[BibT_eX]

[DOI]

Sai Prashanth Muralidhara

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010

Intra-application cache partitioning.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

T-NUCA - a novel approach to non-uniform access latency cache architectures for 3D CMPs.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Analyzing the soft error resilience of linear solvers on multicore multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Dynamic core partitioning for energy efficiency.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Feature subspace transformations for enhancing k-means clustering.

[BibT_eX]

[DOI]

Sanjukta Bhowmick

Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009

Adapting application execution in CMPs using helper threads.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2009

Towards Low-Cost, High-Accuracy Classifiers for Linear Solver Selection.

[BibT_eX]

[DOI]

Sanjukta Bhowmick

Brice Toth

Proceedings of the Computational Science, 2009

Adapting Application Mapping to Systematic Within-Die Process Variations on Chip Multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the High Performance Embedded Architectures and Compilers, 2009

Hybrid Techniques for Fast Multicore Simulation.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2009 Parallel Processing, 2009

Markov Model Based Disk Power Management for Data Intensive Workloads.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 2009

2008

Evaluating the role of scratchpad memories in chip multiprocessors for sparse matrix computations.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Managing power, performance and reliability trade-offs.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Towards energy efficient scaling of scientific codes.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

A helper thread based EDP reduction scheme for adapting application execution in CMPs.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Ring data location prediction scheme for Non-Uniform Cache Architectures.

[BibT_eX]

[DOI]

Proceedings of the 26th International Conference on Computer Design, 2008

2007

Reducing energy consumption of parallel sparse matrix applications through integrated link/CPU voltage scaling.

[BibT_eX]

[DOI]

J. Supercomput., 2007

Phase-aware adaptive hardware selection for power-efficient scientific computations.

[BibT_eX]

[DOI]

Proceedings of the 2007 International Symposium on Low Power Electronics and Design, 2007

Analysis of the IPv4 Address Space Delegation Structure.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE Symposium on Computers and Communications (ISCC 2007), 2007

Memory Optimizations For Fast Power-Aware Sparse Computations.

[BibT_eX]

[DOI]

Mary Jane Irwin

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Load Miss Prediction - Exploiting Power Performance Trade-offs.

[BibT_eX]

[DOI]

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Link Shutdown Opportunities During Collective Communications in 3-D Torus Nets.

[BibT_eX]

[DOI]

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Ring Prediction for Non-Uniform Cache Architectures.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques (PACT 2007), 2007

2006

Effective Preconditioning through Ordering Interleaved with Incomplete Factorization.

[BibT_eX]

[DOI]

Ingyu Lee

SIAM J. Matrix Anal. Appl., 2006

Poster reception - Toward a power efficient computer architecture for Barnes-Hut N-body simulations.

[BibT_eX]

[DOI]

Mary Jane Irwin

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Poster reception - Energy/performance modeling for collective communication in 3-D torus cluster networks.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Integrated link/CPU voltage scaling for reducing energy consumption of parallel sparse matrix applications.

[BibT_eX]

[DOI]

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Conjugate gradient sparse solvers: performance-power characteristics.

[BibT_eX]

[DOI]

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

On improving performance and energy profiles of sparse scientific applications.

[BibT_eX]

[DOI]

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Characterizing the Performance and Energy Attributes of Scientific Simulations.

[BibT_eX]

[DOI]

Proceedings of the Computational Science, 2006

Opportunities and Challenges for Parallel Computing in Science and Engineering.

[BibT_eX]

[DOI]

Michael A. Heroux

Horst D. Simon

Proceedings of the Parallel Processing for Scientific Computing, 2006

Frontiers of Scientific Computing: An Overview.

[BibT_eX]

[DOI]

Michael A. Heroux

Horst D. Simon

Proceedings of the Parallel Processing for Scientific Computing, 2006

2005

Adaptive Software for Scientific Computing: Co-Managing Quality-Performance-Power Tradeoffs.

[BibT_eX]

[DOI]

Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Reducing Power with Performance Constraints for Parallel Sparse Applications.

[BibT_eX]

[DOI]

Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Multi-pass Mapping Schemes for Parallel Sparse Matrix Computations.

[BibT_eX]

[DOI]

Proceedings of the Computational Science, 2005

2004

Faster PDE-based simulations using robust composite linear solvers.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2004

Parallel Hybrid Sparse Solvers Through Flexible Incomplete Cholesky Preconditioning.

[BibT_eX]

[DOI]

Proceedings of the Applied Parallel Computing, 2004

Advanced Algorithms and Software Components for Scientific Computing: An Introduction.

[BibT_eX]

[DOI]

Proceedings of the Applied Parallel Computing, 2004

Towards a Grid enabled system for multicomponent materials design.

[BibT_eX]

[DOI]

Zi-Kui Liu

Proceedings of the 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2004), 2004

2003

A latency tolerant hybrid sparse solver using incomplete Cholesky factorization.

[BibT_eX]

[DOI]

Numer. Linear Algebra Appl., 2003

Time-Memory Trade-Offs Using Sparse Matrix Methods for Large-Scale Eigenvalue Problems.

[BibT_eX]

[DOI]

Chao Yang

Proceedings of the Computational Science and Its Applications, 2003

The Role of Multi-method Linear Solvers in PDE-based Simulations.

[BibT_eX]

[DOI]

Proceedings of the Computational Science and Its Applications, 2003

2002

Large-Scale Normal Coordinate Analysis on Distributed Memory Parallel Systems.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2002

A new data-mapping scheme for latency-tolerant distributed sparse triangular solution.

[BibT_eX]

[DOI]

Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

A Combinatorial Scheme for Developing Efficient Composite Solvers.

[BibT_eX]

[DOI]

Sanjukta Bhowmick

Proceedings of the Computational Science - ICCS 2002, 2002

2001

Level search schemes for information filtering and retrieval.

[BibT_eX]

[DOI]

Xiaoyan Zhang

Michael W. Berry

Inf. Process. Manag., 2001

Scalable Preconditioning Using Incomplete Factors.

[BibT_eX]

Proceedings of the Tenth SIAM Conference on Parallel Processing for Scientific Computing, 2001

2000

Towards a Scalable Hybrid Sparse Solver.

[BibT_eX]

[DOI]

Concurr. Pract. Exp., 2000

A Grid Computing Environment for Enabling Large Scale Quantum Mechanical Simulations.

[BibT_eX]

[DOI]

Jack J. Dongarra

Proceedings of the Grid Computing, 2000

1999

Performance of Greedy Ordering Heuristics for Sparse Cholesky Factorization.

[BibT_eX]

[DOI]

SIAM J. Matrix Anal. Appl., 1999

Incomplete Cholesky Parallel Preconditioners with Selective Inversion.

[BibT_eX]

Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

1998

Efficient Parallel Sparse Triangular Solution Using Selective Inversion.

[BibT_eX]

[DOI]

Parallel Process. Lett., 1998

1997

Parallel Ordering Using Edge Contraction.

[BibT_eX]

[DOI]

Parallel Comput., 1997

Performance of a Fully Parallel Sparse Solver.

[BibT_eX]

[DOI]

Michael T. Heath

Int. J. High Perform. Comput. Appl., 1997

1995

Distributed Sparse Gaussian Elimination and Orthogonal Factorization.

[BibT_eX]

[DOI]

SIAM J. Sci. Comput., 1995

A Cartesian Parallel Nested Dissection Algorithm.

[BibT_eX]

[DOI]

Michael T. Heath

SIAM J. Matrix Anal. Appl., 1995

1988

Distributed orthogonal factorization.

[BibT_eX]

[DOI]

Alex Pothen