Todd Gamblin

Orcid: 0000-0002-7857-2805

Affiliations:
  • Lawrence Livermore National Laboratory, Livermore, CA, USA
  • University of North Carolina at Chapel Hill, Renaissance Computing Institute, NC, USA


According to our database1, Todd Gamblin authored at least 105 papers between 2006 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Scalable Comparative Visualization of Ensembles of Call Graphs.
IEEE Trans. Vis. Comput. Graph., March, 2023

Modeling Parallel Programs using Large Language Models.
CoRR, 2023

Machine Learning-Driven Adaptive OpenMP For Portable Performance on Heterogeneous Systems.
CoRR, 2023

Towards Collaborative Continuous Benchmarking for HPC.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Flexible and Optimal Dependency Management via Max-SMT.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

2022
AI4IO: A suite of AI-based tools for IO-aware scheduling.
Int. J. High Perform. Comput. Appl., 2022

Overcoming Challenges to Continuous Integration in HPC.
Comput. Sci. Eng., 2022

Using Solver-Aided Languages to Build Package Managers.
CoRR, 2022

Reliabuild: Searching for High-Fidelity Builds Using Active Learning.
CoRR, 2022

Mapping Out the HPC Dependency Chaos.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Using Answer Set Programming for HPC Dependency Solving.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Searching for High-Fidelity Builds Using Active Learning.
Proceedings of the 19th IEEE/ACM International Conference on Mining Software Repositories, 2022

Resource Utilization Aware Job Scheduling to Mitigate Performance Variability.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

2021
Visualizing Hierarchical Performance Profiles of Parallel Codes Using CallFlow.
IEEE Trans. Vis. Comput. Graph., 2021

Extending OpenMP for Machine Learning-Driven Adaptation.
Proceedings of the Accelerator Programming Using Directives - 8th International Workshop, 2021

Artemis: Automatic Runtime Tuning of Parallel Execution Parameters Using Machine Learning.
Proceedings of the High Performance Computing - 36th International Conference, 2021

A Holistic View of Memory Utilization on HPC Systems: Current and Future Trends.
Proceedings of the MEMSYS 2021: The International Symposium on Memory Systems, Washington, USA, September 27, 2021

2020
Scalable Comparative Visualization of Ensembles of Call Graphs.
CoRR, 2020

archspec: A library for detecting, labeling, and reasoning about microarchitectures.
Proceedings of the 2nd International Workshop on Containers and New Orchestration Paradigms for Isolated Environments in HPC, 2020

Usability and Performance Improvements in Hatchet.
Proceedings of the IEEE/ACM International Workshop on HPC User Support Tools and Workshop on Programming and Performance Visualization Tools, 2020

Workflows are the New Applications: Challenges in Performance, Portability, and Productivity.
Proceedings of the IEEE/ACM International Workshop on Performance, 2020

CanarIO: Sounding the Alarm on IO-Related Performance Degradation.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Auto-tuning Parameter Choices in HPC Applications using Bayesian Optimization.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

CodeSeer: input-dependent code variants selection via machine learning.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

2019
Preserving Command Line Workflow for a Package Management System Using ASCII DAG Visualization.
IEEE Trans. Vis. Comput. Graph., 2019

Using Malleable Task Scheduling to Accelerate Package Manager Installations.
Proceedings of the Tools and Techniques for High Performance Computing, 2019


Hatchet: pruning the overgrowth in parallel profiles.
Proceedings of the International Conference for High Performance Computing, 2019

Analyzing Cost-Performance Tradeoffs of HPC Network Designs under Different Constraints using Simulations.
Proceedings of the 2019 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, 2019

FuncyTuner: Auto-tuning Scientific Applications With Per-loop Compilation.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
MemAxes: Visualization and Analytics for Characterizing Complex Memory Performance Behaviors.
IEEE Trans. Vis. Comput. Graph., 2018

Autotuning in High-Performance Computing Applications.
Proc. IEEE, 2018

PADDLE: Performance Analysis Using a Data-Driven Learning Environment.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Bootstrapping Parameter Space Exploration for Fast Tuning.
Proceedings of the 32nd International Conference on Supercomputing, 2018

PRIONN: Predicting Runtime and IO using Neural Networks.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
xSDK Foundations: Toward an Extreme-scale Scientific Software Development Kit.
Supercomput. Front. Innov., 2017

Projecting Performance Data over Simulation Geometry Using SOSflow and ALPINE.
Proceedings of the Programming and Performance Visualization Tools, 2017

Performance modeling under resource constraints using deep transfer learning.
Proceedings of the International Conference for High Performance Computing, 2017

Predicting the performance impact of different fat-tree configurations.
Proceedings of the International Conference for High Performance Computing, 2017

ScrubJay: deriving knowledge from the disarray of HPC performance data.
Proceedings of the International Conference for High Performance Computing, 2017

DR-BW: Identifying Bandwidth Contention in NUMA Architectures with Supervised Learning.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Partitioning Low-Diameter Networks to Eliminate Inter-Job Interference.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

REPPAR Keynote.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Apollo: Reusable Models for Fast, Dynamic Tuning of Input-Dependent Code.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016
Ordering Traces Logically to Identify Lateness in Message Passing Programs.
IEEE Trans. Parallel Distributed Syst., 2016

Evaluating and extending user-level fault tolerance in MPI applications.
Int. J. High Perform. Comput. Appl., 2016

A Scalable Observation System for Introspection and In Situ Analytics.
Proceedings of the 5th Workshop on Extreme-Scale Programming Tools, 2016

VIPACT: A Visualization Interface for Analyzing Calling Context Trees.
Proceedings of the Third Workshop on Visual Performance Analysis, 2016

Evaluating HPC networks via simulation of parallel workloads.
Proceedings of the International Conference for High Performance Computing, 2016

A machine learning framework for performance coverage analysis of proxy applications.
Proceedings of the International Conference for High Performance Computing, 2016

Caliper: performance introspection for HPC software stacks.
Proceedings of the International Conference for High Performance Computing, 2016

Managing Combinatorial Software Installations with Spack.
Proceedings of the 2016 Third International Workshop on HPC User Support Tools, 2016

A Study of Failures in Community Clusters: The Case of Conte.
Proceedings of the 2016 IEEE International Symposium on Software Reliability Engineering Workshops, 2016

IPDRM Introduction and Committees.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

MPMD Framework for Offloading Load Balance Computation.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

VarSys Introduction.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Machine Learning Predictions of Runtime and IO Traffic on High-End Clusters.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

2015
Diagnosis of Performance Faults in LargeScale MPI Applications via Probabilistic Progress-Dependence Inference.
IEEE Trans. Parallel Distributed Syst., 2015

Connecting Performance Analysis and Visualization (Dagstuhl Perspectives Workshop 14022).
Dagstuhl Manifestos, 2015

Debugging high-performance computing applications at massive scales.
Commun. ACM, 2015

Recovering logical structure from Charm++ event traces.
Proceedings of the International Conference for High Performance Computing, 2015

Relating memory performance data to application domain data using an integration API.
Proceedings of the 2nd Workshop on Visual Performance Analysis, 2015

The Spack package manager: bringing order to HPC software chaos.
Proceedings of the International Conference for High Performance Computing, 2015

Decoupled load balancing.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

Identifying the Culprits Behind Network Congestion.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014
Combing the Communication Hairball: Visualizing Parallel Execution Traces using Logical Time.
IEEE Trans. Vis. Comput. Graph., 2014

State of the Art of Performance Visualization.
Proceedings of the 16th Eurographics Conference on Visualization, 2014

Dissecting On-Node Memory Access Performance: A Semantic Approach.
Proceedings of the International Conference for High Performance Computing, 2014

Evaluating User-Level Fault Tolerance for MPI Applications.
Proceedings of the 21st European MPI Users' Group Meeting, 2014

Extracting logical structure and identifying stragglers in parallel execution traces.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Accurate application progress analysis for large-scale parallel debugging.
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2014

FMI: Fault Tolerant Messaging Interface for Fast and Transparent Recovery.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Load balancing n-body simulations with highly non-uniform density.
Proceedings of the 2014 International Conference on Supercomputing, 2014

Optimizing the performance of parallel applications on a 5D torus via task mapping.
Proceedings of the 21st International Conference on High Performance Computing, 2014

A User-Level InfiniBand-Based File System and Checkpoint Strategy for Burst Buffers.
Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014

2013
Parallelizing heavyweight debugging tools with mpiecho.
Parallel Comput., 2013

Trellis: Portability across architectures with a high-level framework.
J. Parallel Distributed Comput., 2013

Predicting application performance using supervised learning on communication features.
Proceedings of the International Conference for High Performance Computing, 2013

Performance Analysis Techniques for the Exascale Co-Design Process.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

Efficient and Scalable Retrieval Techniques for Global File Properties.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Massively parallel loading.
Proceedings of the International Conference on Supercomputing, 2013

2012
Visualizing Network Traffic to Understand the Performance of Massively Parallel Simulations.
IEEE Trans. Vis. Comput. Graph., 2012

Design and modeling of a non-blocking checkpointing system.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Abstract: Slack-Conscious Lightweight Loop Scheduling for Improving Scalability of Bulk-synchronous MPI Applications.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Exploring Performance Data with Boxfish.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Mapping applications with collectives over sub-communicators on torus networks.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Novel views of performance data to analyze large-scale adaptive applications.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Poster: Evaluation Topology Mapping via Graph Partitioning.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Evaluating Topology Mapping via Graph Partitioning.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

The myrmics memory allocator: hierarchical, message-passing allocation for global address spaces.
Proceedings of the International Symposium on Memory Management, 2012

Quantifying the effectiveness of load balance algorithms.
Proceedings of the International Conference on Supercomputing, 2012

Probabilistic diagnosis of performance faults in large-scale parallel applications.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011
Memory Trace Compression and Replay for SPMD Systems using Extended PRSDs?
SIGMETRICS Perform. Evaluation Rev., 2011

Large scale debugging of parallel tasks with AutomaDeD.
Proceedings of the Conference on High Performance Computing Networking, 2011

Creating a Tool Set for Optimizing Topology-Aware Node Mappings.
Proceedings of the Tools for High Performance Computing 2011, 2011

Reconciling Sampling and Direct Instrumentation for Unintrusive Call-Path Profiling of MPI Programs.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Challenges of Scaling Algebraic Multigrid Across Modern Multicore Architectures.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Interpreting Performance Data across Intuitive Domains.
Proceedings of the International Conference on Parallel Processing, 2011

2010
ScalaTrace: Tracing, Analysis and Modeling of HPC Codes at Scale.
Proceedings of the Applied Parallel and Scientific Computing, 2010

Clustering performance data efficiently at massive scales.
Proceedings of the 24th International Conference on Supercomputing, 2010

Scaling Algebraic Multigrid Solvers: On the Road to Exascale.
Proceedings of the Competence in High Performance Computing 2010, 2010

2009
Scalable performance measurement and analysis.
PhD thesis, 2009

2008
Scalable load-balance measurement for SPMD codes.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Scalable methods for monitoring and detecting behavioral equivalence classes in scientific codes.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2006
Challenges of Scale: When All Computing Becomes Grid Computing.
Proceedings of the High Performance Computing and Grids in Action, 2006


  Loading...