Todd Gamblin

According to our database, Todd Gamblin authored at least 79 papers between 2006 and 2019.

Bibliography

2019
Preserving Command Line Workflow for a Package Management System Using ASCII DAG Visualization.
IEEE Trans. Vis. Comput. Graph., 2019


Hatchet: pruning the overgrowth in parallel profiles.
Proceedings of the International Conference for High Performance Computing, 2019

Analyzing Cost-Performance Tradeoffs of HPC Network Designs under Different Constraints using Simulations.
Proceedings of the 2019 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, 2019

FuncyTuner: Auto-tuning Scientific Applications With Per-loop Compilation.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
MemAxes: Visualization and Analytics for Characterizing Complex Memory Performance Behaviors.
IEEE Trans. Vis. Comput. Graph., 2018

Autotuning in High-Performance Computing Applications.
Proceedings of the IEEE, 2018

PADDLE: Performance Analysis Using a Data-Driven Learning Environment.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Bootstrapping Parameter Space Exploration for Fast Tuning.
Proceedings of the 32nd International Conference on Supercomputing, 2018

PRIONN: Predicting Runtime and IO using Neural Networks.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
xSDK Foundations: Toward an Extreme-scale Scientific Software Development Kit.
CoRR, 2017

Projecting Performance Data over Simulation Geometry Using SOSflow and ALPINE.
Proceedings of the Programming and Performance Visualization Tools, 2017

Performance modeling under resource constraints using deep transfer learning.
Proceedings of the International Conference for High Performance Computing, 2017

Predicting the performance impact of different fat-tree configurations.
Proceedings of the International Conference for High Performance Computing, 2017

ScrubJay: deriving knowledge from the disarray of HPC performance data.
Proceedings of the International Conference for High Performance Computing, 2017

DR-BW: Identifying Bandwidth Contention in NUMA Architectures with Supervised Learning.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Partitioning Low-Diameter Networks to Eliminate Inter-Job Interference.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

REPPAR Keynote.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Apollo: Reusable Models for Fast, Dynamic Tuning of Input-Dependent Code.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016
Ordering Traces Logically to Identify Lateness in Message Passing Programs.
IEEE Trans. Parallel Distrib. Syst., 2016

Evaluating and extending user-level fault tolerance in MPI applications.
IJHPCA, 2016

A Scalable Observation System for Introspection and In Situ Analytics.
Proceedings of the 5th Workshop on Extreme-Scale Programming Tools, 2016

VIPACT: A Visualization Interface for Analyzing Calling Context Trees.
Proceedings of the Third Workshop on Visual Performance Analysis, 2016

Evaluating HPC networks via simulation of parallel workloads.
Proceedings of the International Conference for High Performance Computing, 2016

A machine learning framework for performance coverage analysis of proxy applications.
Proceedings of the International Conference for High Performance Computing, 2016

Caliper: performance introspection for HPC software stacks.
Proceedings of the International Conference for High Performance Computing, 2016

Managing Combinatorial Software Installations with Spack.
Proceedings of the 2016 Third International Workshop on HPC User Support Tools, 2016

A Study of Failures in Community Clusters: The Case of Conte.
Proceedings of the 2016 IEEE International Symposium on Software Reliability Engineering Workshops, 2016

IPDRM Introduction and Committees.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

MPMD Framework for Offloading Load Balance Computation.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

VarSys Introduction.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Machine Learning Predictions of Runtime and IO Traffic on High-End Clusters.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

2015
Diagnosis of Performance Faults in Large-Scale MPI Applications via Probabilistic Progress-Dependence Inference.
IEEE Trans. Parallel Distrib. Syst., 2015

Connecting Performance Analysis and Visualization (Dagstuhl Perspectives Workshop 14022).
Dagstuhl Manifestos, 2015

Debugging high-performance computing applications at massive scales.
Commun. ACM, 2015

Recovering logical structure from Charm++ event traces.
Proceedings of the International Conference for High Performance Computing, 2015

Relating memory performance data to application domain data using an integration API.
Proceedings of the 2nd Workshop on Visual Performance Analysis, 2015

The Spack package manager: bringing order to HPC software chaos.
Proceedings of the International Conference for High Performance Computing, 2015

Decoupled load balancing.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

Identifying the Culprits Behind Network Congestion.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014
Combing the Communication Hairball: Visualizing Parallel Execution Traces using Logical Time.
IEEE Trans. Vis. Comput. Graph., 2014

State of the Art of Performance Visualization.
Proceedings of the Eurographics Conference on Visualization, 2014

Dissecting On-Node Memory Access Performance: A Semantic Approach.
Proceedings of the International Conference for High Performance Computing, 2014

Evaluating User-Level Fault Tolerance for MPI Applications.
Proceedings of the 21st European MPI Users' Group Meeting, 2014

Extracting logical structure and identifying stragglers in parallel execution traces.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Accurate application progress analysis for large-scale parallel debugging.
Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2014

FMI: Fault Tolerant Messaging Interface for Fast and Transparent Recovery.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Load balancing n-body simulations with highly non-uniform density.
Proceedings of the 2014 International Conference on Supercomputing, 2014

Optimizing the performance of parallel applications on a 5D torus via task mapping.
Proceedings of the 21st International Conference on High Performance Computing, 2014

A User-Level InfiniBand-Based File System and Checkpoint Strategy for Burst Buffers.
Proceedings of the 14th IEEE/ACM International Symposium on Cluster, 2014

2013
Parallelizing heavyweight debugging tools with mpiecho.
Parallel Comput., 2013

Trellis: Portability across architectures with a high-level framework.
J. Parallel Distributed Comput., 2013

Predicting application performance using supervised learning on communication features.
Proceedings of the International Conference for High Performance Computing, 2013

Performance Analysis Techniques for the Exascale Co-Design Process.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

Efficient and Scalable Retrieval Techniques for Global File Properties.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

Massively parallel loading.
Proceedings of the International Conference on Supercomputing, 2013

2012
Visualizing Network Traffic to Understand the Performance of Massively Parallel Simulations.
IEEE Trans. Vis. Comput. Graph., 2012

Design and modeling of a non-blocking checkpointing system.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Abstract: Slack-Conscious Lightweight Loop Scheduling for Improving Scalability of Bulk-synchronous MPI Applications.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Exploring Performance Data with Boxfish.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Mapping applications with collectives over sub-communicators on torus networks.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Novel views of performance data to analyze large-scale adaptive applications.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Poster: Evaluating Topology Mapping via Graph Partitioning.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Evaluating Topology Mapping via Graph Partitioning.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

The myrmics memory allocator: hierarchical, message-passing allocation for global address spaces.
Proceedings of the International Symposium on Memory Management, 2012

Quantifying the effectiveness of load balance algorithms.
Proceedings of the International Conference on Supercomputing, 2012

Probabilistic diagnosis of performance faults in large-scale parallel applications.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011
Memory Trace Compression and Replay for SPMD Systems using Extended PRSDs.
SIGMETRICS Performance Evaluation Review, 2011

Large scale debugging of parallel tasks with AutomaDeD.
Proceedings of the Conference on High Performance Computing Networking, 2011

Creating a Tool Set for Optimizing Topology-Aware Node Mappings.
Proceedings of the Tools for High Performance Computing 2011, 2011

Reconciling Sampling and Direct Instrumentation for Unintrusive Call-Path Profiling of MPI Programs.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Challenges of Scaling Algebraic Multigrid Across Modern Multicore Architectures.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Interpreting Performance Data across Intuitive Domains.
Proceedings of the International Conference on Parallel Processing, 2011

2010
ScalaTrace: Tracing, Analysis and Modeling of HPC Codes at Scale.
Proceedings of the Applied Parallel and Scientific Computing, 2010

Clustering performance data efficiently at massive scales.
Proceedings of the 24th International Conference on Supercomputing, 2010

Scaling Algebraic Multigrid Solvers: On the Road to Exascale.
Proceedings of the Competence in High Performance Computing 2010, 2010

2008
Scalable load-balance measurement for SPMD codes.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Scalable methods for monitoring and detecting behavioral equivalence classes in scientific codes.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2006
Challenges of Scale: When All Computing Becomes Grid Computing.
Proceedings of the High Performance Computing and Grids in Action, 2006

