We stand with Ukraine

We stand with Ukraine

Ann C. Gentile

Affiliations:

Sandia National Laboratories, USA

According to our database¹, Ann C. Gentile authored at least 38 papers between 1997 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

An Autonomy Loop for Dynamic HPC Job Time Limit Adjustment.

[DOI]

Thomas Jakobsche

,

Osman Seckin Simsek

,

,

,

Florina M. Ciorba

Proceedings of the Euro-Par 2025: Parallel Processing, 2025

2023

Driving HPC Operations With Holistic Monitoring and Operational Data Analytics (Dagstuhl Seminar 23171).

[DOI]

,

Florina M. Ciorba

,

,

,

Dagstuhl Reports, 2023

Autonomy Loops for Monitoring, Operational Data Analytics, Feedback, and Response in HPC Operations.

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2023

2021

Delay sensitivity-driven congestion mitigation for HPC systems.

[DOI]

,

,

,

,

,

,

Zbigniew Kalbarczyk

,

Ravishankar K. Iyer

Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

2020

Application-aware Congestion Mitigation forHigh-Performance Computing Systems.

[DOI]

,

,

,

,

,

,

Zbigniew Kalbarczyk

,

Ravishankar K. Iyer

CoRR, 2020

ALAMO: Autonomous Lightweight Allocation, Management, and Optimization.

[DOI]

,

Kurt B. Ferreira

,

,

,

Jay F. Lofstead

,

Stephen L. Olivier

,

Kevin T. Pedretti

,

Andrew J. Younge

,

,

Proceedings of the Driving Scientific and Engineering Discoveries Through the Convergence of HPC, Big Data and AI, 2020

Measuring Congestion in High-Performance Datacenter Interconnects.

[DOI]

,

,

,

,

,

,

,

,

Zbigniew Kalbarczyk

,

,

Proceedings of the 17th USENIX Symposium on Networked Systems Design and Implementation, 2020

2019

Understanding Fault Scenarios and Impacts through Fault Injection Experiments in Cielo.

[DOI]

Valerio Formicola

,

,

,

,

,

,

,

,

,

,

,

,

Annette Greiner

,

Zbigniew Kalbarczyk

,

Ravishankar K. Iyer

,

CoRR, 2019

A Study of Network Congestion in Two Supercomputing High-Speed Interconnects.

[DOI]

,

,

,

,

,

,

Zbigniew T. Kalbarczyk

,

,

Ravishankar K. Iyer

Proceedings of the 2019 IEEE Symposium on High-Performance Interconnects, 2019

2018

Integrating Low-latency Analysis into HPC System Monitoring.

[DOI]

Ramin Izadpanah

,

Nichamon Naksinehaboon

,

,

,

Proceedings of the 47th International Conference on Parallel Processing, 2018

Characterizing Supercomputer Traffic Networks Through Link-Level Analysis.

[DOI]

,

,

,

Zbigniew Kalbarczyk

,

Ravishankar K. Iyer

Proceedings of the IEEE International Conference on Cluster Computing, 2018

Large-Scale System Monitoring Experiences and Recommendations.

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017

Holistic Measurement-Driven System Assessment.

[DOI]

,

,

,

Zbigniew Kalbarczyk

,

Gregory H. Bauer

,

,

Michael T. Showerman

,

,

,

Annette Greiner

,

,

,

Ravishankar K. Iyer

,

Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016

Continuous whole-system monitoring toward rapid understanding of production HPC applications and systems.

[DOI]

Anthony M. Agelastos

,

Benjamin A. Allan

,

,

,

Sophia Lefantzi

,

,

,

,

Parallel Comput., 2016

Large-Scale Persistent Numerical Data Source Monitoring System Experiences.

[DOI]

,

,

Michael T. Showerman

,

,

,

Gregory H. Bauer

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

HPCMASPA Introduction and Committees.

[DOI]

Benjamin A. Allan

,

,

,

Cory Lueninghoener

,

Nichamon Naksinehaboon

,

,

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015

Overtime: a tool for analyzing performance variation due to network interference.

[DOI]

,

Kevin T. Pedretti

,

Proceedings of the 3rd Workshop on Exascale MPI, 2015

Infrastructure for In Situ System Monitoring and Application Data Analysis.

[DOI]

,

Karen D. Devine

,

Proceedings of the First Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, 2015

New Systems, New Behaviors, New Patterns: Monitoring Insights from System Standup.

[DOI]

,

,

,

,

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Toward Rapid Understanding of Production HPC Applications and Systems.

[DOI]

Anthony M. Agelastos

,

Benjamin A. Allan

,

,

,

Sophia Lefantzi

,

,

,

,

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014

The Lightweight Distributed Metric Service: A Scalable Infrastructure for Continuous Monitoring of Large Scale Computing Systems and Applications.

[DOI]

Anthony M. Agelastos

,

Benjamin A. Allan

,

,

,

,

,

,

,

Nichamon Naksinehaboon

,

,

,

Michael T. Showerman

,

,

,

Thomas W. Tucker

Proceedings of the International Conference for High Performance Computing, 2014

Demonstrating improved application performance using dynamic monitoring and task mapping.

[DOI]

,

Karen D. Devine

,

,

Kevin T. Pedretti

Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014

2012

Filtering log data: Finding the needles in the Haystack.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks, 2012

2011

Baler: deterministic, lossless log message clustering tool.

[DOI]

,

,

,

,

Chokchai Leangsuksun

Comput. Sci. Res. Dev., 2011

Framework for Enabling System Understanding.

[DOI]

,

,

,

Chokchai Leangsuksun

,

Jackson R. Mayo

,

Philippe P. Pébay

,

,

,

David C. Thompson

,

Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

2010

Combining Virtualization, resource characterization, and Resource management to enable efficient high performance compute platforms through intelligent dynamic resource allocation.

[DOI]

,

,

Vincent De Sapio

,

,

Jackson R. Mayo

,

Philippe P. Pébay

,

,

David C. Thompson

,

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Quantifying effectiveness of failure prediction and response in HPC systems: Methodology and example.

[DOI]

,

,

Vincent De Sapio

,

,

Jackson R. Mayo

,

Philippe P. Pébay

,

,

David C. Thompson

,

Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W 2010), Chicago, Illinois, USA, June 28, 2010

Using Cloud Constructs and Predictive Analysis to Enable Pre-Failure Process Migration in HPC Systems.

[DOI]

,

,

Vincent De Sapio

,

,

Jackson R. Mayo

,

Philippe P. Pébay

,

,

David C. Thompson

,

Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

2009

Resource monitoring and management with OVIS to enable HPC in cloud computing environments.

[DOI]

,

,

Jackson R. Mayo

,

Philippe P. Pébay

,

,

David C. Thompson

,

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

2008

Ovis-2: A robust distributed architecture for scalable RAS.

[DOI]

,

Bert J. Debusschere

,

,

Jackson R. Mayo

,

Philippe P. Pébay

,

David C. Thompson

,

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Using Probabilistic Characterization to Reduce Runtime Faults in HPC Systems.

[DOI]

,

Bert J. Debusschere

,

,

Jackson R. Mayo

,

Philippe P. Pébay

,

David C. Thompson

,

Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

2006

OVIS: a tool for intelligent, real-time monitoring of computational clusters.

[DOI]

,

,

,

Philippe P. Pébay

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

2005

Meaningful Automated Statistical Analysis of Large Computational Clusters.

[DOI]

,

,

Youssef M. Marzouk

,

Philippe P. Pébay

Proceedings of the 2005 IEEE International Conference on Cluster Computing (CLUSTER 2005), September 26, 2005

1999

Lilith Lights: A Network Traffic Visualization Tool for High Performance Clusters.

[DOI]

David A. Evensky

,

,

Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999

The Lilith framework for the rapid development of secure scalable tools for distributed computing (short paper).

David A. Evensky

,

,

,

Robert C. Armstrong

Proceedings of the Distributed Applications and Interoperable Systems II, Second IFIP WG 6.1 International Working Conference on Distributed Applications and Interoperable Systems, June 28, 1999

1998

A visualization tool for parallel and distributed computing using the Lilith framework.

[DOI]

David A. Evensky

,

,

Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998

Lilith: A Software Framework for the Rapid Development of Scalable Tools for Distributed Computing.

[DOI]

,

David A. Evensky

,

Robert C. Armstrong

Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing, 1998

1997

Lilith: Scalable Execution of User Code for Distributed Computing.

[DOI]

David A. Evensky

,

,

,

Robert C. Armstrong

Proceedings of the 6th International Symposium on High Performance Distributed Computing, 1997

Loading...