Surendra Byna

Proceedings of the SNTA@HPDC 2022, 2022

Access Patterns and Performance Behaviors of Multi-layer Supercomputer I/O Subsystems under Production Load.

Proceedings of the HPDC '22: The 31st International Symposium on High-Performance Parallel and Distributed Computing, Minneapolis, MN, USA, 27 June 2022, 2022

HDF5 Cache VOL: Efficient and Scalable Parallel I/O through Caching Data on Node-local Storage.

Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

2021

User-Defined Tensor Data Analysis.

Springer Briefs in Computer Science, Springer, ISBN: 978-3-030-70749-1, 2021

Exploiting user activeness for data retention in HPC systems.

Proceedings of the International Conference for High Performance Computing, 2021

Data-Aware Storage Tiering for Deep Learning.

Proceedings of the 6th IEEE/ACM International Parallel Data Systems Workshop, 2021

SCTuner: An Autotuner Addressing Dynamic I/O Needs on Supercomputer I/O Subsystems.

Proceedings of the 6th IEEE/ACM International Parallel Data Systems Workshop, 2021

I/O Bottleneck Detection and Tuning: Connecting the Dots using Interactive Log Analysis.

Jean Luca Bez

Houjun Tang

Bing Xie

David B. Williams-Young

Proceedings of the 6th IEEE/ACM International Parallel Data Systems Workshop, 2021

An In-Depth I/O Pattern Analysis in HPC Systems.

Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

Characterizing Impacts of Storage Faults on HPC Applications: A Methodology and Insights.

Sriram Krishnamoorthy

Dingwen Tao

Proceedings of the IEEE International Conference on Cluster Computing, 2021

Battle of the Defaults: Extracting Performance Characteristics of HDF5 under Production Load.

Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021

Tuning Parallel Data Compression and I/O for Large-scale Earthquake Simulation.

Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Optimizing Performance of Parallel I/O Accesses to Non-contiguous Blocks in Multiple Array Variables.

Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020

ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems.

J. Comput. Sci. Technol., 2020

Interfacing HDF5 with a scalable object-centric storage system on hierarchical storage.

Concurr. Comput. Pract. Exp., 2020

GPU Direct I/O with HDF5.

John Ravi

Quincey Koziol

Proceedings of the Fifth IEEE/ACM International Parallel Data Systems Workshop, 2020

Cross-facility science with the Superfacility Project at LBNL.

Proceedings of the 2nd IEEE/ACM Annual Workshop on Extreme-scale Experiment-in-the-Loop Computing, 2020

Parallel Query Service for Object-centric Data Management Systems.

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Predicting and Comparing the Performance of Array Management Libraries.

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

DASSA: Parallel DAS Data Storage and Analysis for Subsurface Event Detection.

Verónica Rodríguez Tribaldos

Xin Xing

Jonathan Ajo-Franklin

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

Towards HPC I/O Performance Prediction through Large-scale Log Analysis.

Proceedings of the HPDC '20: The 29th International Symposium on High-Performance Parallel and Distributed Computing, 2020

HPC Workload Characterization Using Feature Selection and Clustering.

Proceedings of the 3rd International Workshop on Systems and Network Telemetry and Analytics, 2020

Uncovering Access, Reuse, and Sharing Characteristics of I/O-Intensive Files on Large-Scale Production HPC Systems.

Proceedings of the 18th USENIX Conference on File and Storage Technologies, 2020

2019

Optimizing I/O Performance of HPC Applications with Autotuning.

ACM Trans. Parallel Comput., 2019

Parallel membership queries on very large scientific data sets using bitmap indexes.

Concurr. Comput. Pract. Exp., 2019

SLOPE: Structural Locality-Aware Programming Model for Composing Array Data Analysis.

Proceedings of the High Performance Computing - 34th International Conference, 2019

Terabyte-scale Particle Data Analysis: An ArrayUDF Case Study.

Proceedings of the 31st International Conference on Scientific and Statistical Database Management, 2019

Enabling Transparent Asynchronous I/O using Background Threads.

Proceedings of the IEEE/ACM Fourth International Parallel Data Systems Workshop, 2019

Revisiting I/O behavior in large-scale storage systems: the expected and the unexpected.

Proceedings of the International Conference for High Performance Computing, 2019

Sparse Data Management in HDF5.

Proceedings of the 1st IEEE/ACM Annual Workshop on Large-scale Experiment-in-the-Loop Computing, 2019

Understanding Data Motion in the Modern HPC Data Center.

Proceedings of the IEEE/ACM Fourth International Parallel Data Systems Workshop, 2019

Active Learning-based Automatic Tuning and Prediction of Parallel I/O Performance.

Proceedings of the IEEE/ACM Fourth International Parallel Data Systems Workshop, 2019

MIQS: metadata indexing and querying service for self-describing file formats.

Proceedings of the International Conference for High Performance Computing, 2019

Exploring Metadata Search Essentials for Scientific Data Management.

Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

Analysis in the Data Path of an Object-Centric Data Management System.

Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

Tuning Object-Centric Data Management Systems for Large Scale Scientific Applications.

Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

A Zoom-in Analysis of I/O Logs to Detect Root Causes of I/O Performance Bottlenecks.

Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

DCA-IO: A Dynamic I/O Control Scheme for Parallel and Distributed File Systems.

Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018

A year in the life of a parallel file system.

Proceedings of the International Conference for High Performance Computing, 2018

Evaluation of HPC Application I/O on Object Storage Systems.

Kristy A. Kallback-Rose

Damian Hazen

Prabhat

Proceedings of the 3rd IEEE/ACM International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems, 2018

ArrayBridge: Interweaving Declarative Array Processing in SciDB with Imperative HDF5-Based Programs.

Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Toward Transparent Data Management in Multi-Layer Storage Hierarchy of HPC Systems.

Bharti Wadhwa

Nadathur Rajagopalan Satish

Ali Raza Butt

Proceedings of the 2018 IEEE International Conference on Cloud Engineering, 2018

IOMiner: Large-Scale Analytics Framework for Gaining Knowledge from I/O Logs.

Proceedings of the IEEE International Conference on Cluster Computing, 2018

UniviStor: Integrated Hierarchical and Distributed Storage for HPC.

Proceedings of the IEEE International Conference on Cluster Computing, 2018

A Transparent Server-Managed Object Storage System for HPC.

Proceedings of the IEEE International Conference on Cluster Computing, 2018

Toward Scalable and Asynchronous Object-Centric Data Management for HPC.

Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018

ARCHIE: Data Analysis Acceleration with Array Caching in Hierarchical Storage.

Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

DART: distributed adaptive radix tree for efficient affix-based keyword search on HPC systems.

Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques, 2018

2017

ArrayBridge: Interweaving declarative array processing with high-performance computing.

CoRR, 2017

UMAMI: a recipe for generating meaningful metrics through holistic I/O performance analysis.

Proceedings of the 2nd Joint International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems, 2017

ArrayUDF: User-Defined Scientific Data Analysis on Arrays.

Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing, 2017

SoMeta: Scalable Object-Centric Metadata Management for High Performance Computing.

Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016

AMR-aware in situ indexing and scalable querying.

Proceedings of the 24th High Performance Computing Symposium, 2016

PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures.

Md. Mostofa Ali Patwary

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

In Situ Storage Layout Optimization for AMR Spatio-temporal Read Accesses.

Proceedings of the 45th International Conference on Parallel Processing, 2016

SDS-Sort: Scalable Dynamic Skew-aware Parallel Sorting.

Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

Data Elevator: Low-Contention Data Movement in Hierarchical Storage System.

Proceedings of the 23rd IEEE International Conference on High Performance Computing, 2016

AMRZone: A Runtime AMR Data Sharing Framework for Scientific Applications.

Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

Usage Pattern-Driven Dynamic Data Layout Reorganization.

Kristofer E. Bouchard

Scott Klasky

Nagiza F. Samatova

Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

Exploring memory hierarchy and network topology for runtime AMR data sharing across scientific applications.

Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

2015

Towards Exascale Scientific Metadata Management.

Spyros Blanas

CoRR, 2015

Techniques for modeling large-scale HPC I/O workloads.

Christopher D. Carothers

Proceedings of the 6th International Workshop on Performance Modeling, 2015

BD-CATS: big data clustering at trillion particle scale.

Md. Mostofa Ali Patwary

Nadathur Rajagopalan Satish

Proceedings of the International Conference for High Performance Computing, 2015

Heavy-tailed distribution of parallel I/O system response time.

Proceedings of the 10th Parallel Data Storage Workshop, 2015

Pattern-driven parallel I/O tuning.

Proceedings of the 10th Parallel Data Storage Workshop, 2015

Collective Computing for Scientific Big Data Analysis.

Jialin Liu

Proceedings of the 44th International Conference on Parallel Processing Workshops, 2015

A Multiplatform Study of I/O Behavior on Petascale Supercomputers.

Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

Dynamic Model-Driven Parallel I/O Performance Tuning.

Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Parallel In Situ Detection of Connected Components in Adaptive Mesh Refinement Data.

Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

TECA: Petascale Pattern Recognition for Climate Science.

Proceedings of the Computer Analysis of Images and Patterns, 2015

Security for the scientific data services framework.

Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

Spatially clustered join on heterogeneous scientific data sets.

Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014

Parallel data analysis directly on scientific file formats.

Proceedings of the International Conference on Management of Data, 2014

Model-Driven Data Layout Selection for Improving Read Performance.

Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Simplifying index file structure to improve I/O performance of parallel indexing.

Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Improving parallel I/O autotuning with performance modeling.

Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Parallel query evaluation as a Scientific Data Service.

Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014

2013

Why high performance visual data analytics is both relevant and difficult.

Proceedings of the Visualization and Data Analysis 2013, 2013

Optimizing fastquery performance on lustre file system.

Proceedings of the Conference on Scientific and Statistical Database Management, 2013

SDS: a framework for scientific data services.

Proceedings of the 8th Parallel Data Storage Workshop, 2013

Taming parallel I/O complexity with auto-tuning.

Proceedings of the International Conference for High Performance Computing, 2013

A framework for auto-tuning HDF5 applications.

Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013

Expediting scientific data analysis with reorganization of data.

Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

Segmented analysis for reducing data movement.

Jialin Liu

Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012

TECA: A Parallel Toolkit for Extreme Climate Analysis.

Proceedings of the International Conference on Computational Science, 2012

Parallel I/O, analysis, and visualization of a trillion particle simulation.

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Abstract: Auto-Tuning of Parallel IO Parameters for HDF5 Applications.

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Boosting Application-Specific Parallel I/O Optimization Using IOSIG.

Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012

2011

Special issue on Data Intensive Computing.

J. Parallel Distributed Comput., 2011

Energy-Aware Workload Consolidation on GPU.

Dong Li

Srimat T. Chakradhar

Proceedings of the 2011 International Conference on Parallel Processing Workshops, 2011

2010

Data-aware scheduling of legacy kernels on heterogeneous platforms with distributed memory.

Proceedings of the SPAA 2010: Proceedings of the 22nd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2010

Exploiting the forgiving nature of applications for scalable parallel execution.

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Best-effort semantic document search on GPUs.

Proceedings of the 3rd Workshop on General Purpose Processing on Graphics Processing Units, 2010

2009

Special Issue of the Journal of Parallel and Distributed Computing: Data-Intensive Computing.

J. Parallel Distributed Comput., 2009

Taxonomy of Data Prefetching for Multicore Processors.

J. Comput. Sci. Technol., 2009

Core-aware memory access scheduling schemes.

Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Modeling Data Access Contention in Multicore Architectures.

Donald J. Holmgren

Proceedings of the 15th IEEE International Conference on Parallel and Distributed Systems, 2009

2008

Hiding I/O latency with pre-execution prefetching for parallel applications.

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Parallel I/O prefetching using MPI file caching and I/O signatures.

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

A Taxonomy of Data Prefetching Mechanisms.
