Haryadi S. Gunawi

Affiliations:
  • University of Chicago


According to our database1, Haryadi S. Gunawi authored at least 61 papers between 2003 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Performance Bug Analysis and Detection for Distributed Storage and Computing Systems.
ACM Trans. Storage, August, 2023

Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays.
ACM Trans. Storage, February, 2023

Towards Continually Learning Application Performance Models.
CoRR, 2023

Design Considerations and Analysis of Multi-Level Erasure Coding in Large-Scale Data Centers.
Proceedings of the International Conference for High Performance Computing, 2023

RHIK: Re-configurable Hash-based Indexing for KVSSD.
Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, 2023

CNT: Semi-Automatic Translation from CWL to Nextflow for Genomic Workflows.
Proceedings of the 23rd IEEE International Conference on Bioinformatics and Bioengineering, 2023

EVStore: Storage and Caching Capabilities for Scaling Embedding Tables in Deep Recommendation Systems.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Fantastic SSD internals and how to learn and use them.
Proceedings of the SYSTOR '22: The 15th ACM International Systems and Storage Conference, Haifa, Israel, June 13, 2022

Layered Contention Mitigation for Cloud Storage.
Proceedings of the IEEE 15th International Conference on Cloud Computing, 2022

2021
Experiences in Managing the Performance and Reliability of a Large-Scale Genomics Cloud Platform.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

lODA: A Host/Device Co-Design for Strong Predictability Contract on Modern Flash Storage.
Proceedings of the SOSP '21: ACM SIGOPS 28th Symposium on Operating Systems Principles, 2021

2020
Lessons Learned from the Chameleon Testbed.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

Fractional-Overlap Declustered Parity: Evaluating Reliability for Storage Systems.
Proceedings of the Fifth IEEE/ACM International Parallel Data Systems Workshop, 2020

LinnOS: Predictability on Unpredictable Flash Storage with a Light Neural Network.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Extreme Protection Against Data Loss with Single-Overlap Declustered Parity.
Proceedings of the 50th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, 2020

LeapIO: Efficient and Portable Virtual NVMe Storage on ARM SoCs.
Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

2019
Introduction to the Special Section on the 2018 USENIX Annual Technical Conference (ATC'18).
ACM Trans. Storage, 2019

IASO: A Fail-Slow Detection and Mitigation Framework for Distributed Storage Services.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

E2E: embracing user heterogeneity to improve quality of experience on the web.
Proceedings of the ACM Special Interest Group on Data Communication, 2019

DFix: automatically fixing timing bugs in distributed systems.
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019

ScaleCheck: A Single-Machine Approach for Discovering Scalability Bugs in Large Distributed Systems.
Proceedings of the 17th USENIX Conference on File and Storage Technologies, 2019

FlyMC: Highly Scalable Testing of Complex Interleavings in Distributed Systems.
Proceedings of the Fourteenth EuroSys Conference 2019, Dresden, Germany, March 25-28, 2019, 2019

2018
Fail-Slow at Scale: Evidence of Hardware Performance Faults in Large Production Systems.
login Usenix Mag., 2018

Fail-Slow at Scale: Evidence of Hardware Performance Faults in Large Production Systems.
ACM Trans. Storage, 2018

The CASE of FEMU: Cheap, Accurate, Scalable and Extensible Flash Emulator.
Proceedings of the 16th USENIX Conference on File and Storage Technologies, 2018

Fail-Slow at Scale: Evidence of Hardware Performance Faults in Large Production Systems.
Proceedings of the 16th USENIX Conference on File and Storage Technologies, 2018

Pcatch: automatically detecting performance cascading bugs in cloud systems.
Proceedings of the Thirteenth EuroSys Conference, 2018

StrongBox: Confidentiality, Integrity, and Performance using Stream Ciphers for Full Drive Encryption.
Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017
Tiny-Tail Flash: Near-Perfect Elimination of Garbage Collection Tail Latencies in NAND SSDs.
ACM Trans. Storage, 2017

MittOS: Supporting Millisecond Tail Tolerance with Fast Rejecting SLO-Aware OS Interface.
Proceedings of the 26th Symposium on Operating Systems Principles, 2017

Scalability Bugs: When 100-Node Testing is Not Enough.
Proceedings of the 16th Workshop on Hot Topics in Operating Systems, 2017

Exploring the Challenges and Opportunities of Cloud Stacks in Dynamic Resource Environments.
Proceedings of the 3rd IEEE International Conference on Collaboration and Internet Computing, 2017

Resilient cloud in dynamic resource environments.
Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

PBSE: a robust path-based speculative execution for degraded-network tail tolerance in data-parallel frameworks.
Proceedings of the 2017 Symposium on Cloud Computing, SoCC 2017, Santa Clara, CA, USA, 2017

DCatch: Automatically Detecting Distributed Concurrency Bugs in Cloud Systems.
Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

2016
Manylogs: Improved CMR/SMR disk bandwidth and faster durability with scattered logs.
Proceedings of the 32nd Symposium on Mass Storage Systems and Technologies, 2016

The Tail at Store: A Revelation from Millions of Hours of Disk and SSD Deployments.
Proceedings of the 14th USENIX Conference on File and Storage Technologies, 2016

Why Does the Cloud Stop Computing? Lessons from Hundreds of Service Outages.
Proceedings of the Seventh ACM Symposium on Cloud Computing, 2016

TaxDC: A Taxonomy of Non-Deterministic Concurrency Bugs in Datacenter Distributed Systems.
Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems, 2016

2015
What Bugs Live in the Cloud?: A Study of Issues in Scalable Distributed Systems.
login Usenix Mag., 2015

SAMC: a fast model checker for finding heisenbugs in distributed systems (demo).
Proceedings of the 2015 International Symposium on Software Testing and Analysis, 2015

Towards Pre-Deployment Detection of Performance Failures in Cloud Distributed Systems.
Proceedings of the 7th USENIX Workshop on Hot Topics in Cloud Computing, 2015

2014
SAMC: Semantic-Aware Model Checking for Fast Discovery of Deep Bugs in Cloud Systems.
Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation, 2014

The Case for Drill-Ready Cloud Computing.
Proceedings of the ACM Symposium on Cloud Computing, 2014

What Bugs Live in the Cloud? A Study of 3000+ Issues in Cloud Systems.
Proceedings of the ACM Symposium on Cloud Computing, 2014

2013
Impact of Limpware on HDFS: A Probabilistic Estimation.
CoRR, 2013

The Case for Limping-Hardware Tolerant Clouds.
Proceedings of the 5th USENIX Workshop on Hot Topics in Cloud Computing, 2013

HARDFS: hardening HDFS with selective and lightweight versioning.
Proceedings of the 11th USENIX conference on File and Storage Technologies, 2013

Limplock: understanding the impact of limpware on scale-out cloud systems.
Proceedings of the ACM Symposium on Cloud Computing, SOCC '13, 2013

2011
PREFAIL: a programmable tool for multiple-failure injection.
Proceedings of the 26th Annual ACM SIGPLAN Conference on Object-Oriented Programming, 2011

FATE and DESTINI: A Framework for Cloud Recovery Testing.
Proceedings of the 8th USENIX Symposium on Networked Systems Design and Implementation, 2011

2010
Impact of disk corruption on open-source DBMS.
Proceedings of the 26th International Conference on Data Engineering, 2010

Towards Automatically Checking Thousands of Failures with Micro-specifications.
Proceedings of the Sixth Workshop on Hot Topics in System Dependability, 2010

2009
Error propagation analysis for file systems.
Proceedings of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2009

2008
SQCK: A Declarative File System Checker.
Proceedings of the 8th USENIX Symposium on Operating Systems Design and Implementation, 2008

EIO: Error Handling is Occasionally Correct.
Proceedings of the 6th USENIX Conference on File and Storage Technologies, 2008

2007
Improving file system reliability with I/O shepherding.
Proceedings of the 21st ACM Symposium on Operating Systems Principles 2007, 2007

2005
IRON file systems.
Proceedings of the 20th ACM Symposium on Operating Systems Principles 2005, 2005

Deconstructing Commodity Storage Clusters.
Proceedings of the 32st International Symposium on Computer Architecture (ISCA 2005), 2005

2004
Deploying Safe User-Level Network Services with icTCP.
Proceedings of the 6th Symposium on Operating System Design and Implementation (OSDI 2004), 2004

2003
Transforming policies into mechanisms with infokernel.
Proceedings of the 19th ACM Symposium on Operating Systems Principles 2003, 2003


  Loading...