Balazs Gerofi

Orcid: 0009-0004-8585-6031

According to our database1, Balazs Gerofi authored at least 65 papers between 2010 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads.
ACM Trans. Archit. Code Optim., December, 2023

Proactive Stripe Reconstruction to Improve Cache Use Efficiency of SSD-Based RAID Systems.
ACM Trans. Embed. Comput. Syst., October, 2023

Adaptive Management With Request Granularity for DRAM Cache Inside nand-Based SSDs.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2023

KAKURENBO: Adaptively Hiding Samples in Deep Neural Network Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rep-RAID: An Integrated Approach to Optimizing Data Replication and Garbage Collection in RAID-Enabled SSDs.
Proceedings of the 24th ACM SIGPLAN/SIGBED International Conference on Languages, 2023

2022
Pattern-Based Prefetching with Adaptive Cache Management Inside of Solid-State Drives.
ACM Trans. Storage, 2022

At the Locus of Performance: A Case Study in Enhancing CPUs with Copious 3D-Stacked Cache.
CoRR, 2022

Rapid Execution Time Estimation for Heterogeneous Memory Systems Through Differential Tracing.
Proceedings of the High Performance Computing - 37th International Conference, 2022

On the Difference Between Shared Memory and Shared Address Space in HPC Communication.
Proceedings of the Supercomputing Frontiers - 7th Asian Conference, 2022

Why Globally Re-shuffle? Revisiting Data Shuffling in Large Scale Deep Learning.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

DRAM Cache Management with Request Granularity for NAND-based SSDs.
Proceedings of the 51st International Conference on Parallel Processing, 2022

Communication-Computation Overlapping for Preconditioned Parallel Iterative Solvers with Dynamic Loop Scheduling.
Proceedings of the HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops, Virtual Event Japan, January 11, 2022

Exploring Communication-Computation Overlap in Parallel Iterative Solvers on Manycore CPUs using Asynchronous Progress Control.
Proceedings of the HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops, Virtual Event Japan, January 11, 2022

2021
Mitigating Negative Impacts of Read Disturb in SSDs.
ACM Trans. Design Autom. Electr. Syst., 2021

An international survey on MPI users.
Parallel Comput., 2021

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems.
CoRR, 2021

Linux vs. lightweight multi-kernels for high performance computing: experiences at pre-exascale.
Proceedings of the International Conference for High Performance Computing, 2021


Intra-page Cache Update in SLC-mode with Partial Programming in High Density SSDs.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

A Scalability Study of Data Exchange in HPC Multi-component Workflows.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Application-Driven Requirements for Node Resource Management in Next-Generation Systems.
Proceedings of the 2020 IEEE/ACM International Workshop on Runtime and Operating Systems for Supercomputers, 2020

An Implementation of User-Level Processes using Address Space Sharing.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Parallel Multigrid Methods on Manycore Clusters with IHK/McKernel.
Proceedings of the 10th IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2019

Invited Talk 2.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

A New Age: An Overview of Multi-kernels.
Proceedings of the Operating Systems for Supercomputers and High Performance Computing, 2019

Overview: The Rise of Linux.
Proceedings of the Operating Systems for Supercomputers and High Performance Computing, 2019

Overview: The Birth of Lightweight Kernels.
Proceedings of the Operating Systems for Supercomputers and High Performance Computing, 2019

IHK/McKernel.
Proceedings of the Operating Systems for Supercomputers and High Performance Computing, 2019

Introduction to HPC Operating Systems.
Proceedings of the Operating Systems for Supercomputers and High Performance Computing, 2019

2018
Hardware Performance Variation: A Comparative Study Using Lightweight Kernels.
Proceedings of the High Performance Computing - 33rd International Conference, 2018

DTF: An I/O Arbitration Framework for Multi-component Data Processing Workflows.
Proceedings of the High Performance Computing - 33rd International Conference, 2018

On the Applicability of PEBS based Online Memory Access Tracking for Heterogeneous Memory Management at Scale.
Proceedings of the Workshop on Memory Centric High Performance Computing, 2018

Performance and Scalability of Lightweight Multi-kernel Based Operating Systems.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Process-in-process: techniques for practical address-space sharing.
Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018

PicoDriver: fast-path device drivers for multi-kernel operating systems.
Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018

2017
A flexible I/O arbitration framework for netCDF-based big data processing workflows on high-end supercomputers.
Concurr. Comput. Pract. Exp., 2017

Toward Full Specialization of the HPC Software Stack: Reconciling Application Containers and Lightweight Multi-kernels.
Proceedings of the 7th International Workshop on Runtime and Operating Systems for Supercomputers, 2017

2016
Prefetching on Storage Servers through Mining Access Patterns on Blocks.
IEEE Trans. Parallel Distributed Syst., 2016

"Big Data Assimilation" Toward Post-Petascale Severe Weather Prediction: An Overview and Progress.
Proc. IEEE, 2016

Revisiting RDMA Buffer Registration in the Context of Lightweight Multi-kernels.
Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

On the Scalability, Performance Isolation and Device Driver Transparency of the IHK/McKernel Hybrid Lightweight Kernel.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

A Multi-Kernel Survey for High-Performance Computing.
Proceedings of the 6th International Workshop on Runtime and Operating Systems for Supercomputers, 2016

Toward a General I/O Arbitration Framework for netCDF Based Big Data Processing.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

Exploring Data Migration for Future Deep-Memory Many-Core Systems.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

2015
Adaptive transport service selection for MPI with InfiniBand network.
Proceedings of the 3rd Workshop on Exascale MPI, 2015

Toward Operating System Support for Scalable Multithreaded Message Passing.
Proceedings of the 22nd European MPI Users' Group Meeting, 2015

What is a Lightweight Kernel?
Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers, 2015

Exploring the Design Space of Combining Linux with Lightweight Kernels for Extreme Scale Computing.
Proceedings of the 5th International Workshop on Runtime and Operating Systems for Supercomputers, 2015

2014
Revisiting virtual memory for high performance computing on manycore architectures: a hybrid segmentation kernel approach.
Proceedings of the 4th International Workshop on Runtime and Operating Systems for Supercomputers, 2014

CMCP: a novel page replacement policy for system level hierarchical memory management on many-cores.
Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Interface for heterogeneous kernels: A framework to enable hybrid OS designs targeting high performance computing on manycore architectures.
Proceedings of the 21st International Conference on High Performance Computing, 2014

Exploiting Hidden Non-uniformity of Uniform Memory Access on Manycore CPUs.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013
Utilizing memory content similarity for improving the performance of highly available virtual machines.
Future Gener. Comput. Syst., 2013

Revisiting rendezvous protocols in the context of RDMA-capable host channel adapters and many-core processors.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

Proposing a new task model towards many-core architecture.
Proceedings of the 1st International Workshop on Many-core Embedded Systems 2013, 2013

Partially Separated Page Tables for Efficient Operating System Assisted Hierarchical Memory Management on Heterogeneous Architectures.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013

2012
Enhancing TCP throughput of highly available virtual machines via speculative communication.
Proceedings of the 8th International Conference on Virtual Execution Environments, 2012

Poster: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Toward Operating System Assisted Hierarchical Memory Management for Heterogeneous Architectures.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

clone_n(): Parallel Thread Creation for Upcoming Many-Core Architectures.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

2011
Utilizing Memory Content Similarity for Improving the Performance of Replicated Virtual Machines.
Proceedings of the IEEE 4th International Conference on Utility and Cloud Computing, 2011

Workload Adaptive Checkpoint Scheduling of Virtual Machine Replication.
Proceedings of the 17th IEEE Pacific Rim International Symposium on Dependable Computing, 2011

RDMA Based Replication of Multiprocessor Virtual Machines over High-Performance Interconnects.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

2010
A Multi-core Approach to Providing Fault Tolerance for Non-deterministic Services.
Proceedings of The Ninth IEEE International Symposium on Networking Computing and Applications, 2010

An Efficient Process Live Migration Mechanism for Load Balanced Distributed Virtual Environments.
Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010


  Loading...