We stand with Ukraine

We stand with Ukraine

Khaled Z. Ibrahim

Orcid: 0009-0004-5362-3612

According to our database¹, Khaled Z. Ibrahim authored at least 71 papers between 2001 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

GPU acceleration of non-equilibrium Green's function calculation using OpenACC and CUDA FORTRAN.

[DOI]

,

Khaled Z. Ibrahim

,

,

,

,

CoRR, May, 2025

Scalable training of trustworthy and energy-efficient predictive graph foundation models for atomistic materials modeling: a case study with HydraGNN.

[DOI]

Massimiliano Lupo Pasini

,

,

,

,

David M. Rogers

,

,

Khaled Z. Ibrahim

,

,

,

,

Prasanna Balaprakash

J. Supercomput., March, 2025

DyG-DPCD: A Distributed Parallel Community Detection Algorithm for Large-Scale Dynamic Graphs.

[DOI]

Naw Safrin Sattar

,

Khaled Z. Ibrahim

,

,

Shaikh Arifuzzaman

Int. J. Parallel Program., February, 2025

Parallelizing autotuning for HPC applications: Unveiling the potential of the speculation strategy in Bayesian optimization.

[DOI]

Adrián Pérez Diéguez

,

,

,

,

,

Khaled Z. Ibrahim

Int. J. High Perform. Comput. Appl., 2025

VAN-DAMME: GPU-accelerated and symmetry-assisted quantum optimal control of multi-qubit systems.

[DOI]

Jose M. Rodriguez Borbon

,

,

Adrián Pérez Diéguez

,

Khaled Z. Ibrahim

,

Comput. Phys. Commun., 2025

Accelerating I/O in Scientific Workflows with the Impact of Apache Ignite's In-Memory File System.

[DOI]

Vijayalakshmi Saravanan

,

Sai Karthik Navuluru

,

,

Khaled Z. Ibrahim

Proceedings of the High Performance Computing, 2025

Optimizing Nuclear Configuration Interaction Calculations on GPUs: A Comparative Performance Study of Programming Models.

[DOI]

Abdullah Alperen

,

,

Khaled Z. Ibrahim

,

,

,

,

Hasan Metin Aktulga

Proceedings of the ISC High Performance 2025 Research Paper Proceedings (40th International Conference), 2025

AdaptiveStream: A Scalable Distributed Framework for Real-Time Change Point Detection in Data Streams.

[DOI]

Vijayalakshmi Saravanan

,

Sai Karthik Navuluru

,

Khaled Z. Ibrahim

,

Proceedings of the IEEE International Conference on Data Mining, 2025

Uniconn: A Uniform High-Level Communication Library for Portable Multi-GPU Programming.

[DOI]

,

Sinan Ekmekçibasi

,

Khaled Z. Ibrahim

,

,

Proceedings of the IEEE International Conference on Cluster Computing, 2025

2024

QRCODE: Massively parallelized real-time time-dependent density functional theory for periodic systems.

[DOI]

,

,

Adrián Pérez Diéguez

,

,

Khaled Z. Ibrahim

,

Comput. Phys. Commun., 2024

TRAVOLTA: GPU acceleration and algorithmic improvements for constructing quantum optimal control fields in photo-excited systems.

[DOI]

Jose M. Rodriguez Borbon

,

,

Adrián Pérez Diéguez

,

Khaled Z. Ibrahim

,

Comput. Phys. Commun., 2024

Scalable Training of Graph Foundation Models for Atomistic Materials Modeling: A Case Study with HydraGNN.

[DOI]

Massimiliano Lupo Pasini

,

,

,

,

David M. Rogers

,

,

Khaled Z. Ibrahim

,

,

,

,

Prasanna Balaprakash

CoRR, 2024

An Evaluation of Real-time Adaptive Sampling Change Point Detection Algorithm using KCUSUM.

[DOI]

Vijayalakshmi Saravanan

,

,

,

Hubertus Van Dam

,

,

Christopher Kelly

,

Khaled Z. Ibrahim

CoRR, 2024

A Systematic Study of Parallelization Strategies for Optimizing Scientific Computing Performance Bounds.

[DOI]

Vijayalakshmi Saravanan

,

Sai Karthik Navuluru

,

Khaled Z. Ibrahim

Proceedings of the 37th IEEE International System-on-Chip Conference, 2024

MDLoader: A Hybrid Model-Driven Data Loader for Distributed Graph Neural Network Training.

[DOI]

,

,

Massimiliano Lupo Pasini

,

,

,

Khaled Z. Ibrahim

Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024

Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality.

[DOI]

Adrián Pérez Diéguez

,

,

,

,

,

Khaled Z. Ibrahim

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

MDLoader: A Hybrid Model-driven Data Loader for Distributed Deep Neural Networks Training.

[DOI]

,

,

Massimiliano Lupo Pasini

,

,

Khaled Z. Ibrahim

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023

Exploring temporal community evolution: algorithmic approaches and parallel optimization for dynamic community detection.

[DOI]

Naw Safrin Sattar

,

,

Khaled Z. Ibrahim

,

Shaikh Arifuzzaman

Appl. Netw. Sci., December, 2023

2022

Enhancing scalability of a matrix-free eigensolver for studying many-body localization.

[DOI]

Roel Van Beeumen

,

Khaled Z. Ibrahim

,

Gregory D. Kahanamoku-Meyer

,

,

Int. J. High Perform. Comput. Appl., 2022

ML-based Performance Portability for Time-Dependent Density Functional Theory in HPC Environments.

[DOI]

Adrián Pérez Diéguez

,

,

,

,

Khaled Z. Ibrahim

Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022

Performance Portability of Sparse Block Diagonal Matrix Multiple Vector Multiplications on GPUs.

[DOI]

Khaled Z. Ibrahim

,

,

Proceedings of the IEEE/ACM International Workshop on Performance, 2022

Preprocessing Pipeline Optimization for Scientific Deep Learning Workloads.

[DOI]

Khaled Z. Ibrahim

,

Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

2021

Architectural Requirements for Deep Learning Workloads in HPC Environments.

[DOI]

Khaled Z. Ibrahim

,

,

,

,

,

,

,

Nicholas J. Wright

,

Samuel Williams

Proceedings of the 2021 International Workshop on Performance Modeling, 2021

Performance Modeling and Tuning for DFT Calculations on Heterogeneous Architectures.

[DOI]

,

David B. Williams-Young

,

Khaled Z. Ibrahim

,

Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

CSPACER: A Reduced API Set Runtime for the Space Consistency Model.

[DOI]

Khaled Z. Ibrahim

Proceedings of the HPC Asia 2021: The International Conference on High Performance Computing in Asia-Pacific Region, 2021

2020

Tuning floating-point precision using dynamic program information and temporal locality.

[DOI]

,

,

Khaled Z. Ibrahim

,

,

Proceedings of the International Conference for High Performance Computing, 2020

Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches.

[DOI]

Taylor L. Groves

,

,

,

Khaled Z. Ibrahim

,

,

Nicholas J. Wright

,

Samuel Williams

,

Katherine A. Yelick

Proceedings of the 2020 IEEE/ACM Performance Modeling, 2020

2019

Modern gyrokinetic particle-in-cell simulation of fusion plasmas on top supercomputers.

[DOI]

,

Stéphane Ethier

,

William M. Tang

,

Khaled Z. Ibrahim

,

,

Samuel Williams

,

Int. J. High Perform. Comput. Appl., 2019

Performance analysis of deep learning workloads using roofline trajectories.

[DOI]

M. Haseeb Javed

,

Khaled Z. Ibrahim

,

CCF Trans. High Perform. Comput., 2019

Toward a Programmable Analysis and Visualization Framework for Interactive Performance Analytics.

[DOI]

Tanzima Z. Islam

,

,

,

Khaled Z. Ibrahim

Proceedings of the IEEE/ACM International Workshop on Programming and Performance Visualization Tools, 2019

Optimizing Breadth-First Search at Scale Using Hardware-Accelerated Space Consistency.

[DOI]

Khaled Z. Ibrahim

Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

Performance Analysis of GPU Programming Models Using the Roofline Scaling Trajectories.

[DOI]

Khaled Z. Ibrahim

,

Samuel Williams

,

Proceedings of the Benchmarking, Measuring, and Optimizing, 2019

2018

Roofline Scaling Trajectories: A Method for Parallel Application and Architectural Performance Analysis.

[DOI]

Khaled Z. Ibrahim

,

Samuel Williams

,

Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

2017

Cross-scale efficient tensor contractions for coupled cluster computations through multiple programming model backends.

[DOI]

Khaled Z. Ibrahim

,

Evgeny Epifanovsky

,

Samuel Williams

,

J. Parallel Distributed Comput., 2017

Reaching bandwidth saturation using transparent injection parallelization.

[DOI]

Nicholas Chaimov

,

Khaled Z. Ibrahim

,

Samuel Williams

,

Int. J. High Perform. Comput. Appl., 2017

APHiD: Hierarchical Task Placement to Enable a Tapered Fat Tree Topology for Lower Power and Cost in HPC Networks.

[DOI]

George Michelogiannakis

,

Khaled Z. Ibrahim

,

,

Jeremiah J. Wilke

,

,

Joseph P. Kenny

Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

2016

Scaling Spark on Lustre.

[DOI]

Nicholas Chaimov

,

Allen D. Malony

,

,

Khaled Z. Ibrahim

Proceedings of the High Performance Computing, 2016

Extreme scale plasma turbulence simulations on top supercomputers worldwide.

[DOI]

William M. Tang

,

,

Stéphane Ethier

,

Grzegorz Kwasniewski

,

Torsten Hoefler

,

Khaled Z. Ibrahim

,

,

Samuel Williams

,

,

Carlos Rosales-Fernandez

,

Timothy J. Williams

Proceedings of the International Conference for High Performance Computing, 2016

Characterizing the Performance of Hybrid Memory Cube Using ApexMAP Application Probes.

[DOI]

Khaled Z. Ibrahim

,

Farzad Fatollahi-Fard

,

,

Proceedings of the Second International Symposium on Memory Systems, 2016

Scaling Spark on HPC Systems.

[DOI]

Nicholas Chaimov

,

Allen D. Malony

,

,

,

Khaled Z. Ibrahim

,

Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

2015

Exploiting communication concurrency on high performance computing systems.

[DOI]

Nicholas Chaimov

,

Khaled Z. Ibrahim

,

Samuel Williams

,

Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

2014

The Case for Partitioning Virtual Machines on Multicore Architectures.

[DOI]

Khaled Z. Ibrahim

,

Steven A. Hofmeyr

,

IEEE Trans. Parallel Distributed Syst., 2014

Efficient Interoperability of OpenSHMEM on Multicore Architectures.

[DOI]

Khaled Z. Ibrahim

Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

An Evaluation of One-Sided and Two-Sided Communication Paradigms on Relaxed-Ordering Interconnect.

[DOI]

Khaled Z. Ibrahim

,

,

,

Katherine A. Yelick

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

On the conditions for efficient interoperability with threads: an experience with PGAS languages using cray communication domains.

[DOI]

Khaled Z. Ibrahim

,

Katherine A. Yelick

Proceedings of the 2014 International Conference on Supercomputing, 2014

Analysis and tuning of libtensor framework on multicore architectures.

[DOI]

Khaled Z. Ibrahim

,

Samuel W. Williams

,

Evgeny Epifanovsky

,

Proceedings of the 21st International Conference on High Performance Computing, 2014

2013

Analysis and optimization of gyrokinetic toroidal simulations on homogenous and heterogenous platforms.

[DOI]

Khaled Z. Ibrahim

,

,

Samuel Williams

,

,

Stéphane Ethier

,

Int. J. High Perform. Comput. Appl., 2013

Kinetic turbulence simulations at extreme scale on leadership-class systems.

[DOI]

,

Stéphane Ethier

,

William M. Tang

,

Timothy J. Williams

,

Khaled Z. Ibrahim

,

,

Samuel Williams

,

Proceedings of the International Conference for High Performance Computing, 2013

2012

Code Development of High-Performance Applications for Power-Efficient Architectures.

[DOI]

Khaled Z. Ibrahim

Proceedings of the Handbook of Energy-Aware and Green Computing - Two Volume Set., 2012

Poster: Advances in Gyrokinetic Particle in Cell Simulation for Fusion Plasmas to Extreme Scale.

[DOI]

,

Stéphane Ethier

,

William M. Tang

,

Khaled Z. Ibrahim

,

,

Samuel W. Williams

,

,

Timothy J. Williams

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Advances in Gyrokinetic Particle in Cell Simulation for Fusion Plasmas to Extreme Scale.

[DOI]

,

Stéphane Ethier

,

William M. Tang

,

Khaled Z. Ibrahim

,

,

Samuel W. Williams

,

,

Timothy J. Williams

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Congestion avoidance on manycore high performance computing systems.

[DOI]

,

Dhabaleswar K. Panda

,

Khaled Z. Ibrahim

,

Proceedings of the International Conference on Supercomputing, 2012

Concurrent Phase Classification for Accelerating MPSoC Simulation.

[DOI]

,

Khaled Z. Ibrahim

,

Proceedings of the ARCS 2012 Workshops, 28. Februar - 2. März 2012, München, Germany, 2012

2011

Gyrokinetic particle-in-cell optimization on emerging multi- and manycore platforms.

[DOI]

,

,

Khaled Z. Ibrahim

,

Samuel Williams

,

Stéphane Ethier

,

Parallel Comput., 2011

Gyrokinetic toroidal simulations on leading multi- and manycore HPC systems.

[DOI]

,

Khaled Z. Ibrahim

,

Samuel Williams

,

,

Stéphane Ethier

,

,

Proceedings of the Conference on High Performance Computing Networking, 2011

Optimized pre-copy live migration for memory intensive applications.

[DOI]

Khaled Z. Ibrahim

,

Steven A. Hofmeyr

,

,

Proceedings of the Conference on High Performance Computing Networking, 2011

Characterizing the Performance of Parallel Applications on Multi-socket Virtual Machines.

[DOI]

Khaled Z. Ibrahim

,

Steven A. Hofmeyr

,

Proceedings of the 11th IEEE/ACM International Symposium on Cluster, 2011

2010

Parallel application sampling for accelerating MPSoC simulation.

[DOI]

,

Khaled Z. Ibrahim

,

Des. Autom. Embed. Syst., 2010

Characterizing the Relation Between Apex-Map Synthetic Probes and Reuse Distance Distributions.

[DOI]

Khaled Z. Ibrahim

,

Erich Strohmaier

Proceedings of the 39th International Conference on Parallel Processing, 2010

Bridging the gap between complex software paradigms and power-efficient parallel architectures.

[DOI]

Khaled Z. Ibrahim

Proceedings of the International Green Computing Conference 2010, 2010

2009

Power-Aware Bus Coscheduling for Periodic Realtime Applications Running on Multiprocessor SoC.

[DOI]

Khaled Z. Ibrahim

,

Trans. High Perform. Embed. Archit. Compil., 2009

Efficient SIMDization and data management of the Lattice QCD computation on the Cell Broadband Engine.

[DOI]

Khaled Z. Ibrahim

,

François Bodin

Sci. Program., 2009

2008

Fine-grained parallelization of lattice QCD kernel routine on GPUs.

[DOI]

Khaled Z. Ibrahim

,

François Bodin

,

J. Parallel Distributed Comput., 2008

Implementing Wilson-Dirac operator on the cell broadband engine.

[DOI]

Khaled Z. Ibrahim

,

François Bodin

Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Multi-granularity sampling for simulating concurrent heterogeneous applications.

[DOI]

,

Khaled Z. Ibrahim

,

Proceedings of the 2008 International Conference on Compilers, 2008

2007

Adaptive Sampling for Efficient MPSoC Architecture Simulation.

[DOI]

,

Khaled Z. Ibrahim

,

Proceedings of the 15th International Symposium on Modeling, 2007

2005

Correlation between Detailed and Simplified Simulations in Studying Multiprocessor Architecture.

[DOI]

Khaled Z. Ibrahim

Proceedings of the 23rd International Conference on Computer Design (ICCD 2005), 2005

Efficient Architectural Support for Secure Bus-Based Shared Memory Multiprocessor.

[DOI]

Khaled Z. Ibrahim

Proceedings of the Advances in Computer Systems Architecture, 10th Asia-Pacific Conference, 2005

2003

Extending OpenMP to Support Slipstream Execution Mode.

[DOI]

Khaled Z. Ibrahim

,

Gregory T. Byrd

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Slipstream Execution Mode for CMP-Based Multiprocessors.

[DOI]

Khaled Z. Ibrahim

,

Gregory T. Byrd

,

Proceedings of the Ninth International Symposium on High-Performance Computer Architecture (HPCA'03), 2003

2001

On the Exploitation of Value Predication and Producer Identification to Reduce Barrier Synchronization Time.

[DOI]

Khaled Z. Ibrahim

,

Gregory T. Byrd

Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

Loading...