Ahmed E. Helal

According to our database1, Ahmed E. Helal authored at least 13 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation.
CoRR, 2024

2023
Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

2022
Efficient, out-of-memory sparse MTTKRP on massively parallel architectures.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

2021
ALTO: adaptive linearized storage of sparse tensors.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

2019
Adaptive Task Aggregation for High-Performance Sparse Solvers on GPUs.
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018
A Composable Workflow for Productive Heterogeneous Computing on FPGAs via Whole-Program Analysis and Transformation.
Proceedings of the 2018 International Conference on ReConFigurable Computing and FPGAs, 2018

Exploring FPGA-specific Optimizations for Irregular OpenCL Applications.
Proceedings of the 2018 International Conference on ReConFigurable Computing and FPGAs, 2018

CommAnalyzer: automated estimation of communication cost and scalability on HPC clusters from sequential code.
Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018

2017
AutoMatch: An automated framework for relative performance estimation and workload distribution on heterogeneous HPC systems.
Proceedings of the 2017 IEEE International Symposium on Workload Characterization, 2017

2016
MetaMorph: a library framework for interoperable kernels on multi- and many-core clusters.
Proceedings of the International Conference for High Performance Computing, 2016

Bridging the Performance-Programmability Gap for FPGAs via OpenCL: A Case Study with OpenDwarfs.
Proceedings of the 24th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2016

2015
High Performance Sparse LU Solver FPGA Accelerator Using a Static Synchronous Data Flow Model.
Proceedings of the 23rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2015

Parallel circuit simulation using the direct method on a heterogeneous cloud.
Proceedings of the 52nd Annual Design Automation Conference, 2015


  Loading...