Mostafa Mahmoud

Orcid: 0000-0002-8950-6221

According to our database1, Mostafa Mahmoud authored at least 28 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Schrodinger's FP Training Neural Networks with Dynamic Floating-Point Containers.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

Marple: Scalable Spike Sorting for Untethered Brain-Machine Interfacing.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

Atalanta: A Bit is Worth a "Thousand" Tensor Values.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Power Level Control of Nuclear Power Plants During Load Following Operation Using Fractional Order Controller Based on a Modified Algorithm.
IEEE Access, 2023

2022
Schrödinger's FP: Dynamic Adaptation of Floating-Point Containers for Deep Learning Training.
CoRR, 2022

APack: Off-Chip, Lossless Data Compression for Efficient Deep Learning Inference.
CoRR, 2022

Mokey: enabling narrow fixed-point inference for out-of-the-box floating-point transformer models.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

2021
Boveda: Building an On-Chip Deep Learning Memory Hierarchy Brick by Brick.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

FPRaker: A Processing Element For Accelerating Neural Network Training.
Proceedings of the MICRO '21: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021

2020
TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training and Inference.
CoRR, 2020

TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

Late Breaking Results: Building an On-Chip Deep Learning Memory Hierarchy Brick by Brick.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019
Accelerating Image-Sensor-Based Deep Learning Applications.
IEEE Micro, 2019

ShapeShifter: Enabling Fine-Grain Data Width Adaptation in Deep Learning.
Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Characterizing Sources of Ineffectual Computations in Deep Learning Networks.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2019

Laconic deep learning inference acceleration.
Proceedings of the 46th International Symposium on Computer Architecture, 2019

Bit-Tactical: A Software/Hardware Approach to Exploiting Value and Bit Sparsity in Neural Networks.
Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

2018
Laconic Deep Learning Computing.
CoRR, 2018

Bit-Tactical: Exploiting Ineffectual Computations in Convolutional Neural Networks: Which, Why, and How.
CoRR, 2018

Identifying and Exploiting Ineffectual Computations to Enable Hardware Acceleration of Deep Learning.
Proceedings of the 16th IEEE International New Circuits and Systems Conference, 2018

Diffy: a Déjà vu-Free Differential Deep Neural Network Accelerator.
Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

Memory Requirements for Convolutional Neural Network Hardware Accelerators.
Proceedings of the 2018 IEEE International Symposium on Workload Characterization, 2018

Characterizing Sources of Ineffectual Computations in Deep Learning Networks.
Proceedings of the 2018 IEEE International Symposium on Workload Characterization, 2018

2017
IDEAL: image denoising accelerator.
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

2016
Memory controller design under cloud workloads.
Proceedings of the 2016 IEEE International Symposium on Workload Characterization, 2016

2008
Usability and its Role in Enhancing the Online User Experience in the Egyptian Web-Based Governmental Services Portals.
Proceedings of the WEBIST 2008, 2008

Using expert systems technology to increase agriculture production and water conservation.
Proceedings of the Third IEEE International Conference on Digital Information Management (ICDIM), 2008

1995
Experience with the Development and Deployment of Expert Systems in Agriculture.
Proceedings of the Seventh Conference on Innovative Applications of Artificial Intelligence, 1995


  Loading...