We stand with Ukraine

We stand with Ukraine

Evangelos Georganas

Orcid: 0009-0007-8738-3532

According to our database¹, Evangelos Georganas authored at least 41 papers between 2012 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Space Filling Curves is All You Need: Communication-Avoiding Matrix Multiplication Made Simple.

[DOI]

Evangelos Georganas

,

Alexander Heinecke

,

CoRR, January, 2026

2025

Pushing the Envelope of LLM Inference on AI-PC.

[DOI]

Evangelos Georganas

,

Dhiraj D. Kalamkar

,

Alexander Heinecke

CoRR, August, 2025

DECA: A Near-Core LLM Decompression Accelerator Supporting Out-of-Order Invocation.

[DOI]

Gerasimos Gerogiannis

,

,

Evangelos Georganas

,

,

Josep Torrellas

CoRR, May, 2025

ML-SpecQD: Multi-Level Speculative Decoding with Quantized Drafts.

[DOI]

Evangelos Georganas

,

Dhiraj D. Kalamkar

,

Alexander Kozlov

,

Alexander Heinecke

CoRR, March, 2025

DECA: A Near-Core LLM Decompression Accelerator Grounded on a 3D Roofline Model.

[DOI]

Gerasimos Gerogiannis

,

,

Evangelos Georganas

,

,

Josep Torrellas

Proceedings of the 58th IEEE/ACM International Symposium on Microarchitecture, 2025

2024

Towards a high-performance AI compiler with upstream MLIR.

[DOI]

,

Lorenzo Chelini

,

Adam Siemieniuk

,

,

Niranjan Hasabnis

,

,

Evangelos Georganas

,

Alexander Heinecke

CoRR, 2024

Harnessing Deep Learning and HPC Kernels via High-Level Loop and Tensor Abstractions on CPU Architectures.

[DOI]

Evangelos Georganas

,

Dhiraj D. Kalamkar

,

,

,

,

,

Alexander Breuer

,

Alexander Heinecke

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023

Harnessing Deep Learning and HPC Kernels via High-Level Loop and Tensor Abstractions on CPU Architectures.

[DOI]

Evangelos Georganas

,

Dhiraj D. Kalamkar

,

,

,

,

Alexander Breuer

,

Alexander Heinecke

CoRR, 2023

2022

Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning and HPC Workloads.

[DOI]

Evangelos Georganas

,

Dhiraj D. Kalamkar

,

Sasikanth Avancha

,

Menachem Adelman

,

Deepti Aggarwal

,

Cristina Anderson

,

Alexander Breuer

,

Jeremy Bruestle

,

Narendra Chaudhary

,

,

,

,

,

,

Ramanarayan Mohanty

,

,

,

,

Alexander Heinecke

Frontiers Appl. Math. Stat., 2022

FPGA-based AI Smart NICs for Scalable Distributed AI Training Systems.

[DOI]

,

Evangelos Georganas

,

Alexander Heinecke

,

,

Eriko Nurvitadhi

CoRR, 2022

FPGA-Based AI Smart NICs for Scalable Distributed AI Training Systems.

[DOI]

,

Evangelos Georganas

,

Alexander Heinecke

,

,

,

Eriko Nurvitadhi

IEEE Comput. Archit. Lett., 2022

Accelerating Deep Learning based Identification of Chromatin Accessibility from noisy ATAC-seq Data.

[DOI]

Narendra Chaudhary

,

,

Dhiraj D. Kalamkar

,

Alexander Heinecke

,

Evangelos Georganas

,

,

Menachem Adelman

,

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

2021

Efficient and Generic 1D Dilated Convolution Layer for Deep Learning.

[DOI]

Narendra Chaudhary

,

,

Dhiraj D. Kalamkar

,

Alexander Heinecke

,

Evangelos Georganas

,

,

Menachem Adelman

,

CoRR, 2021

Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning Workloads.

[DOI]

Evangelos Georganas

,

Dhiraj D. Kalamkar

,

Sasikanth Avancha

,

Menachem Adelman

,

Cristina Anderson

,

Alexander Breuer

,

Narendra Chaudhary

,

,

,

,

Ramanarayan Mohanty

,

,

,

Alexander Heinecke

CoRR, 2021

DistGNN: scalable distributed training for large-scale graph neural networks.

[DOI]

,

,

,

Ramanarayan Mohanty

,

Evangelos Georganas

,

Alexander Heinecke

,

Dhiraj D. Kalamkar

,

Nesreen K. Ahmed

,

Sasikanth Avancha

Proceedings of the International Conference for High Performance Computing, 2021

Tensor processing primitives: a programming abstraction for efficiency and portability in deep learning workloads.

[DOI]

Evangelos Georganas

,

Dhiraj D. Kalamkar

,

Sasikanth Avancha

,

Menachem Adelman

,

Cristina Anderson

,

Alexander Breuer

,

Jeremy Bruestle

,

Narendra Chaudhary

,

,

,

,

,

,

Ramanarayan Mohanty

,

,

,

Alexander Heinecke

Proceedings of the International Conference for High Performance Computing, 2021

Towards Flexible and Compiler-Friendly Layer Fusion for CNNs on Multicore CPUs.

[DOI]

,

Evangelos Georganas

,

Proceedings of the Euro-Par 2021: Parallel Processing, 2021

2020

The Parallelism Motifs of Genomic Data Analysis.

[DOI]

Katherine A. Yelick

,

,

,

,

,

,

Saliya Ekanayake

,

,

Evangelos Georganas

,

,

Steven A. Hofmeyr

,

,

Cristina Teodoropol

,

CoRR, 2020

Optimizing deep learning recommender systems training on CPU cluster architectures.

[DOI]

Dhiraj D. Kalamkar

,

Evangelos Georganas

,

Sudarshan Srinivasan

,

,

Mikhail Shiryaev

,

Alexander Heinecke

Proceedings of the International Conference for High Performance Computing, 2020

Harnessing Deep Learning via a Single Building Block.

[DOI]

Evangelos Georganas

,

,

Dhiraj D. Kalamkar

,

Sasikanth Avancha

,

,

Michael J. Anderson

,

,

,

Alexander Heinecke

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

2019

Optimizing Deep Learning RNN Topologies on Intel Architecture.

[DOI]

,

Evangelos Georganas

,

Dhiraj D. Kalamkar

,

,

,

Cristina Anderson

,

Alexander Heinecke

Supercomput. Front. Innov., 2019

High-Performance Deep Learning via a Single Building Block.

[DOI]

Evangelos Georganas

,

,

Dhiraj D. Kalamkar

,

Sasikanth Avancha

,

,

Michael J. Anderson

,

,

,

Alexander Heinecke

CoRR, 2019

A Study of BFLOAT16 for Deep Learning Training.

[DOI]

Dhiraj D. Kalamkar

,

Dheevatsa Mudigere

,

Naveen Mellempudi

,

,

,

Sasikanth Avancha

,

Dharma Teja Vooturi

,

Nataraj Jammalamadaka

,

,

,

,

,

Alexander Heinecke

,

Evangelos Georganas

,

Sudarshan Srinivasan

,

,

Misha Smelyanskiy

,

,

CoRR, 2019

Training Google Neural Machine Translation on an Intel CPU Cluster.

[DOI]

Dhiraj D. Kalamkar

,

,

Sudarshan Srinivasan

,

Srinivas Sridharan

,

Evangelos Georganas

,

Mikhail E. Smorkalov

,

,

Alexander Heinecke

Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

ISA mapper: a compute and hardware agnostic deep learning compiler.

[DOI]

Matthew Sotoudeh

,

,

Michael J. Anderson

,

Evangelos Georganas

,

Alexander Heinecke

,

Proceedings of the 16th ACM International Conference on Computing Frontiers, 2019

2018

Extreme scale de novo metagenome assembly.

[DOI]

Evangelos Georganas

,

,

Steven A. Hofmeyr

,

Eugene Goltsman

,

,

,

,

,

Katherine A. Yelick

Proceedings of the International Conference for High Performance Computing, 2018

Anatomy of high-performance deep learning convolutions on SIMD architectures.

[DOI]

Evangelos Georganas

,

Sasikanth Avancha

,

,

Dhiraj D. Kalamkar

,

,

,

Alexander Heinecke

Proceedings of the International Conference for High Performance Computing, 2018

Mixed Precision Training of Convolutional Neural Networks using Integer Operations.

[DOI]

,

Naveen Mellempudi

,

Dheevatsa Mudigere

,

Dhiraj D. Kalamkar

,

Sasikanth Avancha

,

,

Srinivas Sridharan

,

Karthik Vaidyanathan

,

,

Evangelos Georganas

,

Alexander Heinecke

,

,

,

Nikita Shustrov

,

,

Evarist Fomenko

,

Vadim O. Pirogov

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Extreme-Scale De Novo Genome Assembly.

[DOI]

Evangelos Georganas

,

Steven A. Hofmeyr

,

,

,

,

,

Katherine A. Yelick

CoRR, 2017

A New Parallel Research Kernel to Expand Research on Dynamic Load-Balancing Capabilities.

[DOI]

Rob F. Van der Wijngaart

,

Evangelos Georganas

,

Timothy G. Mattson

,

Andrew M. Wissink

Proceedings of the High Performance Computing - 32nd International Conference, 2017

MerBench: PGAS Benchmarks for High Performance Genome Assembly.

[DOI]

Evangelos Georganas

,

,

,

Steven A. Hofmeyr

,

,

,

,

Katherine A. Yelick

Proceedings of PAW@SC 2017: Second Annual PGAS Applications Workshop, 2017

Performance Characterization of De Novo Genome Assembly on Leading Parallel Systems.

[DOI]

,

Evangelos Georganas

,

,

Steven A. Hofmeyr

,

,

,

,

Katherine A. Yelick

Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

2016

Scalable Parallel Algorithms for Genome Analysis.

[DOI]

Evangelos Georganas

PhD thesis, 2016

Design and Implementation of a Parallel Research Kernel for Assessing Dynamic Load-Balancing Capabilities.

[DOI]

Evangelos Georganas

,

Rob F. Van der Wijngaart

,

Timothy G. Mattson

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

2015

HipMer: an extreme-scale de novo genome assembler.

[DOI]

Evangelos Georganas

,

,

,

Steven A. Hofmeyr

,

Chaitanya Aluru

,

,

,

,

Katherine A. Yelick

Proceedings of the International Conference for High Performance Computing, 2015

merAligner: A Fully Parallel Sequence Aligner.

[DOI]

Evangelos Georganas

,

,

,

,

,

Katherine A. Yelick

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014

Scalable multimedia content analysis on parallel platforms using python.

[DOI]

Ekaterina Gonina

,

Gerald Friedland

,

Eric Battenberg

,

Penporn Koanantakool

,

Michael B. Driscoll

,

Evangelos Georganas

,

ACM Trans. Multim. Comput. Commun. Appl., 2014

Constructing Performance Models for Dense Linear Algebra Algorithms on Cray XE Systems.

[DOI]

Jorge González-Domínguez

,

Evangelos Georganas

,

,

María J. Martín

CoRR, 2014

Parallel De Bruijn Graph Construction and Traversal for De Novo Genome Assembly.

[DOI]

Evangelos Georganas

,

,

,

,

,

Katherine A. Yelick

Proceedings of the International Conference for High Performance Computing, 2014

2013

A Communication-Optimal N-Body Algorithm for Direct Interactions.

[DOI]

Michael B. Driscoll

,

Evangelos Georganas

,

Penporn Koanantakool

,

Edgar Solomonik

,

Katherine A. Yelick

Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

2012

Communication avoiding and overlapping for numerical linear algebra.

[DOI]

Evangelos Georganas

,

Jorge González-Domínguez

,

Edgar Solomonik

,

,

,

Katherine A. Yelick

Proceedings of the SC Conference on High Performance Computing Networking, 2012

Loading...