Endri Taka

Orcid: 0009-0000-5136-7580

According to our database1, Endri Taka authored at least 15 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
From Loop Nests to Silicon: Mapping AI Workloads onto AMD NPUs with MLIR-AIR.
ACM Trans. Reconfigurable Technol. Syst., June, 2026

Striking the Balance: GEMM Performance Optimization Across Generations of Ryzen™ AI NPUs.
Proceedings of the 2026 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2026

2025
Striking the Balance: GEMM Performance Optimization Across Generations of Ryzen AI NPUs.
CoRR, December, 2025

Can Asymmetric Tile Buffering Be Beneficial?
CoRR, November, 2025

Performance Analysis of GEMM Workloads on the AMD Versal Platform.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2025

GAMA: High-Performance GEMM Acceleration on AMD Versal ML-Optimized AI Engines.
Proceedings of the 35th International Conference on Field-Programmable Logic and Applications, 2025

Systolic Sparse Tensor Slices: FPGA Building Blocks for Sparse and Dense AI Acceleration.
Proceedings of the 2025 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2025

Performance Analysis of GEMM Workloads on the AMD Versal Platform.
Proceedings of the 2025 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2025

2024
ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration.
CoRR, 2024

Efficient Approaches for GEMM Acceleration on Leading AI-Optimized FPGAs.
Proceedings of the 32nd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2024

ELSA: Exploiting Layer-wise N: M Sparsity for Vision Transformer Acceleration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine.
Proceedings of the International Conference on Field Programmable Technology, 2023

2022
Improving the performance of RISC-V softcores on FPGA by exploiting PVT variability and DVFS.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

2021
Process Variability Analysis in Interconnect, Logic, and Arithmetic Blocks of 16-nm FinFET FPGAs.
ACM Trans. Reconfigurable Technol. Syst., 2021

2019
Analysis of Performance Variation in 16nm FinFET FPGA Devices.
Proceedings of the 29th International Conference on Field Programmable Logic and Applications, 2019


  Loading...