Jan Laukemann

Orcid: 0000-0002-3776-9353

According to our database1, Jan Laukemann authored at least 14 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Accelerating Sparse Tensor Decomposition Using Adaptive Linearized Representation.
CoRR, 2024

2023
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms.
Future Gener. Comput. Syst., December, 2023

CloverLeaf on Intel Multi-Core CPUs: A Case Study in Write-Allocate Evasion.
CoRR, 2023

MD-Bench: Engineering the in-core performance of short-range molecular dynamics kernels from state-of-the-art simulation packages.
CoRR, 2023

Core-Level Performance Engineering with the Open-Source Architecture Code Analyzer (OSACA) and the Compiler Explorer.
Proceedings of the Companion of the 2023 ACM/SPEC International Conference on Performance Engineering, 2023

Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

2022
Execution-Cache-Memory modeling and performance tuning of sparse matrix-vector multiplication and Lattice quantum chromodynamics on A64FX.
Concurr. Comput. Pract. Exp., 2022

Efficient, out-of-memory sparse MTTKRP on massively parallel architectures.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

2021
ECM modeling and performance tuning of SpMV and Lattice QCD on A64FX.
CoRR, 2021

ALTO: adaptive linearized storage of sparse tensors.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

2020
Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX.
Proceedings of the 2020 IEEE/ACM Performance Modeling, 2020

2019
Automatic Throughput and Critical Path Analysis of x86 and ARM Assembly Kernels.
Proceedings of the 2019 IEEE/ACM Performance Modeling, 2019

2018
Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures.
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

2017
Reproducibility report: Team SegFAUlt @ SCC 2016.
Parallel Comput., 2017


  Loading...