We stand with Ukraine

We stand with Ukraine

Michael Klemm

Orcid: 0000-0002-8634-4634

According to our database¹, Michael Klemm authored at least 49 papers between 1979 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

Automatically Parallelizing Batch Inference on Deep Neural Networks Using Fiats and Fortran 2023 "Do Concurrent".

[DOI]

Damian W. I. Rouson

,

,

,

,

Ethan D. Gutmann

,

,

Katherine Rasmussen

,

Brad Richardson

,

,

David J. Torres

,

Proceedings of the High Performance Computing, 2025

Implementing OpenMP Offload Support in the AMD Next Generation Fortran Compiler.

[DOI]

Dominik Adamski

,

,

,

Pranav Bhandarkar

,

,

Andrew Gozillon

,

,

,

Proceedings of the SC '25 Workshops of the International Conference for High Performance Computing, 2025

Demonstrating OpenMP<sup>®</sup> Offload Performance with the STREAmS-2 Application and the AMD Next-Gen Fortran Compiler.

[DOI]

,

Francesco Salvadore

,

Proceedings of the OpenMP: Balancing Productivity and Performance Portability, 2025

2024

Detrimental task execution patterns in mainstream OpenMP runtimes.

[DOI]

,

Tobias Weinzierl

,

CoRR, 2024

Detrimental Task Execution Patterns in Mainstream OpenMP<sup>®</sup> Runtimes.

[DOI]

,

Tobias Weinzierl

,

Proceedings of the Advancing OpenMP for Future Accelerators, 2024

2023

Quantum Task Offloading with the OpenMP API.

[DOI]

Joseph K. L. Lee

,

Oliver Thomson Brown

,

,

Martin Ruefenacht

,

Johannes Doerfert

,

,

CoRR, 2023

2022

Evaluating GPU Programming Models for the LUMI Supercomputer.

[DOI]

George S. Markomanolis

,

,

,

,

Nicholas Malaya

,

Aniello Esposito

,

,

Sergey I. Bastrakov

,

Alexander Debus

,

,

Klaus Steiniger

,

,

,

Michael Bussmann

Proceedings of the Supercomputing Frontiers - 7th Asian Conference, 2022

2019

Toward a Standard Interface for User-Defined Scheduling in OpenMP.

[DOI]

,

Christian Iwainsky

,

,

Jonas H. Müller Korndörfer

,

Florina M. Ciorba

Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

2018

The Ongoing Evolution of OpenMP.

[DOI]

Bronis R. de Supinski

,

Thomas R. W. Scogland

,

Alejandro Duran

,

,

Sergi Mateo Bellido

,

Stephen L. Olivier

,

Christian Terboven

,

Timothy G. Mattson

Proc. IEEE, 2018

Assessing Task-to-Data Affinity in the LLVM OpenMP Runtime.

[DOI]

Jannis Klinkenberg

,

Philipp Samfass

,

Christian Terboven

,

Alejandro Duran

,

,

,

,

Stephen L. Olivier

,

Matthias S. Müller

Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

Visualization of OpenMP* Task Dependencies Using Intel® Advisor - Flow Graph Analyzer.

[DOI]

Vishakha Agrawal

,

Michael J. Voss

,

,

Vasanth Tovinkere

,

Jeff R. Hammond

,

Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

2017

KART - A Runtime Compilation Library for Improving HPC Application Performance.

[DOI]

,

,

Georg Zitzlsberger

,

,

Proceedings of the High Performance Computing, 2017

Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel® Xeon Phi™ Processor.

[DOI]

Eric J. Bylaska

,

Mathias Jacquelin

,

Wibe A. de Jong

,

Jeff R. Hammond

,

Proceedings of the High Performance Computing, 2017

A Pattern for Overlapping Communication and Computation with OpenMP ^* Target Directives.

[DOI]

,

,

,

Christian Terboven

,

Matthias S. Müller

Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

OpenMP ^* SIMD Vectorization and Threading of the Elmer Finite Element Software.

[DOI]

,

,

,

Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

Performance Optimization of OpenFOAM* on Clusters of Intel® Xeon Phi (TM) Processors.

[DOI]

,

,

,

,

Proceedings of the 24th IEEE International Conference on High Performance Computing Workshops, 2017

2016

Using the pyMIC Offload Module in PyFR.

[DOI]

,

Freddie D. Witherden

,

Peter E. Vincent

CoRR, 2016

Approaches for Task Affinity in OpenMP.

[DOI]

Christian Terboven

,

,

,

,

Alejandro Duran

,

,

Stephen L. Olivier

,

Bronis R. de Supinski

Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

Recent Processor Technologies and Co-Scheduling.

[DOI]

,

Christopher Dahnken

Proceedings of the Co-Scheduling of HPC Applications [extended versions of all papers from COSH@HiPEAC 2016, 2016

Portable SIMD Performance with OpenMP* 4.x Compiler Directives.

[DOI]

,

,

,

,

Chris J. Newburn

,

Georg Zitzlsberger

Proceedings of the Euro-Par 2016: Parallel Processing, 2016

2015

Performance Evaluation of OpenFOAM* with MPI-3 RMA Routines on Intel® Xeon® Processors and Intel® Xeon Phi™ Coprocessors.

[DOI]

Nishant Agrawal

,

,

,

,

,

Rihab Abdul Razak

Proceedings of the 22nd European MPI Users' Group Meeting, 2015

On the Algorithmic Aspects of Using OpenMP Synchronization Mechanisms II: User-Guided Speculative Locks.

[DOI]

Barna L. Bihari

,

,

,

,

Christian Terboven

,

Lori A. Diachin

Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

Packet-Oriented Streamline Tracing on Modern SIMD Architectures.

[DOI]

Bernd Hentschel

,

Jens Henrik Göbbert

,

,

,

,

Torsten W. Kuhlen

Proceedings of the 15th Eurographics Symposium on Parallel Graphics and Visualization, 2015

2014

Efficient Implementation of Many-Body Quantum Chemical Methods on the Intel® Xeon Phi Coprocessor.

[DOI]

,

,

Proceedings of the International Conference for High Performance Computing, 2014

A User-Guided Locking API for the OpenMP* Application Program Interface.

[DOI]

,

,

,

Christian Terboven

Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

2013

A Proposal for Task-Generating Loops in OpenMP.

[DOI]

,

,

,

Xavier Martorell

,

Stephen L. Olivier

,

Christian Terboven

Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

2012

From GPGPU to Many-Core: Nvidia Fermi and Intel Many Integrated Core Architecture.

[DOI]

Alexander Heinecke

,

,

Hans-Joachim Bungartz

Comput. Sci. Eng., 2012

OpenMP Programming on Intel Xeon Phi Coprocessors: An Early Performance Comparison.

,

,

,

Proceedings of the Many-core Applications Research Community (MARC) Symposium at RWTH Aachen University, 2012

Extending OpenMP* with Vector Constructs for Modern Multicore SIMD Architectures.

[DOI]

,

Alejandro Duran

,

,

,

Diego Caballero

,

Xavier Martorell

Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

Performance of a Structure-Detecting SpMV Using the CSR Matrix Representation.

[DOI]

,

,

Proceedings of the 11th International Symposium on Parallel and Distributed Computing, 2012

The Intel® Many Integrated Core Architecture.

[DOI]

Alejandro Duran

,

Proceedings of the 2012 International Conference on High Performance Computing & Simulation, 2012

2011

Towards High-Performance Implementations of a Custom HPC Kernel Using ® Array Building Blocks.

[DOI]

Alexander Heinecke

,

,

,

Proceedings of the Facing the Multicore - Challenge II, 2011

Extending a Highly Parallel Data Mining Algorithm to the Intel ® Many Integrated Core Architecture.

[DOI]

Alexander Heinecke

,

,

,

,

Hans-Joachim Bungartz

Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

2010

Towards an Error Model for OpenMP.

[DOI]

,

,

Alejandro Duran

,

,

,

Bronis R. de Supinski

,

Andrey Churbanov

Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010

A Proposal for User-Defined Reductions in OpenMP.

[DOI]

Alejandro Duran

,

,

,

Bronis R. de Supinski

,

Eduard Ayguadé

Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010

JCudaMP: OpenMP/Java on CUDA.

[DOI]

,

,

Proceedings of the 3rd International Workshop on Multicore Software Engineering, 2010

2009

Reparallelization and migration of OpenMP applications in grid environments.

[DOI]

PhD thesis, 2009

Reparallelization techniques for migrating OpenMP codes in computational grids.

[DOI]

,

Matthias Bezold

,

,

,

Michael Philippsen

Concurr. Comput. Pract. Exp., 2009

A meta-predictor framework for prefetching in object-based DSMs.

[DOI]

Jean Christophe Beyler

,

,

Philippe Clauss

,

Michael Philippsen

Concurr. Comput. Pract. Exp., 2009

Dynamic code footprint optimization for the IBM Cell Broadband Engine.

[DOI]

,

Tobias Flossmann

,

,

,

,

Michael Philippsen

Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering, 2009

2008

Automatic Prefetching with Binary Code Rewriting in Object-Based DSMs.

[DOI]

Jean Christophe Beyler

,

,

Michael Philippsen

,

Philippe Clauss

Proceedings of the Euro-Par 2008, 2008

2007

JaMP: an implementation of OpenMP for a Java DSM.

[DOI]

,

Matthias Bezold

,

,

Michael Philippsen

Concurr. Comput. Pract. Exp., 2007

Esodyp+: Prefetching in the Jackal Software DSM.

[DOI]

,

Jean Christophe Beyler

,

Ronny T. Lampert

,

Michael Philippsen

,

Philippe Clauss

Proceedings of the Euro-Par 2007, 2007

Reparallelization and Migration of OpenMP Programs.

[DOI]

,

Matthias Bezold

,

,

,

Michael Philippsen

Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

2006

A Proposal for OpenMP for Java.

[DOI]

,

,

Matthias Bezold

,

Michael Philippsen

Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2006

1991

Über die Schnittzahlen mehrfach balancierter blockpläne.

[DOI]

J. Comb. Theory A, 1991

1986

Über den <i>p</i>-rang von inzidenzmatrizen.

[DOI]

J. Comb. Theory A, 1986

1984

Über die Wurzelschranke für das Minimalgewicht von Codes.

[DOI]

J. Comb. Theory A, 1984

1979

A matrix of combinatorial numbers related to the symmetric groups.

[DOI]

,

Discret. Math., 1979

Loading...