Wolfgang Karl

Affiliations:
  • KIT


According to our database, Wolfgang Karl authored at least 118 papers between 1990 and 2020.

Bibliography

2020
Memristoren für zukünftige Rechnersysteme.
Inform. Spektrum, 2020

2019
Evaluating Dynamic Task Scheduling in a Task-Based Runtime System for Heterogeneous Architectures.
Proceedings of the Architecture of Computing Systems - ARCS 2019, 2019

2018
A Transparent View on Approximate Computing Methods for Tuning Applications.
Proceedings of the High Performance Computing, 2018

2016
An Energy-Efficient Middleware for Computation Offloading in Real-Time Embedded Systems.
Proceedings of the 22nd IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, 2016

FPGA-accelerated Richardson-Lucy deconvolution for 3D image data.
Proceedings of the 13th IEEE International Symposium on Biomedical Imaging, 2016

Reducing Energy Consumption of Data Transfers Using Runtime Data Type Conversion.
Proceedings of the Architecture of Computing Systems - ARCS 2016, 2016

2015
Combined hardware-software multi-parallel prefiltering on the Convey HC-1 for fast homology detection.
Parallel Comput., 2015

Automatic task mapping and heterogeneity-aware fault tolerance: The benefits for runtime optimization and application development.
J. Syst. Archit., 2015

Interdisciplinary Practical Course on Parallel Finite Element Method Using HiFlow³.
Proceedings of the Euro-Par 2015: Parallel Processing Workshops, 2015

2014
Evaluating the Self-Optimization Process of the Adaptive Memory Management Architecture Self-aware Memory.
CoRR, 2014

Heterogeneity-aware Fault Tolerance using a Self-Organizing Runtime System.
CoRR, 2014

An Architecture Framework for Porting Applications to FPGAs.
Proceedings of the ARCS 2014, 2014

Evaluation of Adaptive Memory Management Techniques on the Tilera TILE-Gx Platform.
Proceedings of the ARCS 2014, 2014

2013
Self-aware Memory: an adaptive memory management system for upcoming manycore architectures and its decentralized self-optimization process.
Des. Autom. Embed. Syst., 2013

Multi-parallel prefiltering on the convey HC-1 for supporting homology detection.
Proceedings of the 20th European MPI Users' Group Meeting, 2013

Evaluation of Two Formulations of the Conjugate Gradients Method with Transactional Memory.
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

Topic 4: High-Performance Architectures and Compilers - (Introduction).
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

A Data-Driven Approach for Executing the CG Method on Reconfigurable High-Performance Systems.
Proceedings of the Architecture of Computing Systems - ARCS 2013, 2013

2012
Seamlessly portable applications: Managing the diversity of modern heterogeneous systems.
ACM Trans. Archit. Code Optim., 2012

A survey on hardware-aware and heterogeneous computing on multicore processors and accelerators.
Concurr. Comput. Pract. Exp., 2012

Software Transactional Memory, OpenMP and Pthread Implementations of the Conjugate Gradients Method - A Preliminary Evaluation.
Proceedings of the High Performance Computing for Computational Science, 2012

What scientific applications can benefit from hardware transactional memory?
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Realizing a Proactive, Self-Optimizing System Behavior within Adaptive, Heterogeneous Many-Core Architectures.
Proceedings of the Sixth IEEE International Conference on Self-Adaptive and Self-Organizing Systems, 2012

Capturing Transactional Memory Application's Behavior - The Prerequisite for Performance Analysis.
Proceedings of the Multicore Software Engineering, Performance, and Tools, 2012

A Low-Overhead Profiling and Visualization Framework for Hybrid Transactional Memory.
Proceedings of the 2012 IEEE 20th Annual International Symposium on Field-Programmable Custom Computing Machines, 2012

A Scalable Monitoring Infrastructure for Self-Organizing Many-Core Architectures.
Proceedings of the 15th Euromicro Conference on Digital System Design, 2012

2011
An Intuitive Framework for Accessing Computing Clouds.
Proceedings of the International Conference on Computational Science, 2011

Konrad Zuse.
Inform. Spektrum, 2011

Digital On-demand Computing Organism - Interaction between Monitoring and Middleware.
Proceedings of the 14th IEEE International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing, 2011

Cost-aware function migration in heterogeneous systems.
Proceedings of the High Performance Embedded Architectures and Compilers, 2011

Introduction.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

Economic learning for thermal-aware power budgeting in many-core architectures.
Proceedings of the 9th International Conference on Hardware/Software Codesign and System Synthesis, 2011

Compiler-Assisted Selection of a Software Transactional Memory System.
Proceedings of the Architecture of Computing Systems - ARCS 2011, 2011

A Light-Weight Approach for Online State Classification of Self-organizing Parallel Systems.
Proceedings of the Architecture of Computing Systems - ARCS 2011, 2011

Monitoring and Self-awareness for Heterogeneous, Adaptive Computing Systems.
Proceedings of the Organic Computing - A Paradigm Shift for Complex Systems, 2011

DodOrg - A Self-adaptive Organic Many-core Architecture.
Proceedings of the Organic Computing - A Paradigm Shift for Complex Systems, 2011

2010
From source code to runtime behaviour: Software metrics help to select the computer architecture.
Knowl. Based Syst., 2010

Cyberaide onServe: Software as a Service on Production Grids.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Thread Creation for Self-aware Parallel Systems.
Proceedings of the Facing the Multicore-Challenge, 2010

Delivering Guidance Information in Heterogeneous Systems.
Proceedings of the ARCS '10, 2010

Extending a Light-weight Runtime System by Dynamic Instrumentation for Performance Evaluation.
Proceedings of the ARCS '10, 2010

2009
An Embrace-and-Extend Approach to Managing the Complexity of Future Heterogeneous Systems.
Proceedings of the Embedded Computer Systems: Architectures, 2009

Pervasive University - Anwendungsszenarien, technische Voraussetzungen und Perspektiven.
Proceedings of the 39. Jahrestagung der Gesellschaft für Informatik, Im Focus das Leben, INFORMATIK 2009, Lübeck, Germany, September 28, 2009

Introduction.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

Cyberaide Virtual Applicance: On-Demand Deploying Middleware for Cyberinfrastructure.
Proceedings of the Cloud Computing - First International Conference, 2009

A Light-Weight Approach to Dynamical Runtime Linking Supporting Heterogenous, Parallel, and Reconfigurable Architectures.
Proceedings of the Architecture of Computing Systems, 2009

A Seamless Virtualization Approach for Transparent Dynamical Function Mapping Targeting Heterogeneous and Reconfigurable Systems.
Proceedings of the Reconfigurable Computing: Architectures, 2009

2008
Design Aspects of Self-Organizing Heterogeneous Multi-Core Architectures (Entwurfsaspekte selbstorganisierender, heterogener Multicore-Architekturen).
it Inf. Technol., 2008

Performance Advantage of Reconfigurable Cache Design on Multicore Processor Systems.
Int. J. Parallel Program., 2008

Evaluating the Cache Architecture of Multicore Processors.
Proceedings of the 16th Euromicro International Conference on Parallel, 2008

An Organic Computing Approach to Sustained Real-time Monitoring.
Proceedings of the Biologically-Inspired Collaborative Computing, 2008

Guided Prefetching Based on Runtime Access Patterns.
Proceedings of the Computational Science, 2008

Scientific Cloud Computing: Early Definition and Experience.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008

On-Demand Build a Virtual e-Science Workflow.
Proceedings of the Workshops at the Grid and Pervasive Computing Conference, 2008

A Generic Tool Supporting Cache Designs and Optimisation on Shared Memory Systems.
Proceedings of the 9th Workshop on Parallel Systems and Algorithms (PASA) held at the 21st Conference on the Architecture of Computing Systems (ARCS), 2008

Adaptive Cache Infrastructure: Supporting Dynamic Program Changes following Dynamic Program Behavior.
Proceedings of the 9th Workshop on Parallel Systems and Algorithms (PASA) held at the 21st Conference on the Architecture of Computing Systems (ARCS), 2008

Grid Virtualization Engine: Providing Virtual Resources for Grid Infrastructure.
Proceedings of the 9th Workshop on Parallel Systems and Algorithms (PASA) held at the 21st Conference on the Architecture of Computing Systems (ARCS), 2008

Self-aware Memory: Managing Distributed Memory in an Autonomous Multi-master Environment.
Proceedings of the Architecture of Computing Systems, 2008

2007
A Run-time Reconfigurable Cache Architecture.
Proceedings of the Parallel Computing: Architectures, 2007

CMP Cache Architecture and the OpenMP Performance.
Proceedings of the A Practical Programming Model for the Multi-Core Era, 2007

Optimizing Cache Performance of the Discrete Wavelet Transform Using a Visualization Tool.
Proceedings of the Ninth IEEE International Symposium on Multimedia, 2007

An Interactive Graphical Environment for Code Optimization.
Proceedings of the Computational Science - ICCS 2007, 7th International Conference, Beijing, China, May 27, 2007

A Profiling Tool for Detecting Cache-Critical Data Structures.
Proceedings of the Euro-Par 2007, 2007

2006
Detailed cache simulation for detecting bottleneck, miss reason and optimization potentialities.
Proceedings of the 1st International Conference on Performance Evaluation Methodologies and Tools, 2006

Analysis of the Spatial and Temporal Locality in Data Accesses.
Proceedings of the Computational Science, 2006

Supporting Cache Locality Optimization with a Toolset.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Topic 7: Parallel Computer Architecture and Instruction Level Parallelism.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Automatic Data Locality Optimization Through Self-optimization.
Proceedings of the Self-Organizing Systems, First International Workshop, 2006

A network agent for diagnosis and analysis of real-time Ethernet networks.
Proceedings of the 2006 International Conference on Compilers, 2006

Performance Evaluation of Adaptive Caching Schemes.
Proceedings of the ARCS 2006, 2006

Digital On-Demand Computing Organism for Real-Time Systems.
Proceedings of the ARCS 2006, 2006

2005
Simulation as a tool for optimizing memory accesses on NUMA machines.
Perform. Evaluation, 2005

Rechnerarchitektur in Deutschland.
it Inf. Technol., 2005

Monitoring cache behavior on parallel SMP architectures and related programming tools.
Future Gener. Comput. Syst., 2005

Comprehensive Cache Inspection with Hardware Monitors.
Proceedings of the Parallel Computing Technologies, 2005

Implementing an OpenMP Execution Environment on InfiniBand Clusters.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

Optimization-Oriented Visualization of Cache Access Behavior.
Proceedings of the Computational Science, 2005

CacheIn: A Toolset for Comprehensive Cache Inspection.
Proceedings of the Computational Science, 2005

YACO: A User Conducted Visualization Tool for Supporting Cache Optimization.
Proceedings of the High Performance Computing and Communications, 2005

2004
SIMT/OMP: A Toolset to Study and Exploit Memory Locality of OpenMP Applications on NUMA Architectures.
Proceedings of the Shared Memory Parallel Programming with OpenMP, 2004

Impact of Cache Coherence Models on Performance of OpenMP Applications.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Topic 8: Parallel Computer Architecture and Instruction-Level Parallelism.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

On the Cache Access Behavior of OpenMP Applications.
Proceedings of the ARCS 2004, 2004

2003
ARS: an adaptive runtime system for locality optimization.
Future Gener. Comput. Syst., 2003

SMiLE: an integrated, multi-paradigm software infrastructure for SCI-based clusters.
Future Gener. Comput. Syst., 2003

A Simulation Tool for Evaluating Shared Memory Systems.
Proceedings of the 36th Annual Simulation Symposium (ANSS-36 2003), Orlando, Florida, USA, March 30, 2003

2002
Memory access behavior analysis of NUMA-based shared memory programs.
Sci. Program., 2002

A Comprehensive Electric Field Simulation Environment on Top of SCI.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 9th European PVM/MPI Users' Group Meeting, Linz, Austria, September 29, 2002

Boosting the Performance of Electromagnetic Simulations on a PC-Cluster.
Proceedings of the 2002 International Conference on Parallel Computing in Electrical Engineering (PARELEC 2002), 2002

A proposal for a new hardware cache monitoring architecture.
Proceedings of The Workshop on Memory Systems Performance (MSP 2002), 2002

Improving Data Locality Using Dynamic Page Migration Based on Memory Access Histograms.
Proceedings of the Computational Science - ICCS 2002, 2002

SMiLE: An Integrated, Multi-Paradigm Software Infrastructure for SCI-Based Clusters.
Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2002), 2002

2001
Monitoring Concepts for Parallel Systems - An Evolution towards Interoperable Tool Environments.
Scalable Comput. Pract. Exp., 2001

OpenSESAME: An Intuitive Dependability Modeling Environment Supporting Inter-Component Dependencies.
Proceedings of the 8th Pacific Rim International Symposium on Dependable Computing (PRDC 2001), 2001

SCI-Based LINUX PC-Clusters as a Platform for Electromagnetic Field Calculations.
Proceedings of the Parallel Computing Technologies, 2001

Visualizing the Memory Access Behavior of Shared Memory Applications on NUMA Architectures.
Proceedings of the Computational Science - ICCS 2001, 2001

Meeting the Computational Demands of Nuclear Medical Imaging Using Commodity Clusters.
Proceedings of the Computational Science - ICCS 2001, 2001

2000
Cache-Aware Multigrid Methods for Solving Poisson's Equation in Two Dimensions.
Computing, 2000

Electrical phenomena during Hot Swap events.
Proceedings of the 2000 Pacific Rim International Symposium on Dependable Computing (PRDC 2000), 2000

Numerical Calculation of Electromagnetic Problems on an SCI Based PC-Cluster.
Proceedings of the 2000 International Conference on Parallel Computing in Electrical Engineering (PARELEC 2000), 2000

Using the SMiLE Monitoring Infrastructure to Detect and Lower the Inefficiency of Parallel Applications.
Proceedings of the High-Performance Computing and Networking, 8th International Conference, 2000

NEPHEW: Applying a Toolset for the Efficient Deployment of a Medical Image Application on SCI-Based Clusters.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

Multilayer Online-Monitoring for Hybrid DSM Systems on Top of PC Clusters with a SMiLE.
Proceedings of the Computer Performance Evaluation: Modelling Techniques and Tools, 2000

1999
SCI Monitoring Hardware and Software: Supporting Performance Evaluation and Debugging.
Proceedings of the SCI: Scalable Coherent Interface, 1999

The TUM/SCI Adapter.
Proceedings of the SCI: Scalable Coherent Interface, 1999

Memory Characteristics of Iterative Methods.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1999

Parallel Computing on PC Clusters - An Alternative to Supercomputers for Industrial Applications.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 1999

Supporting Shared Memory and Message Passing on Clusters of PCs with a SMiLE.
Proceedings of the Network-Based Parallel Computing: Communication, 1999

Optimizing Data Locality for SCI-Based PC-Clusters with the SMiLE Monitoring Approach.
Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques, 1999

1998
JVX - A Rapid Prototyping System Based on Java and FPGAs.
Proceedings of the Field-Programmable Logic and Applications, 1998

PCI-SCI Protocol Translations: Applying Microprogramming Concepts to FPGAs.
Proceedings of the Field-Programmable Logic and Applications, 1998

Exploiting Spatial and Temporal Locality of Accesses: A New Hardware-Based Monitoring Approach for DSM Systems.
Proceedings of the Euro-Par '98 Parallel Processing, 1998

1997
Fast Communication Mechanisms - Coupling Hardware Distributed Shared Memory and User-Level Messaging.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1997

Sicherheit und Effizienz in einer Active-Message-Kommunikationsschicht.
Proceedings of the Architektur von Rechensystemen, Arbeitsteilige Systemarchitekturen: Konzepte, Lösungen, Anwendungen, Trends, 1997

1995
Architektur und Technologie von Mikroprozessoren.
Informationstechnik Tech. Inform., 1995

1993
Some Design Aspects for VLIW Architectures Exploiting Fine-Grained Parallelism.
Proceedings of the PARLE '93, 1993

1992
Architektureigenschaften und Parallelisierungsmethoden für Rechner mit Funktionspipelining.
PhD thesis, 1992

1990
Evaluierung von Architekturparametern verschiedener Rechnerstrukturen mit Hilfe von CAE-Workstations.
Proceedings of the Architektur von Rechensystemen, 1990

