We stand with Ukraine

We stand with Ukraine

Frank Hannig

Orcid: 0000-0003-3663-6484

Affiliations:

University of Erlangen-Nuremberg, Germany

According to our database¹, Frank Hannig authored at least 197 papers between 2001 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

On csauthors.net:

Bibliography

2024

Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks.

[BibT_eX]

[DOI]

Christian Heidorn

,

,

Nicolai Meyerhöfer

,

Christian Schinabeck

,

,

Int. J. Parallel Program., April, 2024

SNN vs. CNN Implementations on FPGAs: An Empirical Evaluation.

[BibT_eX]

[DOI]

Patrick Plagwitz

,

,

,

Oliver Keszöcze

Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2024

2023

To Spike or Not to Spike? A Quantitative Comparison of SNN and CNN FPGA Implementations.

[BibT_eX]

[DOI]

Patrick Plagwitz

,

,

,

Oliver Keszöcze

CoRR, 2023

Robust and Tiny Binary Neural Networks using Gradient-based Explainability Methods.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 3rd Workshop on Machine Learning and Systems, 2023

2022

Special Issue on Applied Reconfigurable Computing.

[BibT_eX]

[DOI]

,

J. Signal Process. Syst., 2022

Precision- and Accuracy-Reconfigurable Processor Architectures - An Overview.

[BibT_eX]

[DOI]

,

,

Oliver Keszöcze

,

IEEE Trans. Circuits Syst. II Express Briefs, 2022

The HighPerMeshes framework for numerical algorithms on unstructured grids.

[BibT_eX]

[DOI]

,

,

,

Daniel Grünewald

,

,

,

,

Franz-Josef Pfreundt

,

Christian Plessl

,

Merlind Schotte

,

,

,

,

Concurr. Comput. Pract. Exp., 2022

Hardware-Aware Evolutionary Filter Pruning.

[BibT_eX]

[DOI]

Christian Heidorn

,

Nicolai Meyerhöfer

,

Christian Schinabeck

,

,

Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2022

MOSP: Multi-Objective Sensitivity Pruning of Deep Neural Networks.

[BibT_eX]

[DOI]

,

Ashutosh Mishra

,

,

Proceedings of the 13th IEEE International Green and Sustainable Computing Conference, 2022

TRAC: Compilation-Based Design of Transformer Accelerators for FPGAs.

[BibT_eX]

[DOI]

Patrick Plagwitz

,

,

Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications, 2022

DyFiP: explainable AI-based dynamic filter pruning of convolutional neural networks.

[BibT_eX]

[DOI]

,

,

Proceedings of the EuroMLSys '22: Proceedings of the 2nd European Workshop on Machine Learning and Systems, Rennes, France, April 5, 2022

2021

Introduction to the Special Issue on Application-Specific Systems, Architectures and Processors.

[BibT_eX]

[DOI]

,

J. Signal Process. Syst., 2021

Symbolic Loop Compilation for Tightly Coupled Processor Arrays.

[BibT_eX]

[DOI]

Michael Witterauf

,

,

,

ACM Trans. Embed. Comput. Syst., 2021

HipaccVX: wedding of OpenVX and DSL-based code generation.

[BibT_eX]

[DOI]

,

,

,

,

J. Real Time Image Process., 2021

Proceedings of the DATE Friday Workshop on System-level Design Methods for Deep Learning on Heterogeneous Architectures (SLOHA 2021).

[BibT_eX]

[DOI]

,

,

Matteo Spallanzani

,

Matthias Ziegler

CoRR, 2021

*-Predictable MPSoC execution of real-time control applications using invasive computing.

[BibT_eX]

[DOI]

,

Michael Witterauf

,

,

Alexandru Tanase

,

,

Concurr. Comput. Pract. Exp., 2021

Open Source Hardware.

[BibT_eX]

[DOI]

,

Computer, 2021

Efficient Application of Tensor Core Units for Convolving Images.

[BibT_eX]

[DOI]

,

,

Proceedings of the SCOPES '21: 24th International Workshop on Software and Compilers for Embedded Systems, Eindhoven, The Netherlands, November 1, 2021

An Efficient Approach for Image Border Handling on GPUs via Iteration Space Partitioning.

[BibT_eX]

[DOI]

,

,

Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

Hand Sign Recognition via Deep Learning on Tightly Coupled Processor Arrays.

[BibT_eX]

[DOI]

Christian Heidorn

,

,

Yunus Emre Candir

,

,

Proceedings of the 31st International Conference on Field-Programmable Logic and Applications, 2021

A Safari through FPGA-based Neural Network Compilation and Design Automation Flows.

[BibT_eX]

[DOI]

Patrick Plagwitz

,

,

Martin Ströbel

,

Christoph Strohmeyer

,

Proceedings of the 29th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2021

Fault-Tolerant Low-Precision DNNs using Explainable AI.

[BibT_eX]

[DOI]

,

,

Proceedings of the 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2021

2020

ExaStencils: Advanced Multigrid Solver Generation.

[BibT_eX]

[DOI]

Christian Lengauer

,

,

Matthias Bolten

,

,

,

,

Armin Größlinger

,

,

Harald Köstler

,

,

Alexander Grebhahn

,

,

Stefan Kronawitter

,

Sebastian Kuckuk

,

,

Christian Schmitt

,

Proceedings of the Software for Exascale Computing - SPPEXA 2016-2019, 2020

AnyHLS: High-Level Synthesis With Partial Evaluation.

[BibT_eX]

[DOI]

,

Arsène Pérard-Gayot

,

Richard Membarth

,

Philipp Slusallek

,

,

,

,

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

Utilizing Explainable AI for Quantization and Pruning of Deep Neural Networks.

[BibT_eX]

[DOI]

,

,

CoRR, 2020

A bibliometric approach for detecting the gender gap in computer science.

[BibT_eX]

[DOI]

Sandra Mattauch

,

,

,

,

Commun. ACM, 2020

Hardware Implementation of Hyperbolic Tangent Activation Function for Floating Point Formats.

[BibT_eX]

[DOI]

Arvind Thumatti K. R.

,

,

Christian Heidorn

,

,

,

Proceedings of the 2020 24th International Symposium on VLSI Design and Test (VDAT), 2020

Efficient parallel reduction on GPUs with Hipacc.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the SCOPES '20: 23rd International Workshop on Software and Compilers for Embedded Systems, 2020

Design space exploration for layer-parallel execution of convolutional neural networks on CGRAs.

[BibT_eX]

[DOI]

Christian Heidorn

,

,

Proceedings of the SCOPES '20: 23rd International Workshop on Software and Compilers for Embedded Systems, 2020

Unveiling kernel concurrency in multiresolution filters on GPUs with an image processing DSL.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the GPGPU@PPoPP '20: 13th Annual Workshop on General Purpose Processing using Graphics Processing Unit colocated with 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

HighPerMeshes - A Domain-Specific Language for Numerical Algorithms on Unstructured Grids.

[BibT_eX]

[DOI]

,

,

,

Daniel Grünewald

,

,

,

,

Franz-Josef Pfreundt

,

Christian Plessl

,

Merlind Schotte

,

,

,

,

Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020

The Best of Both Worlds: Combining CUDA Graph with an Image Processing DSL.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

A runtime system for finite element methods in a partitioned global address space.

[BibT_eX]

[DOI]

,

Daniel Grünewald

,

,

Proceedings of the 17th ACM International Conference on Computing Frontiers, 2020

Message from the Conference Chairs - ASAP 2020.

[BibT_eX]

[DOI]

,

,

Javier Navaridas

Proceedings of the 31st IEEE International Conference on Application-specific Systems, 2020

2019

Efficient Mapping of Streaming Applications for Image Processing on Graphics Cards.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

Trans. High Perform. Embed. Archit. Compil., 2019

Efficient Mapping of CNNs onto Tightly Coupled Processor Arrays.

[BibT_eX]

[DOI]

Christian Heidorn

,

Michael Witterauf

,

,

J. Comput., 2019

SYCL Code Generation for Multigrid Methods.

[BibT_eX]

[DOI]

,

Christian Schmitt

,

,

Proceedings of the 22nd International Workshop on Software and Compilers for Embedded Systems, 2019

Polyhedral fragments: an efficient representation for symbolically generating code for processor arrays.

[BibT_eX]

[DOI]

Michael Witterauf

,

,

Proceedings of the 17th ACM-IEEE International Conference on Formal Methods and Models for System Design, 2019

From Loop Fusion to Kernel Fusion: A Domain-Specific Approach to Locality Optimization.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

Anytime instructions for programmable accuracy floating-point arithmetic.

[BibT_eX]

[DOI]

,

Michael Witterauf

,

,

Proceedings of the 16th ACM International Conference on Computing Frontiers, 2019

DSL-Based Acceleration of Automotive Environment Perception and Mapping Algorithms for Embedded CPUs, GPUs, and FPGAs.

[BibT_eX]

[DOI]

Jörg Fickenscher

,

,

Proceedings of the Architecture of Computing Systems - ARCS 2019, 2019

Modeling and Simulation of Invasive Applications and Architectures.

[BibT_eX]

[DOI]

,

,

Computer Architecture and Design Methodologies, Springer, ISBN: 978-981-13-8387-8, 2019

2018

Loop Parallelization Techniques for FPGA Accelerator Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

J. Signal Process. Syst., 2018

Symbolic Multi-Level Loop Mapping of Loop Programs for Massively Parallel Processor Arrays.

[BibT_eX]

[DOI]

Alexandru Tanase

,

Michael Witterauf

,

,

ACM Trans. Embed. Comput. Syst., 2018

Reconfigurable Hardware Generation of Multigrid Solvers with Conjugate Gradient Coarse-Grid Solution.

[BibT_eX]

[DOI]

Christian Schmitt

,

,

Sebastian Kuckuk

,

Harald Köstler

,

,

Parallel Process. Lett., 2018

Automating the Development of High-Performance Multigrid Solvers.

[BibT_eX]

[DOI]

Christian Schmitt

,

Stefan Kronawitter

,

,

,

Christian Lengauer

Proc. IEEE, 2018

Special issue on heterogeneous real-time image processing.

[BibT_eX]

[DOI]

,

J. Real Time Image Process., 2018

Base Algorithms of Environment Maps and Efficient Occupancy Grid Mapping on Embedded GPUs.

[BibT_eX]

[DOI]

Jörg Fickenscher

,

,

,

Mohamed Essayed Bouzouraa

Proceedings of the 4th International Conference on Vehicle Technology and Intelligent Transport Systems, 2018

Automatic Kernel Fusion for Image Processing DSLs.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 21st International Workshop on Software and Compilers for Embedded Systems, 2018

OpenCL-Based FPGA Design to Accelerate the Nodal Discontinuous Galerkin Method for Unstructured Meshes.

[BibT_eX]

[DOI]

,

Gopinath Mahale

,

,

,

Christian Schmitt

,

,

,

,

Christian Plessl

Proceedings of the 26th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2018

Cell-based update algorithm for occupancy grid maps and hybrid map for ADAS on embedded GPUs.

[BibT_eX]

[DOI]

Jörg Fickenscher

,

Jens Schlumberger

,

,

,

Mohamed Essayed Bouzouraa

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

Invasive Computing for Predictability of Multiple Non-functional Properties: A Cyber-Physical System Case Study.

[BibT_eX]

[DOI]

,

Michael Witterauf

,

,

Alexandru Tanase

,

,

Proceedings of the 29th IEEE International Conference on Application-specific Systems, 2018

Solving Maxwell's Equations with Modern C++ and SYCL: A Case Study.

[BibT_eX]

[DOI]

,

Christian Schmitt

,

,

,

,

,

Proceedings of the 29th IEEE International Conference on Application-specific Systems, 2018

Symbolic Parallelization of Nested Loop Programs

[BibT_eX]

[DOI]

Alexandru-Petru Tanase

,

,

Springer, ISBN: 978-3-319-73909-0, 2018

2017

A Novel Image Impulse Noise Removal Algorithm Optimized for Hardware Accelerators.

[BibT_eX]

[DOI]

Vivek Singh Bhadouria

,

Alexandru Tanase

,

,

,

,

Dibyendu Ghoshal

J. Signal Process. Syst., 2017

Trends in Data Locality Abstractions for HPC Systems.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2017

Power Density-Aware Resource Management for Heterogeneous Tiled Multicores.

[BibT_eX]

[DOI]

,

Santiago Pagani

,

,

,

,

,

Muhammad Shafique

,

,

IEEE Trans. Computers, 2017

Introduction to the special issue on architecture of computing systems.

[BibT_eX]

[DOI]

,

João M. P. Cardoso

,

J. Syst. Archit., 2017

A Scala prototype to generate multigrid solver implementations for different problems and target multi-core platforms.

[BibT_eX]

[DOI]

Harald Köstler

,

Christian Schmitt

,

Sebastian Kuckuk

,

Stefan Kronawitter

,

,

,

,

Christian Lengauer

Int. J. Comput. Sci. Eng., 2017

Constructing fast and cycle-accurate simulators for configurable accelerators using C++ templates.

[BibT_eX]

[DOI]

Michael Witterauf

,

,

Proceedings of the International Symposium on Rapid System Prototyping, 2017

A reconfigurable memory architecture for system integration of coarse-grained reconfigurable arrays.

[BibT_eX]

[DOI]

,

Alexandru Tanase

,

,

Proceedings of the International Conference on ReConFigurable Computing and FPGAs, 2017

TCPA editor: A design automation environment for a class of coarse-grained reconfigurable arrays.

[BibT_eX]

[DOI]

,

Arindam Chakraborty

,

Alexandru Tanase

,

,

Proceedings of the International Conference on ReConFigurable Computing and FPGAs, 2017

Orthogonal Instruction Processing: An Alternative to Lightweight VLIW Processors.

[BibT_eX]

[DOI]

,

,

Alexandru Tanase

,

Proceedings of the 11th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2017

Auto-vectorization for image processing DSLs.

[BibT_eX]

[DOI]

,

Christof Kobylko

,

,

Proceedings of the 18th ACM SIGPLAN/SIGBED Conference on Languages, 2017

Convoy tracking for ADAS on embedded GPUs.

[BibT_eX]

[DOI]

Jörg Fickenscher

,

Sebastian Reinhart

,

,

,

Mohamed Essayed Bouzouraa

Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Generating FPGA-based image processing accelerators with Hipacc: (Invited paper).

[BibT_eX]

[DOI]

,

,

Richard Membarth

,

,

Proceedings of the 2017 IEEE/ACM International Conference on Computer-Aided Design, 2017

High performance network-on-chip simulation by interval-based timing predictions.

[BibT_eX]

[DOI]

,

,

Proceedings of the 15th IEEE/ACM Symposium on Embedded Systems for Real-Time Multimedia, 2017

Hardware design and analysis of efficient loop coarsening and border handling for image processing.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 28th IEEE International Conference on Application-specific Systems, 2017

Efficiency in ILP processing by using orthogonality.

[BibT_eX]

[DOI]

,

,

Alexandru Tanase

,

Proceedings of the 28th IEEE International Conference on Application-specific Systems, 2017

2016

Systems of Partial Differential Equations in ExaSlang.

[BibT_eX]

[DOI]

Christian Schmitt

,

Sebastian Kuckuk

,

,

,

Harald Köstler

,

,

Christian Lengauer

Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

HIPA<sup>cc</sup>: A Domain-Specific Language and Compiler for Image Processing.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

,

IEEE Trans. Parallel Distributed Syst., 2016

Dark silicon management: an integrated and coordinated cross-layer approach.

[BibT_eX]

[DOI]

Santiago Pagani

,

,

,

Elisabeth Glocker

,

,

Andreas Herkersdorf

,

,

,

Ulf Schlichtmann

,

Doris Schmitt-Landsiedel

,

,

,

,

,

,

it Inf. Technol., 2016

Hybrid code description for developing fast and resource efficient image processing architectures.

[BibT_eX]

[DOI]

Konrad Häublein

,

Marc Reichenbach

,

,

,

,

,

Proceedings of the International Conference on Embedded Computer Systems: Architectures, 2016

ActorX10: an actor library for X10.

[BibT_eX]

[DOI]

,

Alexander Pöppl

,

Tobias Schwarzer

,

Stefan Wildermann

,

,

,

,

Proceedings of the 6th ACM SIGPLAN Workshop on X10, 2016

Modeling, programming and performance analysis of automotive environment map representations on embedded GPUs.

[BibT_eX]

[DOI]

Jörg Fickenscher

,

,

Jens Schlumberger

,

,

Proceedings of the IEEE International High Level Design Validation and Test Workshop, 2016

FPGA-based accelerator design from a domain-specific language.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 26th International Conference on Field Programmable Logic and Applications, 2016

Modulo scheduling of symbolically tiled loops for tightly coupled processor arrays.

[BibT_eX]

[DOI]

Michael Witterauf

,

Alexandru Tanase

,

,

Proceedings of the 27th IEEE International Conference on Application-specific Systems, 2016

Big Data and HPC Acceleration with Vivado HLS.

[BibT_eX]

[DOI]

,

Christian Schmitt

,

,

Gorker Alp Malazgirt

,

,

,

Adrián Cristal

Proceedings of the FPGAs for Software Programmers, 2016

HIPA<sup>cc</sup>.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the FPGAs for Software Programmers, 2016

FPGA Versus Software Programming: Why, When, and How?

[BibT_eX]

[DOI]

,

,

Proceedings of the FPGAs for Software Programmers, 2016

A Quick Tour of High-Level Synthesis Solutions for FPGAs.

[BibT_eX]

[DOI]

Proceedings of the FPGAs for Software Programmers, 2016

2015

Synthesis and optimization of image processing accelerators using domain knowledge.

[BibT_eX]

[DOI]

,

Konrad Häublein

,

Marc Reichenbach

,

,

,

,

J. Syst. Archit., 2015

Resource-awareness on heterogeneous MPSoCs for image processing.

[BibT_eX]

[DOI]

,

Walter Stechele

,

Benjamin Oechslein

,

Christoph Erhardt

,

,

,

Wolfgang Schröder-Preikschat

,

Manfred Kröhnert

,

,

,

,

,

,

Artjom Grudnitsky

,

,

J. Syst. Archit., 2015

Introduction to the Special Issue on Testing, prototyping, and debugging of multi-core architectures.

[BibT_eX]

[DOI]

,

Andreas Herkersdorf

J. Syst. Archit., 2015

Automatic Optimization of Hardware Accelerators for Image Processing.

[BibT_eX]

[DOI]

,

Konrad Häublein

,

Marc Reichenbach

,

,

,

CoRR, 2015

Proceedings of the Second International Workshop on FPGAs for Software Programmers (FSP 2015).

[BibT_eX]

[DOI]

,

,

CoRR, 2015

Proceedings of the DATE Friday Workshop on Heterogeneous Architectures and Design Methods for Embedded Image Systems (HIS 2015).

[BibT_eX]

[DOI]

,

,

Anton Lokhmotov

CoRR, 2015

Runtime Adaptation of Application Execution under Thermal and Power Constraints in Massively Parallel Processor Arrays.

[BibT_eX]

[DOI]

,

,

,

,

Ulf Schlichtmann

Proceedings of the 18th International Workshop on Software and Compilers for Embedded Systems, 2015

Symbolic loop parallelization for balancing I/O and memory accesses on processor arrays.

[BibT_eX]

[DOI]

Alexandru Tanase

,

Michael Witterauf

,

,

Proceedings of the 13. ACM/IEEE International Conference on Formal Methods and Models for Codesign, 2015

Reconfigurable Buffer Structures for Coarse-Grained Reconfigurable Arrays.

[BibT_eX]

[DOI]

,

,

Proceedings of the System Level Design from HW/SW to Memory for Embedded Systems, 2015

Invasive computing for predictable stream processing: a simulation-based case study.

[BibT_eX]

[DOI]

,

Stefan Wildermann

,

,

Proceedings of the 13th IEEE Symposium on Embedded Systems For Real-time Multimedia, 2015

Execution-driven parallel simulation of PGAS applications on heterogeneous tiled architectures.

[BibT_eX]

[DOI]

,

David Schafhauser

,

,

Proceedings of the 52nd Annual Design Automation Conference, 2015

On-demand fault-tolerant loop processing on massively parallel processor arrays.

[BibT_eX]

[DOI]

Alexandru Tanase

,

Michael Witterauf

,

,

,

Proceedings of the 26th IEEE International Conference on Application-specific Systems, 2015

Loop coarsening in C-based High-Level Synthesis.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 26th IEEE International Conference on Application-specific Systems, 2015

A co-design approach for fault-tolerant loop execution on Coarse-Grained Reconfigurable Arrays.

[BibT_eX]

[DOI]

,

Alexandru Tanase

,

,

Michael Witterauf

,

Faramarz Khosravi

,

,

Proceedings of the 2015 NASA/ESA Conference on Adaptive Hardware and Systems, 2015

2014

Symbolic Mapping of Loop Programs onto Processor Arrays.

[BibT_eX]

[DOI]

,

Alexandru Tanase

,

J. Signal Process. Syst., 2014

Compact Code Generation for Tightly-Coupled Processor Arrays.

[BibT_eX]

[DOI]

,

,

J. Signal Process. Syst., 2014

Invasive Tightly-Coupled Processor Arrays: A Domain-Specific Architecture/Compiler Co-Design Approach.

[BibT_eX]

[DOI]

,

,

,

Alexandru Tanase

,

ACM Trans. Embed. Comput. Syst., 2014

Experiments on Optimizing the Performance of Stencil Codes with SPL Conqueror.

[BibT_eX]

[DOI]

Alexander Grebhahn

,

Sebastian Kuckuk

,

Christian Schmitt

,

Harald Köstler

,

Norbert Siegmund

,

,

,

Parallel Process. Lett., 2014

Towards a performance-portable description of geometric multigrid algorithms using a domain-specific language.

[BibT_eX]

[DOI]

Richard Membarth

,

,

Christian Schmitt

,

,

,

Markus Stürmer

,

Harald Köstler

J. Parallel Distributed Comput., 2014

Code Generation for High-Level Synthesis of Multiresolution Applications on FPGAs.

[BibT_eX]

[DOI]

,

,

Christian Schmitt

,

,

CoRR, 2014

Massively Parallel Processor Architectures for Resource-aware Computing.

[BibT_eX]

[DOI]

,

Alexandru Tanase

,

,

CoRR, 2014

A Scala Prototype to Generate Multigrid Solver Implementations for Different Problems and Target Multi-Core Platforms.

[BibT_eX]

[DOI]

Harald Köstler

,

Christian Schmitt

,

Sebastian Kuckuk

,

,

,

CoRR, 2014

Proceedings of the First Workshop on Resource Awareness and Adaptivity in Multi-Core Computing (Racing 2014).

[BibT_eX]

[DOI]

,

CoRR, 2014

Proceedings of the First International Workshop on FPGAs for Software Programmers (FSP 2014).

[BibT_eX]

[DOI]

,

,

CoRR, 2014

ExaSlang: a domain-specific language for highly scalable multigrid solvers.

[BibT_eX]

[DOI]

Christian Schmitt

,

Sebastian Kuckuk

,

,

Harald Köstler

,

Proceedings of the Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, 2014

Symbolic inner loop parallelisation for massively parallel processor arrays.

[BibT_eX]

[DOI]

Alexandru Tanase

,

Michael Witterauf

,

,

Proceedings of the Twelfth ACM/IEEE International Conference on Formal Methods and Models for Codesign, 2014

An Evaluation of Domain-Specific Language Technologies for Code Generation.

[BibT_eX]

[DOI]

Christian Schmitt

,

Sebastian Kuckuk

,

Harald Köstler

,

,

Proceedings of the 2014 14th International Conference on Computational Science and Its Applications, Guimaraes, Portugal, June 30, 2014

An image processing library for C-based high-level synthesis.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 24th International Conference on Field Programmable Logic and Applications, 2014

ExaStencils: Advanced Stencil-Code Engineering.

[BibT_eX]

[DOI]

Christian Lengauer

,

,

Matthias Bolten

,

Armin Größlinger

,

,

Harald Köstler

,

,

,

Alexander Grebhahn

,

Stefan Kronawitter

,

Sebastian Kuckuk

,

,

Christian Schmitt

Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

Runtime Reconfigurable Bus Arbitration for Concurrent Applications on Heterogeneous MPSoC Architectures.

[BibT_eX]

[DOI]

,

Deepak Gangadharan

,

,

Proceedings of the 17th Euromicro Conference on Digital System Design, 2014

Code generation for embedded heterogeneous architectures on android.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2014

Self-adaptive harris corner detector on heterogeneous many-core processor.

[BibT_eX]

[DOI]

,

Walter Stechele

,

,

,

,

,

Manfred Kröhnert

,

Proceedings of the 2014 Conference on Design and Architectures for Signal and Image Processing, 2014

Code generation from a domain-specific language for C-based HLS of hardware accelerators.

[BibT_eX]

[DOI]

,

,

,

Richard Membarth

,

Proceedings of the 2014 International Conference on Hardware/Software Codesign and System Synthesis, 2014

Domain-specific augmentations for High-Level Synthesis.

[BibT_eX]

[DOI]

,

Alexandru Tanase

,

,

,

Vivek Singh Bhadouria

,

Dibyendu Ghoshal

Proceedings of the IEEE 25th International Conference on Application-Specific Systems, 2014

Towards Actor-oriented Programming on PGAS-based Multicore Architectures.

[BibT_eX]

[DOI]

,

,

Proceedings of the ARCS 2014, 2014

Application-driven reconfiguration of shared resources for timing predictability of MPSoC platforms.

[BibT_eX]

[DOI]

Deepak Gangadharan

,

,

,

,

Proceedings of the 48th Asilomar Conference on Signals, Systems and Computers, 2014

2013

NoC simulation in heterogeneous architectures for PGAS programming model.

[BibT_eX]

[DOI]

,

Andreas Weichslgartner

,

,

,

Proceedings of the International Workshop on Software and Compilers for Embedded Systems, 2013

Real-timerange image preprocessing on FPGAs.

[BibT_eX]

[DOI]

,

Markus Blocherer

,

,

Proceedings of the 2012 International Conference on Reconfigurable Computing and FPGAs, 2013

Exploitation of Quality/Throughput Tradeoffs in Image Processing through Invasive Computing.

[BibT_eX]

[DOI]

Alexandru Tanase

,

,

,

Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

High-Level Synthesis Revised - Generation of FPGA Accelerators from a Domain-Specific Language using the Polyhedron Model.

[BibT_eX]

[DOI]

,

,

Alexandru Tanase

,

Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

A prototype of an adaptive computer vision algorithm on MPSoC architecture.

[BibT_eX]

[DOI]

Éricles Rodrigues Sousa

,

Alexandru Tanase

,

,

Proceedings of the 2013 Conference on Design and Architectures for Signal and Image Processing, 2013

Accuracy and performance analysis of Harris Corner computation on tightly-coupled processor arrays.

[BibT_eX]

[DOI]

Éricles Rodrigues Sousa

,

Alexandru Tanase

,

,

Proceedings of the 2013 Conference on Design and Architectures for Signal and Image Processing, 2013

System integration of tightly-coupled processor arrays using reconfigurable buffer structures.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Computing Frontiers Conference, 2013

Symbolic parallelization of loop programs for massively parallel processor arrays.

[BibT_eX]

[DOI]

,

Alexandru Tanase

,

Proceedings of the 24th International Conference on Application-Specific Systems, 2013

Loop program mapping and compact code generation for programmable hardware accelerators.

[BibT_eX]

[DOI]

,

,

Proceedings of the 24th International Conference on Application-Specific Systems, 2013

2012

Hierarchical power management for adaptive tightly-coupled processor arrays.

[BibT_eX]

[DOI]

,

Shravan Muddasani

,

,

,

,

ACM Trans. Design Autom. Electr. Syst., 2012

Fast architecture evaluation of heterogeneous MPSoCs by host-compiled simulation.

[BibT_eX]

[DOI]

,

,

Proceedings of the Workshop on Software and Compilers for Embedded Systems, 2012

Towards Domain-Specific Computing for Stencil Codes in HPC.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

Harald Köstler

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators Based on a Domain-Specific Language for Medical Imaging.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

Proceedings of the 11th International Symposium on Parallel and Distributed Computing, 2012

Generating Device-specific GPU Code for Local Operators in Medical Imaging.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

An integrated simulation framework for invasive computing.

[BibT_eX]

[DOI]

,

,

Andreas Herkersdorf

,

Andreas Hollmann

,

,

,

Josef Weidendorfer

,

,

Proceedings of the Proceeding of the 2012 Forum on Specification and Design Languages, 2012

Power Management Strategies for Serial RapidIO Endpoints in FPGAs.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2012 IEEE 20th Annual International Symposium on Field-Programmable Custom Computing Machines, 2012

Mastering Software Variant Explosion for GPU Accelerators.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

A prototype of an invasive tightly-coupled processor array.

[BibT_eX]

[DOI]

Shravan Muddasani

,

,

,

,

,

Proceedings of the 2012 Conference on Design and Architectures for Signal and Image Processing, 2012

Approximate time functional simulation of resource-aware programming concepts for heterogeneous MPSoCs.

[BibT_eX]

[DOI]

,

,

Proceedings of the 17th Asia and South Pacific Design Automation Conference, 2012

Design of Low Power On-chip Processor Arrays.

[BibT_eX]

[DOI]

,

Shravan Muddasani

,

,

,

Proceedings of the 23rd IEEE International Conference on Application-Specific Systems, 2012

Dynamic Task-Scheduling and Resource Management for GPU Accelerators in Medical Imaging.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

,

Proceedings of the Architecture of Computing Systems - ARCS 2012 - 25th International Conference, Munich, Germany, February 28, 2012

2011

Efficient Evaluation of Power/Area/Latency Design Trade-Offs for Coarse-Grained Reconfigurable Processor Arrays.

[BibT_eX]

[DOI]

Dmitrij Kissler

,

,

J. Low Power Electron., 2011

Scalable Many-Domain Power Gating in Coarse-Grained Reconfigurable Processor Arrays.

[BibT_eX]

[DOI]

Dmitrij Kissler

,

,

,

,

IEEE Embed. Syst. Lett., 2011

Resource-aware programming and simulation of MPSoC architectures through extension of X10.

[BibT_eX]

[DOI]

,

,

Gregor Snelting

,

,

Andreas Zwinkau

Proceedings of the 14th International Workshop on Software and Compilers for Embedded Systems, 2011

Frameworks for GPU Accelerators: A comprehensive evaluation using 2D/3D image registration.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

Proceedings of the IEEE 9th Symposium on Application Specific Processors, 2011

Towards Symbolic Run-Time Reconfiguration in Tightly-Coupled Processor Arrays.

[BibT_eX]

[DOI]

,

,

,

Roberto Perez-Andrade

Proceedings of the 2011 International Conference on Reconfigurable Computing and FPGAs, 2011

Detector defect correction of medical images on graphics processors.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

Heinz Hornegger

Proceedings of the Medical Imaging 2011: Image Processing, 2011

Towards Resource Aware Programming on Intel's Single-Chip Cloud Computer Processor.

[BibT_eX]

[DOI]

Georgia Kouveli

,

,

,

Proceedings of the 3rd Many-core Applications Research Community (MARC) Symposium. Proceedings of the 3rd MARC Symposium, 2011

Distributed Resource Reservation in Massively Parallel Processor Arrays.

[BibT_eX]

[DOI]

,

,

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Decentralized dynamic resource management support for massively parallel processor arrays.

[BibT_eX]

[DOI]

,

Andriy Narovlyanskyy

,

,

Proceedings of the 22nd IEEE International Conference on Application-specific Systems, 2011

Frameworks for Multi-core Architectures: A Comprehensive Evaluation Using 2D/3D Image Registration.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

Proceedings of the Architecture of Computing Systems - ARCS 2011, 2011

2010

A deeply pipelined and parallel architecture for denoising medical images.

[BibT_eX]

[DOI]

,

,

,

Heinz Hornegger

Proceedings of the International Conference on Field-Programmable Technology, 2010

Compilation techniques for CGRAs: exploring all parallelization approaches.

[BibT_eX]

[DOI]

,

Praveen Raghavan

,

Scott A. Mahlke

,

Bjorn De Sutter

,

Aviral Shrivastava

,

Proceedings of the 8th International Conference on Hardware/Software Codesign and System Synthesis, 2010

Modeling and synthesis of communication subsystems for loop accelerator pipelines.

[BibT_eX]

[DOI]

,

,

,

Joachim Keinert

Proceedings of the 21st IEEE International Conference on Application-specific Systems Architectures and Processors, 2010

2009

Scheduling Techniques for High-Throughput Loop Accelerators.

[BibT_eX]

[DOI]

PhD thesis, 2009

A holistic approach for tightly coupled reconfigurable parallel processors.

[BibT_eX]

[DOI]

,

Dmitrij Kissler

,

,

Alexey Kupriyanov

,

,

Bernard Pottier

Microprocess. Microsystems, 2009

Power-Efficient Reconfiguration Control in Coarse-Grained Dynamically Reconfigurable Architectures.

[BibT_eX]

[DOI]

Dmitrij Kissler

,

Andreas Strawetz

,

,

J. Low Power Electron., 2009

Efficient Mapping of Multiresolution Image Filtering Algorithms on Graphics Processors.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

Proceedings of the Embedded Computer Systems: Architectures, 2009

System Integration of Tightly-Coupled Reconfigurable Processor Arrays and Evaluation of Buffer Size Effects on Their Performance.

[BibT_eX]

[DOI]

,

,

Proceedings of the ICPPW 2009, 2009

FPGA implementation of an invasive computing architecture.

[BibT_eX]

[DOI]

Abdulazim Amouri

,

Farhadur Arifin

,

,

Proceedings of the 2009 International Conference on Field-Programmable Technology, 2009

Model-based synthesis and optimization of static multi-rate image processing algorithms.

[BibT_eX]

[DOI]

Joachim Keinert

,

,

,

Christian Haubelt

,

Proceedings of the Design, Automation and Test in Europe, 2009

Acceleration of Multiresolution Imaging Algorithms: A Comparative Study.

[BibT_eX]

[DOI]

Richard Membarth

,

,

,

,

Proceedings of the 20th IEEE International Conference on Application-Specific Systems, 2009

Impact of Loop Tiling on the Controller Logic of Acceleration Engines.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 20th IEEE International Conference on Application-Specific Systems, 2009

Parallelization Approaches for Hardware Accelerators - Loop Unrolling Versus Loop Partitioning.

[BibT_eX]

[DOI]

,

,

Proceedings of the Architecture of Computing Systems, 2009

Performance Matching of Hardware Acceleration Engines for Heterogeneous MPSoC Using Modular Performance Analysis.

[BibT_eX]

[DOI]

,

,

Proceedings of the Architecture of Computing Systems, 2009

2008

The PAULA Language for Designing Multi-Dimensional Dataflow-Intensive Applications.

[BibT_eX]

,

Holger Ruckdeschel

,

Proceedings of the Methoden und Beschreibungssprachen zur Modellierung und Verifikation von Schaltungen und Systemen (MBMV), 2008

Area and reconfiguration time minimization of the communication network in regular 2D reconfigurable architectures.

[BibT_eX]

[DOI]

Christophe Wolinski

,

Krzysztof Kuchcinski

,

,

Proceedings of the FPL 2008, 2008

Coarse-grained reconfiguration.

[BibT_eX]

[DOI]

Sven Eisenhardt

,

Thomas Schweizer

,

Julio A. de Oliveira Filho

,

,

Wolfgang Rosenstiel

,

Alexander Thomas

,

,

,

Dmitrij Kissler

,

,

,

Heiko Hinkelmann

,

,

Manfred Glesner

Proceedings of the FPL 2008, 2008

Optimization of Routing and Reconfiguration Overhead in Programmable Processor Array Architectures.

[BibT_eX]

[DOI]

Christophe Wolinski

,

Krzysztof Kuchcinski

,

,

Proceedings of the 16th IEEE International Symposium on Field-Programmable Custom Computing Machines, 2008

Communication Network Reconfiguration Overhead Optimization in Programmable Processor Array Architectures.

[BibT_eX]

[DOI]

Christophe Wolinski

,

Krzysztof Kuchcinski

,

,

Proceedings of the 11th Euromicro Conference on Digital System Design: Architectures, 2008

Utilization of all Levels of Parallelism in a Processor Array with Subword Parallelism.

[BibT_eX]

[DOI]

Rainer Schaffer

,

,

,

Proceedings of the 11th Euromicro Conference on Digital System Design: Architectures, 2008

PARO: Synthesis of Hardware Accelerators for Multi-Dimensional Dataflow-Intensive Applications.

[BibT_eX]

[DOI]

,

Holger Ruckdeschel

,

,

Proceedings of the Reconfigurable Computing: Architectures, 2008

2007

Efficient control generation for mapping nested loop programs onto processor arrays.

[BibT_eX]

[DOI]

,

,

Holger Ruckdeschel

,

J. Syst. Archit., 2007

Efficient event-driven simulation of parallel processor architectures.

[BibT_eX]

[DOI]

Alexey Kupriyanov

,

Dmitrij Kissler

,

,

Proceedings of the 10th International Workshop on Software and Compilers for Embedded Systems, 2007

Massively Parallel Processor Architectures: A Co-design Approach.

[BibT_eX]

,

,

Alexey Kupriyanov

,

Dmitrij Kissler

,

,

Rainer Schaffer

,

Sebastian Siegel

,

,

Bernard Pottier

Proceedings of the 3rd International Workshop on Reconfigurable Communication-centric Systems-on-Chip, 2007

A Unified Retargetable Design Methodology for Dedicated and Re-Programmable Multiprocessor Arrays: Case Study and Quantitative Evaluation.

[BibT_eX]

,

,

Holger Ruckdeschel

,

,

Dmitrij Kissler

,

Proceedings of the 2007 International Conference on Engineering of Reconfigurable Systems & Algorithms, 2007

Modeling of Interconnection Networks in Massively Parallel Processor Architectures.

[BibT_eX]

[DOI]

Alexey Kupriyanov

,

,

Dmitrij Kissler

,

,

,

Olivier Sentieys

,

Sébastien Pillement

Proceedings of the Architecture of Computing Systems, 2007

2006

Mapping a class of dependence algorithms to coarse-grained reconfigurable arrays: architectural parameters and methodology.

[BibT_eX]

[DOI]

,

,

Int. J. Embed. Syst., 2006

A Dynamically Reconfigurable Weakly Programmable Processor Array Architecture Template.

[BibT_eX]

Dmitrij Kissler

,

,

Alexey Kupriyanov

,

Proceedings of the 2nd International Workshop on Reconfigurable Communication-centric Systems-on-Chip, 2006

Hierarchical Partitioning for Piecewise Linear Algorithms.

[BibT_eX]

[DOI]

,

,

Proceedings of the Fifth International Conference on Parallel Computing in Electrical Engineering (PARELEC 2006), 2006

An Architecture Description Language for Massively Parallel Processor Architectures.

[BibT_eX]

Alexey Kupriyanov

,

,

Dmitrij Kissler

,

,

Rainer Schaffer

,

Proceedings of the Methoden und Beschreibungssprachen zur Modellierung und Verifikation von Schaltungen und Systemen (MBMV), 2006

Hardware Cost Analysis for Weakly Programmable Processor Arrays.

[BibT_eX]

[DOI]

Dmitrij Kissler

,

,

Alexey Kupriyanov

,

Proceedings of the International Symposium on System-on-Chip, 2006

A highly parameterizable parallel processor array architecture.

[BibT_eX]

[DOI]

Dmitrij Kissler

,

,

Alexey Kupriyanov

,

Proceedings of the 2006 IEEE International Conference on Field Programmable Technology, 2006

A Generic Framework for Rapid Prototyping of System-on-Chip Designs.

[BibT_eX]

Dmitrij Kissler

,

Alexey Kupriyanov

,

,

,

Proceedings of the 2006 International Conference on Computer Design & Conference on Computing in Nanotechnology, 2006

A Design Methodology for Hardware Acceleration of Adaptive Filter Algorithms in Image Processing.

[BibT_eX]

[DOI]

,

,

,

,

Heinz Hornegger

Proceedings of the 2006 IEEE International Conference on Application-Specific Systems, 2006

Controller Synthesis for Mapping Partitioned Programs on Array Architectures.

[BibT_eX]

[DOI]

,

,

Proceedings of the Architecture of Computing Systems, 2006

2005

Defragmenting the Module Layout of a Partially Reconfigurable Device

[BibT_eX]

[DOI]

Jan van der Veen

,

Sándor P. Fekete

,

,

Christophe Bobda

,

,

CoRR, 2005

Automatic FIR Filter Generation for FPGAs.

[BibT_eX]

[DOI]

Holger Ruckdeschel

,

,

,

Proceedings of the Embedded Computer Systems: Architectures, 2005

Co-Design of Massively Parallel Embedded Processor Architectures.

[BibT_eX]

,

,

Alexey Kupriyanov

,

,

Rainer Schaffer

,

Sebastian Siegel

,

,

,

Bernard Pottier

,

,

,

Olivier Sentieys

Proceedings of the 1st International Workshop on Reconfigurable Communication-centric Systems-on-Chip, 2005

Defragmenting the Module Layout of a Partially Reconfigurable Device.

[BibT_eX]

Jan van der Veen

,

Sándor P. Fekete

,

,

,

Christophe Bobda

,

,

Proceedings of The 2005 International Conference on Engineering of Reconfigurable Systems and Algorithms, 2005

Output Serialization for FPGA-based and Coarse-grained Processor Arrays.

[BibT_eX]

,

Proceedings of The 2005 International Conference on Engineering of Reconfigurable Systems and Algorithms, 2005

Using Symbolic Feasibility Tests during Design Space Exploration of Heterogeneous Multi-Processor Systems.

[BibT_eX]

[DOI]

Thomas Schlichter

,

Christian Haubelt

,

,

Proceedings of the 16th IEEE International Conference on Application-Specific Systems, 2005

2004

High-Speed Event-Driven RTL Compiled Simulation.

[BibT_eX]

[DOI]

Alexey Kupriyanov

,

,

Proceedings of the Computer Systems: Architectures, 2004

Dynamic Piecewise Linear/Regular Algorithms.

[BibT_eX]

[DOI]

,

Proceedings of the 2004 International Conference on Parallel Computing in Electrical Engineering (PARELEC 2004), 2004

Mapping of Regular Nested Loop Programs to Coarse-Grained Reconfigurable Arrays - Constraints and Methodology.

[BibT_eX]

[DOI]

,

,

Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Regular mapping for coarse-grained reconfigurable architectures.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Resource Constrained and Speculative Scheduling of an Algorithm Class with Run-Time Dependent Conditionals.

[BibT_eX]

[DOI]

,

Proceedings of the 15th IEEE International Conference on Application-Specific Systems, 2004

2002

Energy estimation of nested loop programs.

[BibT_eX]

[DOI]

,

Proceedings of the Fourteenth Annual ACM Symposium on Parallel Algorithms and Architectures, 2002

Generation of Distributed Loop Control.

[BibT_eX]

[DOI]

,

,

Proceedings of the Embedded Processor Design Challenges: Systems, Architectures, Modeling, and Simulation, 2002

2001

Design Space Exploration for Massively Parallel Processor Arrays.

[BibT_eX]

[DOI]

,

Proceedings of the Parallel Computing Technologies, 2001

Loading...