Sven Karlsson

Orcid: 0000-0003-0737-9992

Affiliations:
  • Technical University of Denmark, Lyngby, Denmark


According to our database1, Sven Karlsson authored at least 35 papers between 1998 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Design of an Application-specific VLIW Vector Processor for ORB Feature Extraction.
J. Signal Process. Syst., July, 2023

Modeling of Errors in Quantum Computers with Generated Structural Circuits.
Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023


Improving a Multigrid Poisson Solver with Peer-to-Peer Communication and Task Dependencies.
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023

OpenMP Target Offload Utilizing GPU Shared Memory.
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023


2022
Feasibility Studies in Multi-GPU Target Offloading.
Proceedings of the OpenMP in a Modern World: From Multi-device Support to Meta Programming, 2022

2021
Energy-Efficient Application-Specific Instruction-Set Processor for Feature Extraction in Smart Vision Systems.
Proceedings of the 55th Asilomar Conference on Signals, Systems, and Computers, 2021

2017
Improving Loop Dependence Analysis.
ACM Trans. Archit. Code Optim., 2017

2016
A scalable lock-free hash table with open addressing.
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016

Towards Unifying OpenMP Under the Task-Parallel Paradigm - Implementation and Performance of the taskloop Construct.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

2015
Implementation of BT-trees.
CoRR, 2015

Hardware Transactional Memory Optimization Guidelines, Applied to Ordered Maps.
Proceedings of the 2015 IEEE TrustCom/BigDataSE/ISPA, 2015

Experiences with Compiler Support for Processors with Exposed Pipelines.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

A Scalable Prescriptive Parallel Debugging Model.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014
Library Support for Resource Constrained Accelerators.
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

Hardware realization of an FPGA processor - Operating system call offload and experiences.
Proceedings of the 2014 Conference on Design and Architectures for Signal and Image Processing, 2014

A Synthesizable Multicore Platform for Microwave Imaging.
Proceedings of the Reconfigurable Computing: Architectures, Tools, and Applications, 2014

Automatic generation of application specific FPGA multicore accelerators.
Proceedings of the 48th Asilomar Conference on Signals, Systems and Computers, 2014

2013
ELB-Trees, An Efficient and Lock-free B-tree Derivative.
CoRR, 2013

Synthetic Aperture Radar Data Processing on an FPGA Multi-core System.
Proceedings of the Architecture of Computing Systems - ARCS 2013, 2013

2012
Parallelizing more Loops with Compiler Guided Refactoring.
Proceedings of the 41st International Conference on Parallel Processing, 2012

Design Principles for Synthesizable Processor Cores.
Proceedings of the Architecture of Computing Systems - ARCS 2012 - 25th International Conference, Munich, Germany, February 28, 2012

2011
Expressing Coarse-Grain Dependencies Among Tasks in Shared Memory Programs.
IEEE Trans. Ind. Informatics, 2011

SRC: FenixOS - a research operating system focused on high scalability and reliability.
Proceedings of the 25th International Conference on Supercomputing, 2011, Tucson, AZ, USA, May 31, 2011

2009
Parallelism and Scalability in an Image Processing Application.
Int. J. Parallel Program., 2009

Identifying Inter-task Communication in Shared Memory Programming Models.
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

2008
Exploiting spatial parallelism in Ethernet-based cluster interconnects.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2007
MultiEdge: An Edge-based Communication Subsystem for Scalable Commodity Servers.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2005
An Introduction to Balder - An OpenMP Run-time Library for Clusters of SMPs.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

2004
Shared Memory and OpenMP on Clusters.
PhD thesis, 2004

2003
Priority Based Messaging for Software Distributed Shared Memory.
Clust. Comput., 2003

2002
A Fully Compliant OpenMP Implementationon Software Distributed Shared Memory.
Proceedings of the High Performance Computing, 2002

1999
Producer-Push - A Protocol Enhancement to Page-Based Software Distributed Shared Memory Systems.
Proceedings of the International Conference on Parallel Processing 1999, 1999

1998
A Comparative Characterization of Communication Patterns in Applications Using MPI and Shared Memory on an IBM SP2.
Proceedings of the Network-Based Parallel Computing: Communication, 1998


  Loading...