Biagio Cosenza

Orcid: 0000-0002-8869-6705

According to our database1, Biagio Cosenza authored at least 52 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SYCL-Bench 2020: Benchmarking SYCL 2020 on AMD, Intel, and NVIDIA GPUs.
Proceedings of the 12th International Workshop on OpenCL and SYCL, 2024

Unlocking performance portability on LUMI-G supercomputer: A virtual screening case study.
Proceedings of the 12th International Workshop on OpenCL and SYCL, 2024

2023
Improving computation efficiency using input and architecture features for a virtual screening application.
CoRR, 2023

SYnergy: Fine-grained Energy-Efficient Heterogeneous Computing for Scalable Energy Saving.
Proceedings of the International Conference for High Performance Computing, 2023

Domain-Specific Energy Modeling for Drug Discovery and Magnetohydrodynamics Applications.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Towards a SYCL API for Approximate Computing.
Proceedings of the 2023 International Workshop on OpenCL, 2023

Algorithm Selection of MPI Collectives Considering System Utilization.
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023


An Asynchronous Dataflow-Driven Execution Model For Distributed Accelerator Computing.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023

EMPI: Enhanced Message Passing Interface in Modern C++.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023

2022
Celerity: How (Well) Does the SYCL API Translate to Distributed Clusters?
Proceedings of the IWOCL'22: International Workshop on OpenCL, Bristol, United Kingdom, May 10, 2022

Towards a Portable Drug Discovery Pipeline with SYCL 2020.
Proceedings of the IWOCL'22: International Workshop on OpenCL, Bristol, United Kingdom, May 10, 2022

An Analysis of Performance Variability on Dragonfly+topology.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

FLEXDP: flexible frequency scaling for energy-delay product optimization of GPU applications.
Proceedings of the CF '22: 19th ACM International Conference on Computing Frontiers, Turin, Italy, May 17, 2022

An Analysis of Long-Tailed Network Latency Distribution and Background Traffic on Dragonfly+.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2022

2021
Easy and efficient agent-based simulations with the OpenABL language and compiler.
Future Gener. Comput. Syst., 2021

ALONA: Automatic Loop Nest Approximation with Reconstruction and Space Pruning.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021


2020
Vectorization cost modeling for NEON, AVX and SVE.
Perform. Evaluation, 2020

Accurate Energy and Performance Prediction for Frequency-Scaled GPU Kernels.
Comput., 2020

SYCL-Bench: A Versatile Single-Source Benchmark Suite for Heterogeneous Computing.
Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020

SYCL-Bench: A Versatile Cross-Platform Benchmark Suite for Heterogeneous Computing.
Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019
Portable Cost Modeling for Auto-Vectorizers.
Proceedings of the 27th IEEE International Symposium on Modeling, 2019

A Performance Analysis of Vector Length Agnostic Code.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

Approximating Memory-bound Applications on Mobile GPUs.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

Predictable GPUs Frequency Scaling for Energy and Performance.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Celerity: High-Level C++ for Accelerator Clusters.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019

2018
Control Flow Vectorization for ARM NEON.
Proceedings of the 21st International Workshop on Software and Compilers for Embedded Systems, 2018

Accelerating the RICH Particle Detector Algorithm on Intel Xeon Phi.
Proceedings of the 26th Euromicro International Conference on Parallel, 2018

OpenABL: A Domain-Specific Language for Parallel and Distributed Agent-Based Simulations.
Proceedings of the Euro-Par 2018: Parallel Processing, 2018

Cost Modelling for Vectorization on ARM.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

Local memory-aware kernel perforation.
Proceedings of the 2018 International Symposium on Code Generation and Optimization, 2018

2017
Stencil Autotuning with Ordinal Regression: Extended Abstract.
Proceedings of the 20th International Workshop on Software and Compilers for Embedded Systems, 2017

Autotuning Stencil Computations with Structural Ordinal Regression Learning.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Static optimization in PHP 7.
Proceedings of the 26th International Conference on Compiler Construction, 2017

2016
An evaluation of current SIMD programming models for C++.
Proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing, 2016

2015
Spectral turning bands for efficient Gaussian random fields generation on GPUs and accelerators.
Concurr. Comput. Pract. Exp., 2015

Point Distribution Tensor Computation on Heterogeneous Systems.
Proceedings of the International Conference on Computational Science, 2015

Automatic Data Layout Optimizations for GPUs.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

Behavioral Spherical Harmonics for Long-Range Agents' Interaction.
Proceedings of the Euro-Par 2015: Parallel Processing Workshops, 2015

2014
A uniform approach for programming distributed heterogeneous computing systems.
J. Parallel Distributed Comput., 2014

Kd-Tree Based N-Body Simulations with Volume-Mass Heuristic on the GPU.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Random Fields Generation on the GPU with the Spectral Turning Bands Method.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
Automatic problem size sensitive task partitioning on heterogeneous parallel systems.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2013

An automatic input-sensitive approach for heterogeneous task partitioning.
Proceedings of the International Conference on Supercomputing, 2013

LibWater: heterogeneous distributed computing made easy.
Proceedings of the International Conference on Supercomputing, 2013

GPU Cost Estimation for Load Balancing in Parallel Ray Tracing.
Proceedings of the GRAPP & IVAPP 2013: Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications, 2013

2011
Distributed Load Balancing for Parallel Agent-Based Simulations.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011

2009
Experiences with Mesh-like computations using Prediction Binary Trees.
Scalable Comput. Pract. Exp., 2009

2008
Load Balancing in Mesh-like Computations using Prediction Binary Trees.
Proceedings of the 7th International Symposium on Parallel and Distributed Computing (ISPDC 2008), 2008

On Estimating the Effectiveness of Temporal and Spatial Coherence in Parallel Ray Tracing.
Proceedings of the Eurographics Italian Chapter Conference 2008, Salerno, Italy, 2008, 2008

A Survey on Exploiting Grids for Ray Tracing.
Proceedings of the Eurographics Italian Chapter Conference 2008, Salerno, Italy, 2008, 2008


  Loading...