Filippo Mantovani

Filippo Spiga

Proceedings of the Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region Workshops, 2026

2025

RISC-V in HPC: a Look Into Tools for Performance Monitoring.

[BibT_eX]

[DOI]

Rafel Albert Bros Esqueu

Proceedings of the High Performance Computing, 2025

2024

RAVE: RISC-V Analyzer of Vector Executions, A QEMU Tracing Plugin.

[BibT_eX]

[DOI]

Proceedings of the Parallel Processing and Applied Mathematics, 2024

QR Factorization on a Long-Vector Processor.

[BibT_eX]

[DOI]

Andrés E. Tomás

Enrique S. Quintana-Ortí

Proceedings of the Parallel Processing and Applied Mathematics, 2024

Batched DGEMMs for Scientific Codes Running on Long Vector Architectures.

[BibT_eX]

[DOI]

Marta Garcia-Gasulla

Proceedings of the Parallel Processing and Applied Mathematics, 2024

Graph Computing on Long Vector Architectures (Yes, It Works!).

[BibT_eX]

[DOI]

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Exploiting long vectors with a CFD code: a co-design show case.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

NVIDIA Grace Superchip Early Evaluation for HPC Applications.

[BibT_eX]

[DOI]

Joan Vinyals-Ylla-Catala

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops, 2024

2023

HPCG on long-vector architectures: Evaluation and optimization on NEC SX-Aurora and RISC-V.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., June, 2023

Top-Down Models across CPU Architectures: Applicability and Comparison in a High-Performance Computing Environment.

[BibT_eX]

[DOI]

Marta Garcia-Gasulla

Inf., 2023

Compressed Real Numbers for AI: a case-study using a RISC-V CPU.

[BibT_eX]

[DOI]

CoRR, 2023

Acceleration with long vector architectures: Implementation and evaluation of the FFT kernel on NEC SX-Aurora and RISC-V vector extension.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2023

Software Development Vehicles to Enable Extended and Early Co-design: A RISC-V and HPC Case of Study.

[BibT_eX]

[DOI]

Vassilis Papaefstathiou

Proceedings of the High Performance Computing, 2023

Short Reasons for Long Vectors in HPC CPUs: A Study Based on RISC-V.

[BibT_eX]

[DOI]

Georgios Ieronymakis

Nikolaos Dimou

Vassilis Papaefstathiou

Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

2022

Asymmetric HMMs for Online Ball-Bearing Health Assessments.

[BibT_eX]

[DOI]

Carlos Puerto-Santana

Concha Bielza

Javier Diaz-Rozo

IEEE Internet Things J., 2022

A portable coding strategy to exploit vectorization on combustion simulations.

[BibT_eX]

[DOI]

CoRR, 2022

2021

Efficiently running SpMV on long vector architectures.

[BibT_eX]

[DOI]

Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

Accelerating FFT Using NEC SX-Aurora Vector Engine.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021

Cluster of emerging technology: evaluation of a production HPC system based on A64FX.

[BibT_eX]

[DOI]

Kilian Peiro

Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020

Runtime mechanisms to survive new HPC architectures: A use case in human respiratory simulations.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2020

Performance and energy consumption of HPC workloads on a cluster based on Arm ThunderX2 CPU.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2020

Performance study of HPC applications on an Arm-based cluster using a generic efficiency model.

[BibT_eX]

[DOI]

Kilian Peiro

Andrea Querol

Guillem Ramirez-Miranda

Proceedings of the 28th Euromicro International Conference on Parallel, 2020

Benchmarking of state-of-the-art HPC Clusters with a Production CFD Code.

[BibT_eX]

[DOI]

Proceedings of the PASC '20: Platform for Advanced Scientific Computing Conference, Geneva, Switzerland, June 29, 2020

CoreNEURON: Performance and Energy Efficiency Evaluation on Intel and Arm CPUs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019

Containers in HPC: A Scalability and Portability Study in Production Biological Simulations.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Design Space Exploration of Next-Generation HPC Machines.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Open-Source Shared Memory implementation of the HPCG benchmark: analysis, improvements and evaluation on Cavium ThunderX2.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

TensorFlow on State-of-the-Art HPC Clusters: A Machine Learning use Case.

[BibT_eX]

[DOI]

Marta Garcia-Gasulla

Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018

Efficient CFD code implementation for the ARM-based Mont-Blanc architecture.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2018

Filling the gap between education and industry: evidence-based methods for introducing undergraduate students to HPC.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE/ACM Workshop on Education for High-Performance Computing, 2018

Teaching HPC Systems and Parallel Programming with Small-Scale Clusters.

[BibT_eX]

[DOI]

Lluc Alvarez

Eduard Ayguadé

Proceedings of the 2018 IEEE/ACM Workshop on Education for High-Performance Computing, 2018

Advanced Performance Analysis of HPC Workloads on Cavium ThunderX.

[BibT_eX]

[DOI]

Enrico Calore

Daniel Ruiz

Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

Computational Fluid and Particle Dynamics Simulations for Respiratory System: Runtime Optimization on an Arm Cluster.

[BibT_eX]

[DOI]

Proceedings of the 47th International Conference on Parallel Processing, 2018

2017

Energy Analysis of a 4D Variational Data Assimilation Algorithm and Evaluation on ARM-Based HPC Systems.

[BibT_eX]

[DOI]

Proceedings of the Parallel Processing and Applied Mathematics, 2017

Implementation of the K-Means Algorithm on Heterogeneous Devices: A Use Case Based on an Industrial Dataset.

[BibT_eX]

[DOI]

Daniel Jiménez-González

Xavier Martorell

Proceedings of the Parallel Computing is Everywhere, 2017

Multi-Node Advanced Performance and Power Analysis with Paraver.

[BibT_eX]

[DOI]

Enrico Calore

Proceedings of the Parallel Computing is Everywhere, 2017

2016

The mont-blanc prototype: an alternative approach for HPC systems.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2016

2014

Janus II: A new generation application-driven computer for spin-system simulations.

[BibT_eX]

[DOI]

Comput. Phys. Commun., 2014

High Performance Computing based on embedded processors.

[BibT_eX]

[DOI]

Proceedings of the International Conference on High Performance Computing & Simulation, 2014

2013

An Optimized Lattice Boltzmann Code for BlueGene/Q.

[BibT_eX]

[DOI]

Marcello Pivanti

Luca Zenesini

Proceedings of the Parallel Processing and Applied Mathematics, 2013

Early Experience on Porting and Running a Lattice Boltzmann Code on the Xeon-Phi Co-Processor.

[BibT_eX]

[DOI]

G. Crimi

Marcello Pivanti

Proceedings of the International Conference on Computational Science, 2013

2012

Reconfigurable computing for Monte Carlo simulations: results and prospects of the Janus project

[BibT_eX]

[DOI]

CoRR, 2012

Janus2: an FPGA-based supercomputer for spin glass simulations.

[BibT_eX]

[DOI]

Proceedings of the Future HPC Systems - the Challenges of Power-Constrained Performance, 2012

Spin Glass Simulations on the Janus Architecture: A Desperate Quest for Strong Scaling.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

2011

Optimization of Multi-Phase Compressible Lattice Boltzmann Codes on Massively Parallel Multi-Core Systems.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computational Science, 2011

Lattice Boltzmann method simulations on massively parallel multi-core architectures.

[BibT_eX]

[DOI]

Fabio Pozzati

Proceedings of the 2011 Spring Simulation Multi-conference, 2011

A Multi-GPU Implementation of a D2Q37 Lattice Boltzmann Code.

[BibT_eX]

[DOI]

Proceedings of the Parallel Processing and Applied Mathematics, 2011

2010

Lattice Boltzmann fluid-dynamics on the QPACE supercomputer.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Computational Science, 2010

Monte Carlo Simulations of Spin Systems on Multi-core Processors.

[BibT_eX]

[DOI]

Proceedings of the Applied Parallel and Scientific Computing, 2010

2009

Janus: a recongurable system for scientic computing.

[BibT_eX]

[DOI]

PhD thesis, 2009

Janus: An FPGA-Based System for High-Performance Scientific Computing.

[BibT_eX]

[DOI]

Comput. Sci. Eng., 2009

Monte Carlo Simulations of Spin Glass Systems on the Cell Broadband Engine.

[BibT_eX]

[DOI]

Proceedings of the Parallel Processing and Applied Mathematics, 2009

2008

Simulating spin systems on IANUS, an FPGA-based computer.

[BibT_eX]

[DOI]

Francesco Belletti

Maria Cotallo

Andres Cruz Flor

Luis Antonio Fernandez

Juan Jesus Ruiz-Lorenzo

Comput. Phys. Commun., 2008

2007

IANUS: an FPGA-based System for High Performance Scientific Computing

[BibT_eX]

[DOI]

Francesco Belletti

Maria Cotallo

Andres Cruz Flor

Luis Antonio Fernandez

Juan Jesus Ruiz-Lorenzo

CoRR, 2007

IANUS: Scientific Computing on an FPGA-Based Architecture.

[BibT_eX]

Francesco Belletti

Maria Cotallo

Andres Cruz Flor

Luis Antonio Fernandez

Juan Jesus Ruiz-Lorenzo

Proceedings of the Parallel Computing: Architectures, 2007

2006

Ianus: An Adaptive FPGA Computer.

[BibT_eX]

[DOI]

Comput. Sci. Eng., 2006

Poster reception - IANUS: scientific computing on an FPGA-based architecture.

[BibT_eX]

[DOI]