João V. F. Lima

Orcid: 0000-0002-2670-6963

Affiliations:
  • Federal University of Santa Maria, Brazil (since 2014)
  • Federal University of Rio Grande do Sul, Porto Alegre, Brazil (PhD 2014)
  • Université Grenoble Alpes, Saint-Martin-d'Hères, France (PhD 2014)


According to our database, João V. F. Lima authored at least 24 papers between 2009 and 2023.

Bibliography

2023
An evaluation of relational and NoSQL distributed databases on a low-power cluster.
J. Supercomput., August, 2023

NAS Parallel Benchmarks with Python: a performance and programming effort analysis focusing on GPUs.
J. Supercomput., May, 2023

2022
NAS Parallel Benchmark Kernels with Python: A performance and programming effort analysis focusing on GPUs.
Proceedings of the 30th Euromicro International Conference on Parallel, 2022

Impact of Reduced and Mixed-Precision on the Efficiency of a Multi-GPU Platform on CFD Applications.
Proceedings of the Computational Science and Its Applications - ICCSA 2022 Workshops, 2022

2021
Collaborative execution of fluid flow simulation using non-uniform decomposition on heterogeneous architectures.
J. Parallel Distributed Comput., 2021

Evaluation of two topology-aware heuristics on level-3 BLAS library for multi-GPU platforms.
Proceedings of the 2021 SC Workshops Supplementary Proceedings, 2021

A Memory Affinity Analysis of Scientific Applications on NUMA Platforms.
Proceedings of the 33rd International Symposium on Computer Architecture and High Performance Computing, 2021

An evaluation of Cassandra NoSQL database on a low-power cluster.
Proceedings of the 33rd International Symposium on Computer Architecture and High Performance Computing, 2021

2020
XKBlas: a High Performance Implementation of BLAS-3 Kernels on Multi-GPU Server.
Proceedings of the 28th Euromicro International Conference on Parallel, 2020

2019
Performance and energy analysis of OpenMP runtime systems with dense linear algebra algorithms.
Int. J. High Perform. Comput. Appl., 2019

HPSM: a programming framework to exploit multi-CPU and multi-GPU systems simultaneously.
Int. J. Grid Util. Comput., 2019

Non-uniform Partitioning for Collaborative Execution on Heterogeneous Architectures.
Proceedings of the 31st International Symposium on Computer Architecture and High Performance Computing, 2019

A Dynamic Task-Based D3Q19 Lattice-Boltzmann Method for Heterogeneous Architectures.
Proceedings of the 27th Euromicro International Conference on Parallel, 2019

2018
Performance Evaluation of Deep Learning Frameworks over Different Architectures.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2018, 2018

Non-uniform Domain Decomposition for Heterogeneous Accelerated Processing Units.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2018, 2018

2017
HPSM: A Programming Framework for Multi-CPU and Multi-GPU Systems.
Proceedings of the 2017 International Symposium on Computer Architecture and High Performance Computing Workshops, 2017

2015
Design and analysis of scheduling strategies for multi-CPU and multi-GPU architectures.
Parallel Comput., 2015

2014
A Runtime System for Data-Flow Task Programming on Multicore Architectures with Accelerators. (Vers un support exécutif avec dépendance de données pour les architectures multicoeur avec des accélérateurs / Uma Ferramenta para Programação com Dependência de Dados em Arquiteturas Multicore com Aceleradores).
PhD thesis, 2014

Scheduling Data Flow Program in XKaapi: A New Affinity Based Algorithm for Heterogeneous Architectures.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
Preliminary Experiments with XKaapi on Intel Xeon Phi Coprocessor.
Proceedings of the 25th International Symposium on Computer Architecture and High Performance Computing, 2013

XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

2012
Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs.
Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012

2010
Challenges and Issues of Supporting Task Parallelism in MPI.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

2009
Online mapping of MPI-2 dynamic tasks to processes and threads.
Int. J. High Perform. Syst. Archit., 2009

