Márcio Machado Pereira

According to our database, Márcio Machado Pereira authored at least 22 papers between 1989 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Advancing Direct Convolution Using Convolution Slicing Optimization and ISA Extensions.
ACM Trans. Archit. Code Optim., December, 2023

Tensor slicing and optimization for multicore NPUs.
J. Parallel Distributed Comput., May, 2023

2022
Implementing the Broadcast Operation in a Distributed Task-based Runtime.
Proceedings of the International Symposium on Computer Architecture and High Performance Computing Workshops, 2022

An OpenMP-only Linear Algebra Library for Distributed Architectures.
Proceedings of the International Symposium on Computer Architecture and High Performance Computing Workshops, 2022

The OpenMP Cluster Programming Model.
Proceedings of the Workshop Proceedings of the 51st International Conference on Parallel Processing, 2022

Improving Convolution via Cache Hierarchy Tiling and Reduced Packing.
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022

2021
Enabling OpenMP Task Parallelism on Multi-FPGAs.
Proceedings of the 29th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2021

2020
OmpTracing: Easy Profiling of OpenMP Programs.
Proceedings of the 32nd IEEE International Symposium on Computer Architecture and High Performance Computing, 2020

2019
Data-flow analysis and optimization for data coherence in heterogeneous architectures.
J. Parallel Distributed Comput., 2019

2018
DOACROSS Parallelization Based on Component Annotation and Loop-Carried Probability.
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

Automatic Offloading of Cluster Accelerators.
Proceedings of the 26th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2018

2017
DawnCC: Automatic Annotation for Data Parallelism and Offloading.
ACM Trans. Archit. Code Optim., 2017

Automatic Scan Parallelization in OpenMP.
Proceedings of the 2017 International Symposium on Computer Architecture and High Performance Computing Workshops, 2017

Data Coherence Analysis and Optimization for Heterogeneous Computing.
Proceedings of the 29th International Symposium on Computer Architecture and High Performance Computing, 2017

Compiling and Optimizing OpenMP 4.X Programs to OpenCL and SPIR.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

2016
Study of hardware transactional memory characteristics and serialization policies on Haswell.
Parallel Comput., 2016

Automatic Insertion of Copy Annotation in Data-Parallel Programs.
Proceedings of the 28th International Symposium on Computer Architecture and High Performance Computing, 2016

2015
Técnicas de escalonamento e serialização para memórias transacionais.
PhD thesis, 2015

2014
Multi-dimensional Evaluation of Haswell's Transactional Memory Performance.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Measuring Effective Work to Reward Success in Dynamic Transaction Scheduling.
Proceedings of the 43rd International Conference on Parallel Processing, 2014

2013
Transaction scheduling using conflict avoidance and Contention Intensity.
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

1989
A Linguagem de Programação CHILL.
Proceedings of the 3rd Brazilian Symposium on Software Engineering, 1989

