João P. L. de Carvalho
Orcid: 0000-0002-3476-184X
  According to our database, João P. L. de Carvalho authored at least 27 papers between 2012 and 2025.
  
  
Bibliography
  2025
Scalar Interpolation: A Better Balance between Vector and Scalar Execution for SuperScalar Architectures.
    
  
    Proceedings of the 23rd ACM/IEEE International Symposium on Code Generation and Optimization, 2025
    
  
  2024
    Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction, 2024
    
  
  2023
Advancing Direct Convolution Using Convolution Slicing Optimization and ISA Extensions.
    
  
    ACM Trans. Archit. Code Optim., December, 2023
    
  
Fast matrix multiplication via compiler-only layered data reorganization and intrinsic lowering.
    
  
    Softw. Pract. Exp., September, 2023
    
  
    ACM Trans. Archit. Code Optim., March, 2023
    
  
    J. Parallel Distributed Comput., March, 2023
    
  
    Proceedings of the International Symposium on Computer Architecture and High Performance Computing Workshops, 2023
    
  
    Proceedings of the 21st ACM/IEEE International Symposium on Code Generation and Optimization, 2023
    
  
Efficient Auto-Vectorization for Control-flow Dependent Loops through Data Permutation.
    
  
    Proceedings of the 33rd Annual International Conference on Computer Science and Software Engineering, 2023
    
  
Stub Folding: Retaining Type Specialization to Increase the Efficiency of Highly Polymorphic Inline Caches.
    
  
    Proceedings of the 33rd Annual International Conference on Computer Science and Software Engineering, 2023
    
  
  2022
Vectorizing divergent control flow with active-lane consolidation on long-vector architectures.
    
  
    J. Supercomput., 2022
    
  
    ACM Trans. Archit. Code Optim., 2022
    
  
    Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022
    
  
  2021
    ACM Trans. Archit. Code Optim., 2021
    
  
Pooling Acceleration in the DaVinci Architecture Using Im2col and Col2im Instructions.
    
  
    Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021
    
  
    Proceedings of the Euro-Par 2021: Parallel Processing, 2021
    
  
  2020
An efficient parallel implementation for training supervised optimum-path forest classifiers.
    
  
    Neurocomputing, 2020
    
  
    Proceedings of the Companion of the 2020 ACM/SPEC International Conference on Performance Engineering, 2020
    
  
    Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020
    
  
    Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020
    
  
    Proceedings of the Euro-Par 2020: Parallel Processing, 2020
    
  
  2019
    IEEE Trans. Parallel Distributed Syst., 2019
    
  
  2018
    Proceedings of the Symposium on High Performance Computing Systems, 2018
    
  
    Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018
    
  
  2017
    Proceedings of the International Conference on Supercomputing, 2017
    
  
  2012
    Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012