Patrick Carribault

According to our database, Patrick Carribault authored at least 45 papers between 2004 and 2022.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2022
MPI detach - Towards automatic asynchronous local completion.
Parallel Comput., 2022

Relative Performance Projection on Arm Architectures.
Proceedings of the Euro-Par 2022: Parallel Processing, 2022

2021
Communication-Aware Task Scheduling Strategy in Hybrid MPI+OpenMP Applications.
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

Enhancing Load-Balancing of MPI Applications with Workshare.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021

2020
Automatic Code Motion to Extend MPI Nonblocking Overlap Window.
Proceedings of the High Performance Computing, 2020

PARCOACH Extension for Static MPI Nonblocking and Persistent Communication Validation.
Proceedings of the 4th IEEE/ACM International Workshop on Software Correctness for HPC Applications, 2020

Preliminary Experience with OpenMP Memory Management Implementation.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

2019
Checkpoint/restart approaches for a thread-based MPI runtime.
Parallel Comput., 2019

Mixing ranks, tasks, progress and nonblocking collectives.
Proceedings of the 26th European MPI Users' Group Meeting, 2019

Detecting Non-sibling Dependencies in OpenMP Task-Based Applications.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

ScalOMP: Analyzing the Scalability of OpenMP Applications.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

Multi-valued Expression Analysis for Collective Checking.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019

2018
PARCOACH Extension for a Full-Interprocedural Collectives Verification.
Proceedings of the 2nd IEEE/ACM International Workshop on Software Correctness for HPC Applications, 2018

Transparent High-Speed Network Checkpoint/Restart in MPI.
Proceedings of the 25th European MPI Users' Group Meeting, 2018

Profile-guided scope-based data allocation method.
Proceedings of the International Symposium on Memory Systems, 2018

Efficient Communication/Computation Overlap with MPI+OpenMP Runtimes Collaboration.
Proceedings of the Euro-Par 2018: Parallel Processing, 2018

2017
Resource-Management Study in HPC Runtime-Stacking Context.
Proceedings of the 29th International Symposium on Computer Architecture and High Performance Computing, 2017

User Co-scheduling for MPI+OpenMP Applications Using OpenMP Semantics.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

Towards a Better Expressiveness of the Speedup Metric in MPI Context.
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017

2016
Introducing Task-Containers as an Alternative to Runtime-Stacking.
Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

2015
Fine-grain data management directory for OpenMP 4.0 and OpenACC.
Concurr. Comput. Pract. Exp., 2015

Correctness Analysis of MPI-3 Non-Blocking Communications in PARCOACH.
Proceedings of the 22nd European MPI Users' Group Meeting, 2015

An MPI Halo-Cell Implementation for Zero-Copy Abstraction.
Proceedings of the 22nd European MPI Users' Group Meeting, 2015

Static/Dynamic validation of MPI collective communications in multi-threaded context.
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

MPI Thread-Level Checking for MPI+OpenMP Applications.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014
PARCOACH: Combining static and dynamic validation of MPI collective communications.
Int. J. High Perform. Comput. Appl., 2014

Improving MPI communication overlap with collaborative polling.
Computing, 2014

Optimizing Collective Operations in Hybrid Applications.
Proceedings of the 21st European MPI Users' Group Meeting, 2014

Static Validation of Barriers and Worksharing Constructs in OpenMP Applications.
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

Evaluation of OpenMP Task Scheduling Algorithms for Large NUMA Architectures.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
Combining static and dynamic validation of MPI collective communications.
Proceedings of the 20th European MPI Users' Group Meeting, 2013

Data-Management Directory for OpenMP 4.0 and OpenACC.
Proceedings of the Euro-Par 2013: Parallel Processing Workshops, 2013

2012
Adaptive OpenMP for Large NUMA Nodes.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

Hierarchical Local Storage: Exploiting Flexible User-Data Sharing Between MPI Tasks.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011
Thread-Local Storage Extension to Support Thread-Based MPI/OpenMP Applications.
Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011

2010
Enabling Low-Overhead Hybrid MPI/OpenMP Parallelism with MPC.
Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010

User level DB: a debugging API for user-level thread libraries.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009
MPC-MPI: An MPI Implementation Reducing the Overall Memory Consumption.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

2008
Scheduling strategies for optimistic parallel execution of irregular programs.
Proceedings of the 20th Annual ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 2008), 2008

2007
Deep Jam: Conversion of Coarse-Grain Parallelism to Fine-Grain and Vector Parallelism.
J. Instr. Level Parallelism, 2007

Loop Optimization using Hierarchical Compilation and Kernel Decomposition.
Proceedings of the Fifth International Symposium on Code Generation and Optimization (CGO 2007), 2007

2005
Collisions of SHA-0 and Reduced SHA-1.
Proceedings of the Advances in Cryptology, 2005

Deep Jam: Conversion of Coarse-Grain Parallelism to Instruction-Level and Vector Parallelism for Irregular Applications.
Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT 2005), 2005

2004
Branch Strategies to Optimize Decision Trees for Wide-Issue Architectures.
Proceedings of the Languages and Compilers for High Performance Computing, 2004

Applications of storage mapping optimization to register promotion.
Proceedings of the 18th Annual International Conference on Supercomputing, 2004

