Patrick Carribault

Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024

Investigating parallel execution of quantum Machine Learning circuits on superconducting hardware.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

Fully Integrated Quantum Method for Classical Register Allocation in LLVM.

[BibT_eX]

[DOI]

Brice Chichereau

Stéphane Vialle

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2024

To Share or Not to Share: A Case for MPI in Shared-Memory.

[BibT_eX]

[DOI]

Julien Adam

Proceedings of the Recent Advances in the Message Passing Interface, 2024

Measuring and Interpreting Dependent Task-Based Applications Performances.

[BibT_eX]

[DOI]

Proceedings of the Parallel Processing and Applied Mathematics, 2024

An Overview on Mixing MPI and OpenMP Dependent Tasking on A64FX.

[BibT_eX]

[DOI]

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops, 2024

2023

Experimenting with Hybrid Quantum Optimization in HPC Software Stack for CPU Register Allocation.

[BibT_eX]

[DOI]

Brice Chichereau

Stéphane Vialle

Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

Suspending OpenMP Tasks on Asynchronous Events: Extending the Taskwait Construct.

[BibT_eX]

[DOI]

Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023

Investigating Dependency Graph Discovery Impact on Task-based MPI+OpenMP Applications Performances.

[BibT_eX]

[DOI]

Proceedings of the 52nd International Conference on Parallel Processing, 2023

2022

MPI detach - Towards automatic asynchronous local completion.

[BibT_eX]

[DOI]

Parallel Comput., 2022

Enhancing MPI+OpenMP Task Based Applications for Heterogeneous Architectures with GPU Support.

[BibT_eX]

[DOI]

Luiz Angelo Steffenel

Thierry Gautier

Proceedings of the OpenMP in a Modern World: From Multi-device Support to Meta Programming, 2022

Relative Performance Projection on Arm Architectures.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2022: Parallel Processing, 2022

2021

Communication-Aware Task Scheduling Strategy in Hybrid MPI+OpenMP Applications.

[BibT_eX]

[DOI]

Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

Enhancing Load-Balancing of MPI Applications with Workshare.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2021: Parallel Processing, 2021

2020

Automatic Code Motion to Extend MPI Nonblocking Overlap Window.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2020

PARCOACH Extension for Static MPI Nonblocking and Persistent Communication Validation.

[BibT_eX]

[DOI]

Proceedings of the 4th IEEE/ACM International Workshop on Software Correctness for HPC Applications, 2020

Preliminary Experience with OpenMP Memory Management Implementation.

[BibT_eX]

[DOI]

Adrien Roussel

Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

2019

Checkpoint/restart approaches for a thread-based MPI runtime.

[BibT_eX]

[DOI]

Julien Adam

Maxime Kermarquer

Leonardo Bautista-Gomez

Parallel Comput., 2019

Mixing ranks, tasks, progress and nonblocking collectives.

[BibT_eX]

[DOI]

Proceedings of the 26th European MPI Users' Group Meeting, 2019

Detecting Non-sibling Dependencies in OpenMP Task-Based Applications.

[BibT_eX]

[DOI]

Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

ScalOMP: Analyzing the Scalability of OpenMP Applications.

[BibT_eX]

[DOI]

Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

Multi-valued Expression Analysis for Collective Checking.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2019: Parallel Processing, 2019

2018

PARCOACH Extension for a Full-Interprocedural Collectives Verification.

[BibT_eX]

[DOI]

Proceedings of the 2nd IEEE/ACM International Workshop on Software Correctness for HPC Applications, 2018

Transparent High-Speed Network Checkpoint/Restart in MPI.

[BibT_eX]

[DOI]

Julien Adam

Proceedings of the 25th European MPI Users' Group Meeting, 2018

Profile-guided scope-based data allocation method.

[BibT_eX]

[DOI]

Proceedings of the International Symposium on Memory Systems, 2018

Efficient Communication/Computation Overlap with MPI+OpenMP Runtimes Collaboration.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2018: Parallel Processing, 2018

2017

Resource-Management Study in HPC Runtime-Stacking Context.

[BibT_eX]

[DOI]

Proceedings of the 29th International Symposium on Computer Architecture and High Performance Computing, 2017

User Co-scheduling for MPI+OpenMP Applications Using OpenMP Semantics.

[BibT_eX]

[DOI]

Antoine Capra

Allen D. Malony

Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

Towards a Better Expressiveness of the Speedup Metric in MPI Context.

[BibT_eX]

[DOI]

Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017

2016

Introducing Task-Containers as an Alternative to Runtime-Stacking.

[BibT_eX]

[DOI]

Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016

2015

Fine-grain data management directory for OpenMP 4.0 and OpenACC.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2015

Correctness Analysis of MPI-3 Non-Blocking Communications in PARCOACH.

[BibT_eX]

[DOI]

Proceedings of the 22nd European MPI Users' Group Meeting, 2015

An MPI Halo-Cell Implementation for Zero-Copy Abstraction.

[BibT_eX]

[DOI]

Proceedings of the 22nd European MPI Users' Group Meeting, 2015

Static/Dynamic validation of MPI collective communications in multi-threaded context.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015

MPI Thread-Level Checking for MPI+OpenMP Applications.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014

PARCOACH: Combining static and dynamic validation of MPI collective communications.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2014

Optimizing Collective Operations in Hybrid Applications.

[BibT_eX]

[DOI]

Proceedings of the 21st European MPI Users' Group Meeting, 2014

Static Validation of Barriers and Worksharing Constructs in OpenMP Applications.

[BibT_eX]

[DOI]

Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

Evaluation of OpenMP Task Scheduling Algorithms for Large NUMA Architectures.

[BibT_eX]

[DOI]

Jérôme Clet-Ortega

Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013

Combining static and dynamic validation of MPI collective communications.

[BibT_eX]

[DOI]

Proceedings of the 20th European MPI Users's Group Meeting, 2013

Data-Management Directory for OpenMP 4.0 and OpenACC.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2013: Parallel Processing Workshops, 2013

2012

Improving MPI Communication Overlap with Collaborative Polling.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2012

Adaptive OpenMP for Large NUMA Nodes.

[BibT_eX]

[DOI]

Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

Hierarchical Local Storage: Exploiting Flexible User-Data Sharing Between MPI Tasks.

[BibT_eX]

[DOI]

Marc Tchiboukdjian

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011

Thread-Local Storage Extension to Support Thread-Based MPI/OpenMP Applications.

[BibT_eX]

[DOI]

Hervé Jourdren

Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011

2010

Enabling Low-Overhead Hybrid MPI/OpenMP Parallelism with MPC.

[BibT_eX]

[DOI]

Hervé Jourdren

Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010

User level DB: a debugging API for user-level thread libraries.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009

MPC-MPI: An MPI Implementation Reducing the Overall Memory Consumption.

[BibT_eX]

[DOI]

Hervé Jourdren

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

2008

Scheduling strategies for optimistic parallel execution of irregular programs.

[BibT_eX]

[DOI]

Proceedings of the SPAA 2008: Proceedings of the 20th Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2008

2007

Deep Jam: Conversion of Coarse-Grain Parallelism to Fine-Grain and Vector Parallelism.

[BibT_eX]

[DOI]

J. Instr. Level Parallelism, 2007

Loop Optimization using Hierarchical Compilation and Kernel Decomposition.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Symposium on Code Generation and Optimization (CGO 2007), 2007

2005

Collisions of SHA-0 and Reduced SHA-1.

[BibT_eX]

[DOI]

Proceedings of the Advances in Cryptology, 2005

Deep Jam: Conversion of Coarse-Grain Parallelism to Instruction-Level and Vector Parallelism for Irregular Applications.

[BibT_eX]

[DOI]

Albert Cohen

William Jalby

Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques (PACT 2005), 2005

2004

Branch Strategies to Optimize Decision Trees for Wide-Issue Architectures.

[BibT_eX]

[DOI]

Christophe Lemuet

Jean-Thomas Acquaviva

Albert Cohen

William Jalby

Proceedings of the Languages and Compilers for High Performance Computing, 2004

Applications of storage mapping optimization to register promotion.

[BibT_eX]

[DOI]