Raymond Namyst

ORCID: 0000-0001-7734-1258

According to our database, Raymond Namyst authored at least 81 papers between 1994 and 2023.

Bibliography

2023
Programming heterogeneous architectures using hierarchical tasks.
Concurr. Comput. Pract. Exp., 2023

2022
Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The TEXTAROSSA approach.
Microprocess. Microsystems, November, 2022

SimSGamE: Scheduling simulator for modern game engines.
J. Open Source Softw., 2022

Peachy Parallel Assignments (EduPar 2022).
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Exploring Scheduling Algorithms for Parallel Task Graphs: A Modern Game Engine Case Study.
Proceedings of the Euro-Par 2022: Parallel Processing, 2022

2021
EasyPAP: A framework for learning parallel programming.
J. Parallel Distributed Comput., 2021

SPAWN: An Iterative, Potentials-Based, Dynamic Scheduling and Partitioning Tool.
Int. J. Parallel Program., 2021


2020
AMR-based molecular dynamics for non-uniform, highly dynamic particle simulations.
Comput. Phys. Commun., 2020

2019
Resource aggregation for task-based Cholesky Factorization on top of modern architectures.
Parallel Comput., 2019

2018
EXA2PRO programming environment: architecture and applications.
Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, 2018

Combining Task-based Parallelism and Adaptive Mesh Refinement Techniques in Molecular Dynamics Simulations.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
Resource-Management Study in HPC Runtime-Stacking Context.
Proceedings of the 29th International Symposium on Computer Architecture and High Performance Computing, 2017

2016
Resource Aggregation for Task-Based Cholesky Factorization on Top of Heterogeneous Machines.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

2015
Automatic OpenCL Code Generation for Multi-device Heterogeneous Architectures.
Proceedings of the 44th International Conference on Parallel Processing, 2015

2014
Composing multiple StarPU applications over heterogeneous machines: A supervised approach.
Int. J. High Perform. Comput. Appl., 2014

A Runtime Approach to Dynamic Resource Allocation for Sparse Direct Solvers.
Proceedings of the 43rd International Conference on Parallel Processing, 2014

Dynamic Load Balancing with Pair Potentials.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

Toward OpenCL Automatic Multi-Device Support.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

ExaStamp: A Parallel Framework for Molecular Dynamics on Heterogeneous Clusters.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013
Adaptive Task Size Control on High Level Programming for GPU/CPU Work Sharing.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013

High Performance Code Generation for Stencil Computation on Heterogeneous Multi-device Architectures.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013

2012
Poster: Leveraging PEPPHER Technology for Performance Portable Supercomputing.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Leveraging PEPPHER Technology for Performance Portable Supercomputing.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012



StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators.
Proceedings of the Recent Advances in the Message Passing Interface, 2012

High-Level Support for Pipeline Parallelism on Many-Core Architectures.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

Programmability and performance portability aspects of heterogeneous multi-/manycore systems.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

2011
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures.
Concurr. Comput. Pract. Exp., 2011

The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous many-core Architectures.
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

A Sampling-Based Approach for Communication Libraries Auto-Tuning.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

EZTrace: A Generic Framework for Performance Analysis.
Proceedings of the 11th IEEE/ACM International Symposium on Cluster, 2011

2010
ForestGOMP: An Efficient OpenMP Environment for NUMA Architectures.
Int. J. Parallel Program., 2010

Adaptive MPI Multirail Tuning for Non-uniform Input/Output Access.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications.
Proceedings of the 18th Euromicro Conference on Parallel, 2010

Optimizing MPI communication within large multicore nodes with kernel assistance.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Structuring the execution of OpenMP applications for multicore architectures.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Data-Aware Task Scheduling on Multi-accelerator Based Platforms.
Proceedings of the 16th IEEE International Conference on Parallel and Distributed Systems, 2010

2009
Exploiting the Cell/BE Architecture with the StarPU Unified Runtime System.
Proceedings of the Embedded Computer Systems: Architectures, 2009

Dynamic Task and Data Placement over NUMA Architectures: An OpenMP Runtime Perspective.
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

Automatic Calibration of Performance Models on Heterogeneous Multicore Architectures.
Proceedings of the Euro-Par 2009, 2009

2008
BubbleSched, plate-forme de conception d'ordonnanceurs de threads sur machines hiérarchiques.
Tech. Sci. Informatiques, 2008

NewMadeleine, ordonnancement et optimisation de schémas de communication haute performance.
Tech. Sci. Informatiques, 2008

Scheduling Dynamic OpenMP Applications over Multicore Architectures.
Proceedings of the OpenMP in a New Era of Parallelism, 4th International Workshop, 2008

A multithreaded communication engine for multicore architectures.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

MPC: A Unified Parallel Runtime for Clusters of NUMA Machines.
Proceedings of the Euro-Par 2008, 2008

Efficient Shared Memory Message Passing for Inter-VM Communications.
Proceedings of the Euro-Par 2008 Workshops, 2008

A Unified Runtime System for Heterogeneous Multi-core Architectures.
Proceedings of the Euro-Par 2008 Workshops, 2008

2007
An Efficient OpenMP Runtime System for Hierarchical Arch
CoRR, 2007

Improving Reactivity and Communication Overlap in MPI Using a Generic I/O Manager.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

An Efficient OpenMP Runtime System for Hierarchical Architectures.
Proceedings of the A Practical Programming Model for the Multi-Core Era, 2007

High-Performance Multi-Rail Support with the NEWMADELEINE Communication Library.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

NEW MADELEINE: a Fast Communication Scheduling Engine for High Performance Networks.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Building Portable Thread Schedulers for Hierarchical Multiprocessors: The BubbleSched Framework.
Proceedings of the Euro-Par 2007, 2007

2006
Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed.
Int. J. High Perform. Comput. Appl., 2006

Short Paper: Dynamic Optimization of Communications over High Speed Networks.
Proceedings of the 15th IEEE International Symposium on High Performance Distributed Computing, 2006

2005
Grid'5000: a large scale and highly reconfigurable grid experimental testbed.
Proceedings of the 6th IEEE/ACM International Conference on Grid Computing (GRID 2005), 2005

An Efficient Multi-level Trace Toolkit for Multi-threaded Applications.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

2003
Controlling Kernel Scheduling from User Space: An Approach to Enhancing Applications' Reactivity to I/O Events.
Proceedings of the High Performance Computing - HiPC 2003, 10th International Conference, 2003

2002
Madeleine II: a portable and efficient communication library for high-performance cluster computing.
Parallel Comput., 2002

High Performance Computing on Heterogeneous Clusters with the Madeleine II Communication Library.
Clust. Comput., 2002

Improving Reactivity to I/O Events in Multithreaded Environments Using a Uniform, Scheduler-Centric API.
Proceedings of the Euro-Par 2002, 2002

2001
The Hyperion system: Compiling multithreaded Java bytecode for distributed execution.
Parallel Comput., 2001

MPICH/Madeleine: a True Multi-Protocol MPI for High Performance Networks.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

Efficient Inter-Device Data-Forwarding in the Madeleine Communication Library.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

2000
Compiling Data-Parallel Programs to a Distributed Runtime Environment with Thread Isomigration.
Parallel Process. Lett., 2000

Using the VI Architecture to Build Distributed, Multithreaded Runtime Systems: A Case Study.
Proceedings of the Applied Computing 2000, 2000

Integrating Kernel Activations in a Multithreaded Runtime System on Top of LINUX.
Proceedings of the Parallel and Distributed Processing, 2000

A Portable and Adaptative Multi-protocol Communication Library for Multithreaded Runtime Systems.
Proceedings of the Parallel and Distributed Processing, 2000

Implementing Java Consistency Using a Generic, Multithreaded DSM Runtime System.
Proceedings of the Parallel and Distributed Processing, 2000

Compiling Multithreaded Java Bytecode for Distributed Execution (Distinguished Paper).
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

Madeleine II: a Portable and Efficient Communication Library for High-Performance Cluster Computing.
Proceedings of the 2000 IEEE International Conference on Cluster Computing (CLUSTER 2000), November 28th, 2000

1999
Technology transfer within the ProHPC TTN at ENS Lyon.
Future Gener. Comput. Syst., 1999

Efficient Communications in Multithreaded Runtime Systems.
Proceedings of the Parallel and Distributed Processing, 1999

An Efficient and Transparent Thread Migration Scheme in the PM2 Runtime System.
Proceedings of the Parallel and Distributed Processing, 1999

1998
Madeleine: An Efficient and Portable Communication Interface for RPC-Based Multithreaded Environments.
Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, 1998

A Multithreaded Runtime Environment with Thread Migration for a HPF Data-Parallel Compiler.
Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques, 1998

1997
Architecture Virtualization with Mobile Threads.
Proceedings of the Parallel Computing: Fundamentals, 1997

1995
PM2: Parallel Multithreaded Machine. A Computing Environment for Distributed Architectures.
Proceedings of the Parallel Computing: State-of-the-Art and Perspectives, 1995

1994
Object Spaces, Cooperation Spaces and Groups.
Proceedings of the 6th ACM SIGOPS European Workshop: Matching Operating Systems to Application Needs, 1994
