George Bosilca

According to our database1, George Bosilca
  • authored at least 111 papers between 2001 and 2017.
  • has a "Dijkstra number"2 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2017
Dynamic task discovery in PaRSEC: a data-flow task-based runtime.
Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2017

Using software-based performance counters to expose low-level open MPI performance information.
Proceedings of the 24th European MPI Users' Group Meeting, 2017

Efficient Communications in Training Large Scale Neural Networks.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Online Dynamic Monitoring of MPI Communications.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

2016
Assessing the cost of redistribution followed by a computational kernel: Complexity and performance results.
Parallel Computing, 2016

Failure detection and propagation in HPC systems.
Proceedings of the International Conference for High Performance Computing, 2016

Surviving Errors with OpenSHMEM.
Proceedings of the OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, 2016

GPU-Aware Non-contiguous Data Movement In Open MPI.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

Exploiting a Parametrized Task Graph Model for the Parallelization of a Sparse Direct Multifrontal Solver.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

DSN 2016 Tutorial: Resilience for Scientific Computing: From Theory to Practice.
Proceedings of the 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2016

2015
Algorithm-Based Fault Tolerance for Dense Matrix Factorizations, Multiple Failures and Accuracy.
TOPC, 2015

Composing resilience techniques: ABFT, periodic and incremental checkpointing.
IJNC, 2015

Practical scalable consensus for pseudo-synchronous distributed systems.
Proceedings of the International Conference for High Performance Computing, 2015

Sliding Substitution of Failed Nodes.
Proceedings of the 22nd European MPI Users' Group Meeting, 2015

Plan B: Interruption of Ongoing MPI Operations to Support Failure Recovery.
Proceedings of the 22nd European MPI Users' Group Meeting, 2015

Accelerating NWChem Coupled Cluster Through Dataflow-Based Execution.
Proceedings of the Parallel Processing and Applied Mathematics, 2015

From MPI to OpenSHMEM: Porting LAMMPS.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies, 2015

Hierarchical DAG Scheduling for Hybrid Distributed Systems.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Design for a Soft Error Resilient Dynamic Task-Based Runtime.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015


PaRSEC in Practice: Optimizing a Legacy Chemistry Application through Distributed Task-Based Execution.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
An efficient distributed randomized algorithm for solving large dense symmetric indefinite linear systems.
Parallel Computing, 2014

Power profiling of Cholesky and QR factorizations on distributed memory systems.
Computer Science - R&D, 2014

Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes.
CoRR, 2014

Unified model for assessing checkpointing protocols at extreme-scale.
Concurrency and Computation: Practice and Experience, 2014

PTG: an abstraction for unhindered parallelism.
Proceedings of the Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, 2014

Optimizations to enhance sustainability of MPI applications.
Proceedings of the 21st European MPI Users' Group Meeting, 2014

A Multithreaded Communication Substrate for OpenSHMEM.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Taking Advantage of Hybrid Systems for Sparse Direct Solvers via Task-Based Runtimes.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Assessing the Impact of ABFT and Checkpoint Composite Strategies.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Task-Based Programming for Seismic Imaging: Preliminary Results.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

Assembly Operations for Multicore Architectures Using Task-Based Runtime Systems.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

Utilizing dataflow-based execution for coupled cluster methods.
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014

2013
Kernel-assisted and topology-aware MPI collective communications on multicore/many-core platforms.
J. Parallel Distrib. Comput., 2013

Post-failure recovery of MPI communication capability: Design and rationale.
IJHPCA, 2013

PaRSEC: Exploiting Heterogeneity to Enhance Scalability.
Computing in Science and Engineering, 2013

Correlated set coordination in fault tolerant message logging protocols for many-core clusters.
Concurrency and Computation: Practice and Experience, 2013

Extending the scope of the Checkpoint-on-Failure protocol for forward recovery in standard MPI.
Concurrency and Computation: Practice and Experience, 2013

An evaluation of User-Level Failure Mitigation support in MPI.
Computing, 2013

CPU-GPU hybrid bidiagonal reduction with soft error resilience.
Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2013

Parallel reduction to hessenberg form with algorithm-based fault tolerance.
Proceedings of the International Conference for High Performance Computing, 2013

Efficient parallelization of batch pattern training algorithm on many-core and cluster architectures.
Proceedings of the IEEE 7th International Conference on Intelligent Data Acquisition and Advanced Computing Systems, 2013

2012
DAGuE: A generic distributed DAG engine for High Performance Computing.
Parallel Computing, 2012



An Evaluation of User-Level Failure Mitigation Support in MPI.
Proceedings of the Recent Advances in the Message Passing Interface, 2012

Algorithm-based fault tolerance for dense matrix factorizations.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

HierKNEM: An Adaptive Framework for Kernel-Assisted and Topology-Aware Collective Communications on Many-core Clusters.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Scalable Dense Linear Algebra on Heterogeneous Hardware.
Proceedings of the Transition of HPC Towards Exascale Computing, 2012

From Serial Loops to Parallel Execution on Distributed Systems.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011
Impact of Kernel-Assisted MPI Communication over Scientific Applications: CPMD and FFTW.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

OMPIO: A Modular Software Architecture for MPI I/O.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Scalable Runtime for MPI: Efficiently Building the Communication Infrastructure.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Will MPI Remain Relevant?
Proceedings of the Recent Advances in the Message Passing Interface, 2011

DAGuE: A Generic Distributed DAG Engine for High Performance Computing.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with DPLASMA.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Kernel Assisted Collective Intra-node MPI Communication among Multi-Core and Many-Core CPUs.
Proceedings of the International Conference on Parallel Processing, 2011

The Common Communication Interface (CCI).
Proceedings of the IEEE 19th Annual Symposium on High Performance Interconnects, 2011

Correlated Set Coordination in Fault Tolerant Message Logging Protocols.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2011).
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

Process Distance-Aware Adaptive MPI Collective Communications.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

On Scalability for MPI Runtime Systems.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

Performance Portability of a GPU Enabled Factorization with the DAGuE Framework.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011

2010
Improvement of parallelization efficiency of batch pattern BP training algorithm using Open MPI.
Proceedings of the International Conference on Computational Science, 2010

Self-healing network for scalable fault-tolerant runtime environments.
Future Generation Comp. Syst., 2010

Redesigning the message logging model for high performance.
Concurrency and Computation: Practice and Experience, 2010

Locality and Topology Aware Intra-node Communication among Multicore CPUs.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

Dodging the Cost of Unavoidable Memory Copies in Message Logging Protocols.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

2009
Algorithm-based fault tolerance applied to high performance computing.
J. Parallel Distrib. Comput., 2009

Constructing Resiliant Communication Infrastructure for Runtime Environments.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Reasons for a pessimistic or optimistic message logging protocol in MPI uncoordinated failure, recovery.
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2008
Algorithmic Based Fault Tolerance Applied to High Performance Computing
CoRR, 2008

The Next Frontier.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

A Scalable Tools Communications Infrastructure.
Proceedings of the 22nd Annual International Symposium on High Performance Computing Systems and Applications (HPCS 2008), 2008

2007
Recovery Patterns for Iterative Methods in a Parallel Unstable Environment.
SIAM J. Scientific Computing, 2007

Open MPI: a High Performance, Flexible Implementation of MPI Point-to-Point Communications.
Parallel Processing Letters, 2007

MPI collective algorithm selection and quadtree encoding.
Parallel Computing, 2007

Performance analysis of MPI collective operations.
Cluster Computing, 2007

Advanced MPI Programming.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

An Evaluation of Open MPI's Matching Transport Layer on the Cray XT.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Retrospect: Deterministic Replay of MPI Applications for Interactive Distributed Debugging.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

The X-Scale Challenge.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Optimal Routing in Binomial Graph Networks.
Proceedings of the Eighth International Conference on Parallel and Distributed Computing, 2007

Self-healing in Binomial Graph Networks.
Proceedings of the On the Move to Meaningful Internet Systems 2007: OTM 2007 Workshops, 2007

Binomial Graph: A Scalable and Fault-Tolerant Logical Network Topology.
Proceedings of the Parallel and Distributed Processing and Applications, 2007

Network Fault Tolerance in Open MPI.
Proceedings of the Euro-Par 2007, 2007

Decision Trees and MPI Collective Algorithm Selection Problem.
Proceedings of the Euro-Par 2007, 2007

Topic 9 Parallel and Distributed Programming.
Proceedings of the Euro-Par 2007, 2007

Reliability Analysis of Self-Healing Network using Discrete-Event Simulation.
Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

2006
Self-adapting numerical software (SANS) effort.
IBM Journal of Research and Development, 2006

High Performance RDMA Protocols in HPC.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

MPI Collective Algorithm Selection and Quadtree Encoding.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Implementation and Usage of the PERUSE-Interface in Open MPI.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Scalable Fault Tolerant Protocol for Parallel Runtime Environments.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Open MPI: A High-Performance, Heterogeneous MPI.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

2005
Process Fault Tolerance: Semantics, Design and Applications for High Performance Computing.
IJHPCA, 2005

Hash Functions for Datatype Signatures in MPI.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Advanced Message Passing and Threading Issues.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Scalable Fault Tolerant MPI: Extending the Recovery Algorithm.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Analysis of the Component Architecture Overhead in Open MPI.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Fault tolerant high performance computing by a coding approach.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2005

Performance Analysis of MPI Collective Operations.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

2004
OVM, une machine parallèle virtuelle à exécution dans le désordre.
Technique et Science Informatiques, 2004

TEG: A High-Performance, Scalable, Multi-network Point-to-Point Communications Methodology.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Open MPI's TEG Point-to-Point Communications Methodology: Comparison to Existing Implementations.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

2002
OVM: Out-of-order execution parallel virtual machine.
Future Generation Comp. Syst., 2002

MPICH-V: toward a scalable fault tolerant MPI for volatile nodes.
Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

MPICH-CM: A Communication Library Design for a P2P MPI Implementation.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 9th European PVM/MPI Users' Group Meeting, Linz, Austria, September 29, 2002

2001
OVM: Out-of-Order Execution Parallel Virtual Machine.
Proceedings of the First IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2001), 2001


  Loading...