We stand with Ukraine

We stand with Ukraine

Julien Langou

Orcid: 0000-0002-7803-1822

Affiliations:

University of Colorado Denver

According to our database¹, Julien Langou authored at least 72 papers between 2003 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

On csauthors.net:

Bibliography

2025

2024 NSF CSSI-Cybertraining-SCIPE PI Meeting August 12 to 13, 2024, Charlotte, NC.

[DOI]

CoRR, July, 2025

2024

A new deflation criterion for the QZ algorithm.

[DOI]

,

,

Numer. Linear Algebra Appl., January, 2024

Communication efficient application of sequences of planar rotations to a matrix.

[DOI]

,

CoRR, 2024

Probabilistic Analysis of Least Squares, Orthogonal Projection, and QR Factorization Algorithms Subject to Gaussian Noise.

[DOI]

,

,

Mohammad Meysami

CoRR, 2024

Tightening I/O Lower Bounds through the Hourglass Dependency Pattern.

[DOI]

Lionel Eyraud-Dubois

,

Guillaume Iooss

,

,

Fabrice Rastello

Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures, 2024

2022

Low-synch Gram-Schmidt with delayed reorthogonalization for Krylov solvers.

[DOI]

,

,

Stephen J. Thomas

,

Kasia Swirydowicz

,

Ichitaro Yamazaki

,

Parallel Comput., 2022

Numerical analysis of Givens rotation.

[DOI]

Weslley da Silva Pereira

,

,

CoRR, 2022

I/O-Optimal Algorithms for Symmetric Linear Algebra Kernels.

[DOI]

Olivier Beaumont

,

Lionel Eyraud-Dubois

,

,

Mathieu Vérité

Proceedings of the SPAA '22: 34th ACM Symposium on Parallelism in Algorithms and Architectures, Philadelphia, PA, USA, July 11, 2022

Symmetric Block-Cyclic Distribution: Fewer Communications Leads to Faster Dense Cholesky Factorization.

[DOI]

Olivier Beaumont

,

Philippe Duchon

,

Lionel Eyraud-Dubois

,

,

Mathieu Vérité

Proceedings of the SC22: International Conference for High Performance Computing, 2022

Proposed Consistent Exception Handling for the BLAS and LAPACK.

[DOI]

,

Jack J. Dongarra

,

,

,

,

,

,

Weslley S. Pereira

,

,

Cindy Rubio-González

Proceedings of the Sixth IEEE/ACM International Workshop on Software Correctness for HPC Applications, 2022

2021

Low synchronization Gram-Schmidt and generalized minimal residual algorithms.

[DOI]

Katarzyna Swirydowicz

,

,

Shreyas Ananthan

,

Ulrike Meier Yang

,

Stephen J. Thomas

Numer. Linear Algebra Appl., 2021

2020

Automated derivation of parametric data movement lower bounds for affine programs.

[DOI]

,

,

Louis-Noël Pouchet

,

,

Fabrice Rastello

Proceedings of the 41st ACM SIGPLAN International Conference on Programming Language Design and Implementation, 2020

A Comparison of Several Fault-Tolerance Methods for the Detection and Correction of Floating-Point Errors in Matrix-Matrix Multiplication.

[DOI]

Valentin Le Fèvre

,

Thomas Hérault

,

,

Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020

A Makespan Lower Bound for the Tiled Cholesky Factorization Based on ALAP Schedule.

[DOI]

Olivier Beaumont

,

,

,

Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2018

Low synchronization GMRES algorithms.

[DOI]

Kasia Swirydowicz

,

,

Shreyas Ananthan

,

Ulrike Meier Yang

,

Stephen J. Thomas

CoRR, 2018

2017

Bidiagonalization and R-Bidiagonalization: Parallel Tiled Algorithms, Critical Paths and Distributed-Memory Implementation.

[DOI]

Mathieu Faverge

,

,

,

Jack J. Dongarra

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Fast Parallel Randomized QR with Column Pivoting Algorithms for Reliable Low-Rank Matrix Approximations.

[DOI]

,

,

Proceedings of the 24th IEEE International Conference on High Performance Computing, 2017

2016

A backward/forward recovery approach for the preconditioned conjugate gradient method.

[DOI]

Massimiliano Fasi

,

,

,

J. Comput. Sci., 2016

Bidiagonalization with Parallel Tiled Algorithms.

[DOI]

Mathieu Faverge

,

,

,

Jack J. Dongarra

CoRR, 2016

2015

Mixing LU and QR factorization algorithms to design high-performance dense linear algebra solvers.

[DOI]

Mathieu Faverge

,

Julien Herrmann

,

,

Bradley R. Lowery

,

,

Jack J. Dongarra

J. Parallel Distributed Comput., 2015

A Makespan Lower Bound for the Scheduling of the Tiled Cholesky Factorization based on ALAP scheduling.

[DOI]

,

CoRR, 2015

2014

Designing LU-QR Hybrid Solvers for Performance and Stability.

[DOI]

Mathieu Faverge

,

Julien Herrmann

,

,

Bradley R. Lowery

,

,

Jack J. Dongarra

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

2013

Level-3 Cholesky Factorization Routines Improve Performance of Many Cholesky Algorithms.

[DOI]

Fred G. Gustavson

,

Jerzy Wasniewski

,

Jack J. Dongarra

,

José R. Herrero

,

ACM Trans. Math. Softw., 2013

Hierarchical QR factorization algorithms for multi-core clusters.

[DOI]

Jack J. Dongarra

,

Mathieu Faverge

,

Thomas Hérault

,

Mathias Jacquelin

,

,

Parallel Comput., 2013

A Greedy Algorithm for Optimally Pipelining a Reduction.

[DOI]

Bradley R. Lowery

,

CoRR, 2013

Topic 10: Parallel Numerical Algorithms - (Introduction).

[DOI]

,

Matthias Bolten

,

,

Marián Vajtersic

Proceedings of the Euro-Par 2013 Parallel Processing, 2013

2012

Communication-optimal Parallel and Sequential QR and LU Factorizations.

[DOI]

,

,

,

SIAM J. Sci. Comput., 2012

Flexible Variants of Block Restarted GMRES Methods with Application to Geophysics.

[DOI]

,

,

,

,

SIAM J. Sci. Comput., 2012

Poster: Matrices over Runtime Systems at Exascale.

[DOI]

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Matrices Over Runtime Systems at Exascale.

[DOI]

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems.

[DOI]

Jack J. Dongarra

,

Mathieu Faverge

,

Thomas Hérault

,

,

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011

Any admissible cycle-convergence behavior is possible for restarted GMRES at its initial cycles.

[DOI]

Eugene Vecharynski

,

Numer. Linear Algebra Appl., 2011

QCG-OMPI: MPI applications on grids.

[DOI]

Emmanuel Agullo

,

,

Thomas Hérault

,

,

Sylvain Peyronnet

,

,

Franck Cappello

,

Jack J. Dongarra

Future Gener. Comput. Syst., 2011

Tiled QR factorization algorithms.

[DOI]

Henricus Bouwmeester

,

Mathias Jacquelin

,

,

Proceedings of the Conference on High Performance Computing Networking, 2011

Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with DPLASMA.

[DOI]

,

Aurélien Bouteiller

,

Anthony Danalis

,

Mathieu Faverge

,

,

Thomas Hérault

,

,

,

Pierre Lemarinier

,

,

,

,

Jack J. Dongarra

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

LU factorization for accelerator-based systems.

[DOI]

Emmanuel Agullo

,

Cédric Augonnet

,

Jack J. Dongarra

,

Mathieu Faverge

,

,

,

Stanimire Tomov

Proceedings of the 9th IEEE/ACS International Conference on Computer Systems and Applications, 2011

2010

Rectangular full packed format for cholesky's algorithm: factorization, solution, and inversion.

[DOI]

Fred G. Gustavson

,

Jerzy Wasniewski

,

Jack J. Dongarra

,

ACM Trans. Math. Softw., 2010

The Cycle-Convergence of Restarted GMRES for Normal Matrices Is Sublinear.

[DOI]

Eugene Vecharynski

,

SIAM J. Sci. Comput., 2010

A Critical Path Approach to Analyzing Parallelism of Algorithmic Variants. Application to Cholesky Inversion

[DOI]

Henricus Bouwmeester

,

CoRR, 2010

Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures.

[DOI]

Emmanuel Agullo

,

Henricus Bouwmeester

,

Jack J. Dongarra

,

,

,

Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

QR factorization of tall and skinny matrices in a grid computing environment.

[DOI]

Emmanuel Agullo

,

,

Jack J. Dongarra

,

Thomas Hérault

,

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009

A class of parallel tiled linear algebra algorithms for multicore architectures.

[DOI]

Alfredo Buttari

,

,

,

Jack J. Dongarra

Parallel Comput., 2009

Computing the conditioning of the components of a linear least-squares solution.

[DOI]

,

Jack J. Dongarra

,

,

Numer. Linear Algebra Appl., 2009

Algorithm-based fault tolerance applied to high performance computing.

[DOI]

,

,

Jack J. Dongarra

,

J. Parallel Distributed Comput., 2009

The Problem With the Linpack Benchmark 1.0 Matrix Generator.

[DOI]

Jack J. Dongarra

,

Int. J. High Perform. Comput. Appl., 2009

Accelerating scientific computations with mixed precision algorithms.

[DOI]

,

Alfredo Buttari

,

Jack J. Dongarra

,

,

,

,

,

Stanimire Tomov

Comput. Phys. Commun., 2009

2008

Algorithmic Based Fault Tolerance Applied to High Performance Computing

[DOI]

,

,

Jack J. Dongarra

,

CoRR, 2008

Communication-avoiding parallel and sequential QR factorizations

[DOI]

,

,

,

CoRR, 2008

2007

Prospectus for a Dense Linear Algebra Software Library.

[DOI]

,

,

,

,

,

Alfredo Buttari

,

Stanimire Tomov

,

,

,

,

Christof Vömel

,

,

,

Jack J. Dongarra

,

,

Beresford N. Parlett

,

Proceedings of the Handbook of Parallel Computing - Models, Algorithms and Applications., 2007

Recovery Patterns for Iterative Methods in a Parallel Unstable Environment.

[DOI]

,

,

,

Jack J. Dongarra

SIAM J. Sci. Comput., 2007

Convergence in Backward Error of Relaxed GMRES.

[DOI]

,

,

SIAM J. Sci. Comput., 2007

Performance Optimization and Modeling of Blocked Sparse Kernels.

[DOI]

Alfredo Buttari

,

Victor Eijkhout

,

,

Salvatore Filippone

Int. J. High Perform. Comput. Appl., 2007

Mixed Precision Iterative Refinement Techniques for the Solution of Dense Linear Systems.

[DOI]

Alfredo Buttari

,

Jack J. Dongarra

,

,

,

,

Int. J. High Perform. Comput. Appl., 2007

A distributed packed storage for large dense parallel in-core calculations.

[DOI]

,

,

,

Concurr. Comput. Pract. Exp., 2007

Advanced MPI Programming.

[DOI]

,

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Parallel Tiled QR Factorization for Multicore Architectures.

[DOI]

Alfredo Buttari

,

,

,

Jack J. Dongarra

Proceedings of the Parallel Processing and Applied Mathematics, 2007

2006

A note on the error analysis of classical Gram-Schmidt.

[DOI]

Alicja Smoktunowicz

,

Jesse L. Barlow

,

Numerische Mathematik, 2006

Conjugate-gradient eigenvalue solvers in computing electronic properties of nanostructure architectures.

[DOI]

Stanimire Tomov

,

,

Jack J. Dongarra

,

,

Int. J. Comput. Sci. Eng., 2006

Self-adapting numerical software (SANS) effort.

[DOI]

Jack J. Dongarra

,

,

,

Victor Eijkhout

,

,

,

,

,

Jelena Pjesivac-Grbovic

,

,

,

Sathish S. Vadhiyar

IBM J. Res. Dev., 2006

Tools and techniques for performance - Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems).

[DOI]

,

,

,

,

Alfredo Buttari

,

Jack J. Dongarra

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Recent Advances in Dense Linear Algebra: Minisymposium Abstract.

[DOI]

Daniel Kressner

,

Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Prospectus for the Next LAPACK and ScaLAPACK Libraries.

[DOI]

,

Jack J. Dongarra

,

Beresford N. Parlett

,

,

,

,

,

,

,

,

Christof Vömel

,

,

,

,

Alfredo Buttari

,

,

Stanimire Tomov

Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

The Impact of Multicore on Math Software.

[DOI]

Alfredo Buttari

,

Jack J. Dongarra

,

,

,

,

Stanimire Tomov

Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Exploiting Mixed Precision Floating Point Hardware in Scientific Computations.

Alfredo Buttari

,

Jack J. Dongarra

,

,

,

,

,

Stanimire Tomov

Proceedings of the High Performance Computing and Grids in Action, 2006

Parallel Linear Algebra Software.

[DOI]

Victor Eijkhout

,

,

Jack J. Dongarra

Proceedings of the Parallel Processing for Scientific Computing, 2006

2005

Algorithm 842: A set of GMRES routines for real and complex arithmetics on high performance computers.

[DOI]

Valérie Frayssé

,

,

,

ACM Trans. Math. Softw., 2005

Rounding error analysis of the classical Gram-Schmidt orthogonalization process.

[DOI]

,

,

Miroslav Rozlozník

,

Jasper van den Eshof

Numerische Mathematik, 2005

Hash Functions for Datatype Signatures in MPI.

[DOI]

,

,

,

Jack J. Dongarra

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Fault tolerant high performance computing by a coding approach.

[DOI]

,

,

,

,

,

,

Jack J. Dongarra

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2005

Comparison of Nonlinear Conjugate-Gradient Methods for Computing the Electronic Properties of Nanostructure Architectures.

[DOI]

Stanimire Tomov

,

,

,

,

Jack J. Dongarra

Proceedings of the Computational Science, 2005

2004

A Rank-<i>k</i> Update Procedure for Reorthogonalizing the Orthogonal Factor from Modified Gram-Schmidt.

[DOI]

,

,

SIAM J. Matrix Anal. Appl., 2004

2003

A Robust Criterion for the Modified Gram-Schmidt Algorithm with Selective Reorthogonalization.

[DOI]

,

SIAM J. Sci. Comput., 2003

Loading...