José Ignacio Aliaga

Orcid: 0000-0001-8469-764X

According to our database1, José Ignacio Aliaga authored at least 66 papers between 1996 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
General framework for re-assuring numerical reliability in parallel Krylov solvers: A case of bi-conjugate gradient stabilized methods.
Int. J. High Perform. Comput. Appl., January, 2024

2023
Compressed basis GMRES on high-performance graphics processing units.
Int. J. High Perform. Comput. Appl., March, 2023

General framework for re-assuring numerical reliability in parallel Krylov solvers: A case of BiCGStab methods.
CoRR, 2023

Sparse matrix-vector and matrix-multivector products for the truncated SVD on graphics processors.
Concurr. Comput. Pract. Exp., 2023

Efficient data redistribution for malleable applications.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Configurable synthetic application for studying malleability in HPC.
Proceedings of the 31st Euromicro International Conference on Parallel, 2023

2022
Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units.
Concurr. Comput. Pract. Exp., 2022

General Framework for Deriving Reproducible Krylov Subspace Algorithms: BiCGStab Case.
Proceedings of the Parallel Processing and Applied Mathematics, 2022

2021
Malleability Implementation in a MPI Iterative Method.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Iteration-fusing conjugate gradient for sparse linear systems with MPI + OmpSs.
J. Supercomput., 2020

Reproducibility strategies for parallel Preconditioned Conjugate Gradient.
J. Comput. Appl. Math., 2020

Reproducibility of parallel preconditioned conjugate gradient in hybrid programming environments.
Int. J. High Perform. Comput. Appl., 2020

Compressed Basis GMRES on High Performance GPUs.
CoRR, 2020

Reproducibility of Parallel Preconditioned Conjugate Gradient in Hybrid Programming Environments.
CoRR, 2020

Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs.
Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020

2019
An efficient GPU version of the preconditioned GMRES method.
J. Supercomput., 2019

Accelerating the task/data-parallel version of ILUPACK's BiCG in multi-CPU/GPU configurations.
Parallel Comput., 2019

Erratum to "Exploiting nested task-parallelism in theH-LU factorization" [J. Comput. Sci. 33 (2019) 20-33].
J. Comput. Sci., 2019

Exploiting nested task-parallelism in the H-LU factorization.
J. Comput. Sci., 2019

Energy-aware strategies for task-parallel sparse linear system solvers.
Concurr. Comput. Pract. Exp., 2019

2018
Extending ILUPACK with a Task-Parallel Version of BiCG for Dual-GPU Servers.
Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, 2018

Extending ILUPACK with a GPU Version of the BiCGStab Method.
Proceedings of the XLIV Latin American Computer Conference, 2018

2017
Adapting concurrency throttling and voltage-frequency scaling for dense eigensolvers.
J. Supercomput., 2017

Communication in task-parallel ILU-preconditioned CG solvers using MPI + OmpSs.
Concurr. Comput. Pract. Exp., 2017

Overcoming Memory-Capacity Constraints in the Use of ILUPACK on Graphics Processors.
Proceedings of the 29th International Symposium on Computer Architecture and High Performance Computing, 2017

SYCL-BLAS: Combining Expression Trees and Kernel Fusion on Heterogeneous Systems.
Proceedings of the Parallel Computing is Everywhere, 2017

SYCL-BLAS: Leveraging Expression Trees for Linear Algebra.
Proceedings of the 5th International Workshop on OpenCL, 2017

Task-Parallel LU Factorization of Hierarchical Matrices Using OmpSs.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Evaluating the NVIDIA Tegra Processor as a Low-Power Alternative for Sparse GPU Computations.
Proceedings of the High Performance Computing - 4th Latin American Conference, 2017

2016
Exploiting task and data parallelism in ILUPACK's preconditioned CG solver on NUMA architectures and many-core accelerators.
Parallel Comput., 2016

A fast band-Krylov eigensolver for macromolecular functional motion simulation on multicore architectures and graphics processors.
J. Comput. Phys., 2016

Characterizing the efficiency of multicore and manycore processors for the solution of sparse linear systems.
Comput. Sci. Res. Dev., 2016

A Data-Parallel ILUPACK for Sparse General and Symmetric Indefinite Linear Systems.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

Exploiting Task-Parallelism in Message-Passing Sparse Linear System Solvers Using OmpSs.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

Design of a Task-Parallel Version of ILUPACK for Graphics Processors.
Proceedings of the High Performance Computing - Third Latin American Conference, 2016

2015
Are our dense linear algebra libraries energy-friendly?
Comput. Sci. Res. Dev., 2015

Out-of-core macromolecular simulations on multithreaded architectures.
Concurr. Comput. Pract. Exp., 2015

Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processors.
Concurr. Comput. Pract. Exp., 2015

Harnessing CUDA Dynamic Parallelism for the Solution of Sparse Linear Systems.
Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Performance and Fault Tolerance of Preconditioned Iterative Solvers on Low-Power ARM Architectures.
Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Systematic Fusion of CUDA Kernels for Iterative Sparse Linear System Solvers.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014
iMODS: internal coordinates normal mode analysis server.
Nucleic Acids Res., 2014

Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems.
Clust. Comput., 2014

Leveraging Task-Parallelism with OmpSs in ILUPACK's Preconditioned CG Method.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Leveraging Data-Parallelism in ILUPACK using Graphics Processors.
Proceedings of the IEEE 13th International Symposium on Parallel and Distributed Computing, 2014

2013
Exploring large macromolecular functional motions on clusters of multicore processors.
J. Comput. Phys., 2013

Out-of-Core Solution of Eigenproblems for Macromolecular Simulations.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

Performance and Energy Analysis of the Iterative Solution of Sparse Linear Systems on Multicore and Manycore Architectures.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

Reformulated Conjugate Gradient for the Energy-Aware Solution of Linear Systems on GPUs.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

2012
Solving dense generalized eigenproblems on multi-threaded architectures.
Appl. Math. Comput., 2012

Leveraging Task-Parallelism in Energy-Efficient ILU Preconditioners.
Proceedings of the ICT as Key Technology against Global Warming, 2012

2011
ILUPACK.
Proceedings of the Encyclopedia of Parallel Computing, 2011

Exploiting thread-level parallelism in the iterative solution of sparse linear systems.
Parallel Comput., 2011

Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms.
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011

2010
Parallelization of Multilevel ILU Preconditioners on Distributed-Memory Multiprocessors.
Proceedings of the Applied Parallel and Scientific Computing, 2010

2009
Toward the parallelization of GSL.
J. Supercomput., 2009

Evaluation of Parallel Sparse Matrix Partitioning Software for Parallel Multilevel ILU Preconditioning on Shared-Memory Multiprocessors.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

2008
Design, Tuning and Evaluation of Parallel Multilevel ILU Preconditioners.
Proceedings of the High Performance Computing for Computational Science, 2008

2007
Parallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors.
Proceedings of the Parallel Computing: Architectures, 2007

2006
Parallelization of GSL: The Web Service Interface.
Proceedings of the 14th Euromicro International Conference on Parallel, 2006

2005
Parallelization of GSL on Clusters of Symmetric Multiprocessors.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

2004
Parallelization of GSL: Architecture, Interfaces, and Programming Models.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004


Parallelization of the GNU Scientific Library on Heterogeneous Systems.
Proceedings of the 3rd International Symposium on Parallel and Distributed Computing (ISPDC 2004), 2004

2000
A Lanczos-type method for multiple starting vectors.
Math. Comput., 2000

1996
A Parallel Implementation of the General Lanczos Method on the CRAY T3D.
Proceedings of the Vector and Parallel Processing, 1996


  Loading...