José R. Herrero

José Monteiro

Comput. Math. Appl., 2020

2019

Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD.

Andrés E. Tomás

Numer. Algorithms, 2019

Resource-aware Elastic Swap Random Forest for Evolving Data Streams.

CoRR, 2019

A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting.

Robert A. van de Geijn

IEEE Access, 2019

2018

Static scheduling of the LU factorization with look-ahead on asymmetric multicore processors.

Parallel Comput., 2018

Energy balance between voltage-frequency scaling and resilience for linear algebra routines on low-power multicore architectures.

Parallel Comput., 2018

Two-sided orthogonal reductions to condensed forms on asymmetric multicore processors.

Pedro Alonso

Parallel Comput., 2018

Multi-threaded dense linear algebra libraries for low-power asymmetric multicore processors.

Francisco D. Igual

Chris Adeniyi-Jones

J. Comput. Sci., 2018

A path-level exact parallelization strategy for sequential simulation.

Comput. Geosci., 2018

2017

Two-Sided Reduction to Compact Band Forms with Look-Ahead.

Andrés E. Tomás

CoRR, 2017

Reduction to Tridiagonal Form for Symmetric Eigenproblems on Asymmetric Multicore Processors.

Pedro Alonso

Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores, 2017

Static Versus Dynamic Task Scheduling of the LU Factorization on ARM big.LITTLE Architectures.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Low-latency multi-threaded ensemble learning for dynamic big data streams.

Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

2016

Echo State Hoeffding Tree Learning.

Proceedings of The 8th Asian Conference on Machine Learning, 2016

2015

Acceleration of the Geostatistical Software Library (GSLIB) by code optimization and hybrid parallel programming.

Oscar Peredo

Julián M. Ortiz

Comput. Geosci., 2015

Multi-Threaded Dense Linear Algebra Libraries for Low-Power Asymmetric Multicore Processors.

Francisco D. Igual

CoRR, 2015

Parallel computing on graphics processing units and heterogeneous platforms.

Paolo Bientinesi

Robert Strzodka

Concurr. Comput. Pract. Exp., 2015

Tareador: a tool to unveil parallelization strategies at undergraduate level.

Eduard Ayguadé

Rosa M. Badia

Daniel Jiménez-González

Proceedings of the Workshop on Computer Architecture Education, 2015

2014

Tuning and hybrid parallelization of a genetic-based multi-point statistics simulation code.

Parallel Comput., 2014

Evaluation and assessment of professional skills in the Final Year Project.

Proceedings of the IEEE Frontiers in Education Conference, 2014

2013

Level-3 Cholesky Factorization Routines Improve Performance of Many Cholesky Algorithms.

ACM Trans. Math. Softw., 2013

Graphics processing unit computing and exploitation of hardware accelerators.

Robert Strzodka

Concurr. Comput. Pract. Exp., 2013

A Square Block Format for Symmetric Band Matrices.

Fred G. Gustavson

Enric Morancho

Proceedings of the Parallel Processing and Applied Mathematics, 2013

2012

On new computational local orders of convergence.

Appl. Math. Lett., 2012

2011

Special Issue: GPU computing.

Robert Strzodka

Concurr. Comput. Pract. Exp., 2011

New Level-3 BLAS Kernels for Cholesky Factorization.

Fred G. Gustavson

Jerzy Wasniewski

Proceedings of the Parallel Processing and Applied Mathematics, 2011

2009

Parallelizing dense and banded linear algebra libraries using SMPSs.

Gregorio Quintana-Ortí

Concurr. Comput. Pract. Exp., 2009

2008

Hypermatrix oriented supernode amalgamation.

J. Supercomput., 2008

2007

Exploiting computer resources for fast nearest neighbor classification.

Pattern Anal. Appl., 2007

Analysis of a sparse hypermatrix Cholesky with fixed-sized blocking.

Appl. Algebra Eng. Commun. Comput., 2007

New Data Structures for Matrices and Specialized Inner Kernels: Low Overhead for High Performance.

Jose Ramón Herrero Zaragoza

Proceedings of the Parallel Processing and Applied Mathematics, 2007

2006

A framework for efficient execution of matrix computations.

PhD thesis, 2006

Using Non-canonical Array Layouts in Dense Matrix Operations.

Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Sparse Hypermatrix Cholesky: Customization for High Performance.

Proceedings of the International MultiConference of Engineers and Computer Scientists 2006, 2006

Compiler-Optimized Kernels: An Efficient Alternative to Hand-Coded Inner Kernels.

Proceedings of the Computational Science and Its Applications, 2006

2005

Adapting Linear Algebra Codes to the Memory Hierarchy Using a Hypermatrix Scheme.

Proceedings of the Parallel Processing and Applied Mathematics, 2005

A Study on Load Imbalance in Parallel Hypermatrix Multiplication Using OpenMP.

Proceedings of the Parallel Processing and Applied Mathematics, 2005

Efficient Implementation of Nearest Neighbor Classification.

Proceedings of the Computer Recognition Systems, 2005

2004

Optimization of a Statically Partitioned Hypermatrix Sparse Cholesky Factorization.

Proceedings of the Applied Parallel Computing, 2004

2003

Building Software Via Shared Knowledge.

Proceedings of the International Conference on Software Engineering Research and Practice, 2003

Automatic Benchmarking and Optimization of Codes: An Experience with Numerical Kernels.

Proceedings of the International Conference on Software Engineering Research and Practice, 2003

Improving Performance of Hypermatrix Cholesky Factorization.

Proceedings of the Euro-Par 2003. Parallel Processing, 2003

Operating System Support for Process Confinement.

David Benlliure

Proceedings of the International Conference on Security and Management, 2003

1996

Data Prefetching and Multilevel Blocking for Linear Algebra Operations.

Elena García-Diego