Alfredo Remón

According to our database1, Alfredo Remón authored at least 45 papers between 2006 and 2019.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2019
Power-aware computing.
Concurrency and Computation: Practice and Experience, 2019

A GPU-aware mixed-precision solver for low-rank algebraic Riccati equations.
Concurrency and Computation: Practice and Experience, 2019

2017
Extending the Gauss-Huard method for the solution of Lyapunov matrix equations and matrix inversion.
Concurrency and Computation: Practice and Experience, 2017

Solving Sparse Differential Riccati Equations on Hybrid CPU-GPU Platforms.
Proceedings of the Computational Science and Its Applications - ICCSA 2017, 2017

2016
Balancing Energy and Performance in Dense Linear System Solvers for Hybrid ARM+GPU platforms.
CLEI Electron. J., 2016

The Impact of Panel Factorization on the Gauss-Huard Algorithm for the Solution of Linear Systems on Modern Architectures.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2016

Tuning the Blocksize for Dense Linear Algebra Factorization Routines with the Roofline Model.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2016

2015
Extending lyapack for the solution of band Lyapunov equations on hybrid CPU-GPU platforms.
The Journal of Supercomputing, 2015

Fast and Reliable Noise Estimation for Hyperspectral Subspace Identification.
IEEE Geosci. Remote Sensing Lett., 2015

Unleashing GPU acceleration for symmetric band linear algebra kernels and model reduction.
Cluster Computing, 2015

Revisiting the Gauss-Huard Algorithm for the Solution of Linear Systems on Graphics Accelerators.
Proceedings of the Parallel Processing and Applied Mathematics, 2015

A Parallel Multi-threaded Solver for Symmetric Positive Definite Bordered-Band Linear Systems.
Proceedings of the Parallel Processing and Applied Mathematics, 2015

Exploring the Offload Execution Model in the Intel Xeon Phi via Matrix Inversion.
Proceedings of the Parallel Computing: On the Road to Exascale, 2015

Solving dense linear systems with hybrid ARM+GPU platforms.
Proceedings of the 2015 Latin American Computing Conference, 2015

Solving Linear Systems on the Intel Xeon-Phi Accelerator via the Gauss-Huard Algorithm.
Proceedings of the High Performance Computing - Second Latin American Conference, 2015

2014
Hyperspectral Unmixing on Multicore DSPs: Trading Off Performance for Energy.
IEEE J Sel. Topics in Appl. Earth Observ. and Remote Sensing, 2014

A factored variant of the Newton iteration for the solution of algebraic Riccati equations via the matrix sign function.
Numerical Algorithms, 2014

Trading Off Performance for Energy in Linear Algebra Operations with Applications in Control Theory.
CLEI Electron. J., 2014

Accelerating Band Linear Algebra Operations on GPUs with Application in Model Reduction.
Proceedings of the Computational Science and Its Applications - ICCSA 2014 - 14th International Conference, Guimarães, Portugal, June 30, 2014

Accelerating the general band matrix multiplication using graphics processors.
Proceedings of the XL Latin American Computing Conference, 2014

Efficient Symmetric Band Matrix-Matrix Multiplication on GPUs.
Proceedings of the High Performance Computing - First HPCLATAM, 2014

2013
Accelerating the Lyapack library using GPUs.
The Journal of Supercomputing, 2013

Performance versus energy consumption of hyperspectral unmixing algorithms on multi-core platforms.
EURASIP J. Adv. Sig. Proc., 2013

Matrix inversion on CPU-GPU platforms with applications in control theory.
Concurrency and Computation: Practice and Experience, 2013

Solving Matrix Equations on Multi-Core and Many-Core Architectures.
Algorithms, 2013

Exploiting Data- and Task-Parallelism in the Solution of Riccati Equations on Multicore Servers and GPUs.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

On the Impact of Optimization on the Time-Power-Energy Balance of Dense Linear Algebra Factorizations.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2013

2012
High Performance Implementations of the BST Method on Hybrid CPU-GPU Platforms.
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012

Unleashing CPU-GPU Acceleration for Control Theory Applications.
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

2011
Using graphics processors to accelerate the computation of the matrix inverse.
The Journal of Supercomputing, 2011

A mixed-precision algorithm for the solution of Lyapunov equations on hybrid CPU-GPU platforms.
Parallel Computing, 2011

Real-Time Endmember Extraction on Multicore Processors.
IEEE Geosci. Remote Sensing Lett., 2011

Accelerating BST Methods for Model Reduction with Graphics Processors.
Proceedings of the Parallel Processing and Applied Mathematics, 2011

High Performance Matrix Inversion on a Multi-core Platform with Several GPUs.
Proceedings of the 19th International Euromicro Conference on Parallel, 2011

High performance matrix inversion of SPD matrices on graphics processors.
Proceedings of the 2011 International Conference on High Performance Computing & Simulation, 2011

Efficient Model Order Reduction of Large-Scale Systems on Multi-core Platforms.
Proceedings of the Computational Science and Its Applications - ICCSA 2011, 2011

2010
Accelerating Model Reduction of Large Linear Systems with Graphics Processors.
Proceedings of the Applied Parallel and Scientific Computing, 2010

2009
Toward the parallelization of GSL.
The Journal of Supercomputing, 2009

Using Hybrid CPU-GPU Platforms to Accelerate the Computation of the Matrix Sign Function.
Proceedings of the Euro-Par 2009, 2009

2008
An Algorithm-by-Blocks for SuperMatrix Band Cholesky Factorization.
Proceedings of the High Performance Computing for Computational Science, 2008

2007
Parallel Solution of Band Linear Systems in Model Reduction.
Proceedings of the Parallel Processing and Applied Mathematics, 2007

The Implementation of BLAS for Band Matrices.
Proceedings of the Parallel Processing and Applied Mathematics, 2007

Parallel Implementation of LQG Balanced Truncation for Large-Scale Systems.
Proceedings of the Large-Scale Scientific Computing, 6th International Conference, 2007

2006
Cholesky Factorization of Band Matrices Using Multithreaded BLAS.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Parallel LU Factorization of Band Matrices on SMP Systems.
Proceedings of the High Performance Computing and Communications, 2006


  Loading...