Carlos García

Orcid: 0000-0002-3470-1097

Affiliations:
  • Complutense University of Madrid, Computer Architecture and Automation Department, Spain (PhD 2007)


According to our database1, Carlos García authored at least 65 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Acceleration and energy consumption optimization in cascading classifiers for face detection on low-cost ARM big.LITTLE asymmetric architectures.
CoRR, 2024

2023
Exploring the performance and portability of the k-means algorithm on SYCL across CPU and GPU architectures.
J. Supercomput., November, 2023

Comparing Performance and Portability Between CUDA and SYCL for Protein Database Search on NVIDIA, AMD, and Intel GPUs.
Proceedings of the 35th IEEE International Symposium on Computer Architecture and High Performance Computing, 2023

Exploring Heterogeneous Computing Environments: A Preliminary Analysis of Python and SYCL Performance.
Proceedings of the Cloud Computing, Big Data & Emerging Topics - 11th Conference, 2023

2022
Evaluation of Intel's DPC++ Compatibility Tool in heterogeneous computing.
J. Parallel Distributed Comput., 2022

Assessing Opportunities of SYCL and Intel oneAPI for Biological Sequence Alignment.
CoRR, 2022

Migrating CUDA to oneAPI: A Smith-Waterman Case Study.
Proceedings of the Bioinformatics and Biomedical Engineering, 2022

Performance Portability Assessment: Non-negative Matrix Factorization as a Case Study.
Proceedings of the Euro-Par 2022: Parallel Processing Workshops, 2022

Portability and Performance Assessment of the Non-Negative Matrix Factorization Algorithm with OpenMP and SYCL.
Proceedings of the XLVIII Latin American Computer Conference, 2022

2021
Early Experiences Migrating CUDA codes to oneAPI.
CoRR, 2021

2020
HEVC optimization based on human perception for real-time environments.
Multim. Tools Appl., 2020

CNN Inference acceleration using low-power devices for human monitoring and security scenarios.
Comput. Electr. Eng., 2020

2019
Portability Study of an OpenCL Algorithm for Automatic Target Detection in Hyperspectral Images.
IEEE Trans. Geosci. Remote. Sens., 2019

SWIMM 2.0: Enhanced Smith-Waterman on Intel's Multicore and Manycore Architectures Based on AVX-512 Vector Extensions.
Int. J. Parallel Program., 2019

Open Multi-Processing Acceleration for Unsupervised Land Cover Categorization Using Probabilistic Latent Semantic Analysis.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

2018
Fast and effective CU size decision based on spatial and temporal homogeneity detection.
Multim. Tools Appl., 2018

Multicore Real-Time Implementation of a Full Hyperspectral Unmixing Chain.
IEEE Geosci. Remote. Sens. Lett., 2018

Portable real-time DCT-based steganography using OpenCL.
J. Real Time Image Process., 2018

OSWALD.
Int. J. High Perform. Comput. Appl., 2018

Acceleration and energy consumption optimization in cascading classifiers for face detection on low-cost ARM big. LITTLE asymmetric architectures.
Int. J. Circuit Theory Appl., 2018

Complexity reduction in the HEVC/H265 standard based on smooth region classification.
Digit. Signal Process., 2018

SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences.
BMC Syst. Biol., 2018

Control Design for an Articulated Truck With Autonomous Driving in an Electrified Highway.
IEEE Access, 2018

2017
First Experiences Optimizing Smith-Waterman on Intel's Knights Landing Processor.
CoRR, 2017

Accelerating Smith-Waterman Alignment of Long DNA Sequences with OpenCL on FPGA.
Proceedings of the Bioinformatics and Biomedical Engineering, 2017

First Experiences Accelerating Smith-Waterman on Intel's Knights Landing Processor.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2017

Embedded Grammars for Grammatical Evolution on GPGPU.
Proceedings of the Applications of Evolutionary Computation - 20th European Conference, 2017

2016
GPU Implementation of Spatial-Spectral Preprocessing for Hyperspectral Unmixing.
IEEE Geosci. Remote. Sens. Lett., 2016

Code obfuscation using very long identifiers for FFT motion estimation models in embedded processors.
J. Real Time Image Process., 2016

Real-time motion estimation for image and video processing applications.
J. Real Time Image Process., 2016

4K-based intra and interprediction techniques for HEVC.
Proceedings of the Real-Time Image and Video Processing 2016, 2016

2015
An energy-aware performance analysis of SWIMM: <i>S</i>mith-<i>W</i>aterman implementation on <i>I</i>ntel's <i>M</i>ulticore and <i>M</i>anycore architectures.
Concurr. Comput. Pract. Exp., 2015

Non-negative Matrix Factorization on Low-Power Architectures and Accelerators: A Comparative Study.
Comput. Electr. Eng., 2015

NMF-mGPU: non-negative matrix factorization on multi-GPU systems.
BMC Bioinform., 2015

Smith-Waterman Protein Search with OpenCL on an FPGA.
Proceedings of the 2015 IEEE TrustCom/BigDataSE/ISPA, 2015

OpenACC-based GPU acceleration of an optical flow algorithm.
Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

Parallel trajectory synchronization for aircraft conflicts resolution.
Proceedings of the 30th Annual ACM Symposium on Applied Computing, 2015

Customized Nios II multi-cycle instructions to accelerate block-matching techniques.
Proceedings of the Real-Time Image and Video Processing 2015, 2015

Fast-coding robust motion estimation model in a GPU.
Proceedings of the Real-Time Image and Video Processing 2015, 2015

Early Experiences with OpenCL on FPGAs: Convolution Case Study.
Proceedings of the 23rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2015

2014
Smith-Waterman algorithm on heterogeneous systems: A case study.
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014

2013
Offset Printing Plate Quality Sensor on a Low-Cost Processor.
Sensors, 2013

Robust motion estimation on a low-power multi-core DSP.
EURASIP J. Adv. Signal Process., 2013

Acceleration of block-matching algorithms using a custom instruction-based paradigm on a Nios II microprocessor.
EURASIP J. Adv. Signal Process., 2013

Multi-GPU based on multicriteria optimization for motion estimation system.
EURASIP J. Adv. Signal Process., 2013

Hardware implementation of machine vision systems: image and video processing.
EURASIP J. Adv. Signal Process., 2013

GPU-based acceleration of bio-inspired motion estimation model.
Concurr. Comput. Pract. Exp., 2013

Implementation of a Low-Cost Mobile Devices to Support Medical Diagnosis.
Comput. Math. Methods Medicine, 2013

Non-negative matrix factorization on low-power architectures: a comparative study.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

2012
A Low Cost Matching Motion Estimation Sensor Based on the NIOS II Microprocessor.
Sensors, 2012

OpenIRS-UCM: an open-source multi-platform for interactive response systems.
Proceedings of the Annual Conference on Innovation and Technology in Computer Science Education, 2012

2011
Parallelism on the Nonnegative Matrix Factorization.
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

Biclustering and classification analysis in gene expression using Nonnegative Matrix Factorization on multi-GPU systems.
Proceedings of the 11th International Conference on Intelligent Systems Design and Applications, 2011

2010
On-Line Multi-Threaded Processing of Web User-Clicks on Multi-Core Processors.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

Building efficient multi-threaded search nodes.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2008
bioNMF: a web-based tool for nonnegative matrix factorization in biology.
Nucleic Acids Res., 2008

Improving Search Engines Performance on Multithreading Processors.
Proceedings of the High Performance Computing for Computational Science, 2008

Exploiting Hybrid Parallelism in Web Search Engines.
Proceedings of the Euro-Par 2008, 2008

2007
Multigrid Smoothers on Multicore Architectures.
Proceedings of the Parallel Computing: Architectures, 2007

2006
Enhancing the Performance of Multigrid Smoothers in Simultaneous Multithreading Architectures.
Proceedings of the High Performance Computing for Computational Science, 2006

2005
JPEG2000 Optimization in General Purpose Microprocessors.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

A Speculative Parallel Algorithm for Self-Organizing Maps.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

2004
Exploiting Multilevel Parallelism Within Modern Microprocessors: DWT as a Case Study.
Proceedings of the High Performance Computing for Computational Science, 2004

2003
Vectorization of Multigrid Codes Using SIMD ISA Extensions.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

2002
A Parallel Cloth Simulator Using Multilevel Algorithms.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002


  Loading...