Pedro Tomás

Orcid: 0000-0001-8083-4432

According to our database, Pedro Tomás authored at least 79 papers between 2003 and 2024.

Bibliography

2024
gem5-accel: A Pre-RTL Simulation Toolchain for Accelerator Architecture Validation.
IEEE Comput. Archit. Lett., 2024

NDPmulator: Enabling Full-System Simulation for Near-Data Accelerators From Caches to DRAM.
IEEE Access, 2024

2023
Super-resolution of magnetic resonance images using Generative Adversarial Networks.
Comput. Medical Imaging Graph., September, 2023

Trading Performance, Power, and Area on Low-Precision Posit MAC Units for CNN Training.
Proceedings of the 35th IEEE International Symposium on Computer Architecture and High Performance Computing, 2023

Stacking Deep Learning Models for Early Detection of Wildfire Smoke Plumes.
Proceedings of the 31st European Signal Processing Conference, 2023

Supporting RISC-V Performance Counters Through Linux Performance Analysis Tools.
Proceedings of the 34th IEEE International Conference on Application-specific Systems, 2023

2022
Unified Posit/IEEE-754 Vector MAC Unit for Transprecision Computing.
IEEE Trans. Circuits Syst. II Express Briefs, 2022

Compiling for Vector Extensions With Stream-Based Specialization.
IEEE Micro, 2022

Decoupling GPGPU voltage-frequency scaling for deep-learning applications.
J. Parallel Distributed Comput., 2022

gem5-ndp: Near-Data Processing Architecture Simulation From Low Level Caches to DRAM.
Proceedings of the 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2022

Early prototyping and testing of CERN LHC CMS high-granularity calorimeter slow-control system.
Proceedings of the IEEE International Workshop on Rapid System Prototyping, 2022

Validation of NFV management and orchestration on Kubernetes-based 5G testbed environment.
Proceedings of the IEEE Globecom 2022 Workshops, 2022

2021
A Compute Cache System for Signal Processing Applications.
J. Signal Process. Syst., 2021

A Reconfigurable Posit Tensor Unit with Variable-Precision Arithmetic and Automatic Data Streaming.
J. Signal Process. Syst., 2021

Compiler-Assisted Data Streaming for Regular Code Structures.
IEEE Trans. Computers, 2021

Supporting RISC-V Performance Counters through Performance analysis tools for Linux (Perf).
CoRR, 2021

Unlimited Vector Extension with Data Streaming Support.
Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

PositNN: Training Deep Neural Networks with Mixed Low-Precision Posit.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
End-to-End Learning of Video Compression using Spatio-Temporal Autoencoders.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2020

Dynamic Fused Multiply-Accumulate Posit Unit with Variable Exponent Size for Low-Precision DSP Applications.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2020

Exploiting Non-conventional DVFS on GPUs: Application to Deep Learning.
Proceedings of the 32nd IEEE International Symposium on Computer Architecture and High Performance Computing, 2020

Processing Convolutional Neural Networks on Cache.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Neighborhood-aware autoencoder for missing value imputation.
Proceedings of the 28th European Signal Processing Conference, 2020

Reconfigurable Stream-based Tensor Unit with Variable-Precision Posit Arithmetic.
Proceedings of the 31st IEEE International Conference on Application-specific Systems, 2020

2019
Modeling and Decoupling the GPU Power Consumption for Cross-Domain DVFS.
IEEE Trans. Parallel Distributed Syst., 2019

DVFS-aware application classification to improve GPGPUs energy efficiency.
Parallel Comput., 2019

GPU Static Modeling Using PTX and Deep Structured Learning.
IEEE Access, 2019

Heart Disease Detection Architecture for Lead I Off-the-Person ECG Monitoring Devices.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Stream data prefetcher for the GPU memory interface.
J. Supercomput., 2018

MrBayes sMC³.
Int. J. High Perform. Comput. Appl., 2018

Exploiting Compute Caches for Memory Bound Vector Operations.
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

GPGPU Power Modeling for Multi-domain Voltage-Frequency Scaling.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

2017
Adaptive In-Cache Streaming for Efficient Data Management.
IEEE Trans. Very Large Scale Integr. Syst., 2017

Efficient parallelization of perturbative Monte Carlo QM/MM simulations in heterogeneous platforms.
Int. J. High Perform. Comput. Appl., 2017

SCRATCH: an end-to-end application-aware soft-GPGPU architecture and trimming tool.
Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

2016
BowMapCL: Burrows-Wheeler Mapping on Multiple Heterogeneous Accelerators.
IEEE ACM Trans. Comput. Biol. Bioinform., 2016

A Framework for Application-Guided Task Management on Heterogeneous Embedded Systems.
ACM Trans. Archit. Code Optim., 2016

Multi-objective kernel mapping and scheduling for morphable many-core architectures.
Expert Syst. Appl., 2016

A Cross-Core Performance Model for Heterogeneous Many-Core Architectures.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016

Unsupervised variable-grained online phase clustering for heterogeneous/morphable processors.
Proceedings of the International Conference on High Performance Computing & Simulation, 2016

In-Cache Streaming: Morphable Infrastructure for Many-Core Processing Systems.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

Performance and Power-Aware Classification for Frequency Scaling of GPGPU Applications.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

2015
Multicore SIMD ASIP for Next-Generation Sequencing and Alignment Biochip Platforms.
IEEE Trans. Very Large Scale Integr. Syst., 2015

Prognostic models based on patient snapshots and time windows: Predicting disease progression to assisted ventilation in Amyotrophic Lateral Sclerosis.
J. Biomed. Informatics, 2015

Morphable hundred-core heterogeneous architecture for energy-aware computation.
IET Comput. Digit. Tech., 2015

Acceleration of stochastic seismic inversion in OpenCL-based heterogeneous platforms.
Comput. Geosci., 2015

Attaining performance fairness in big.LITTLE systems.
Proceedings of the 12th International Workshop on Intelligent Solutions in Embedded Systems, 2015

Accelerating Phylogenetic Inference on Heterogeneous OpenCL Platforms.
Proceedings of the 2015 IEEE TrustCom/BigDataSE/ISPA, 2015

Multi-kernel Auto-Tuning on GPUs: Performance and Energy-Aware Optimization.
Proceedings of the 23rd Euromicro International Conference on Parallel, 2015

Energy-Efficient Architecture for DP Local Sequence Alignment: Exploiting ILP and DLP.
Proceedings of the Bioinformatics and Biomedical Engineering, 2015

Efficient data-stream management for shared-memory many-core systems.
Proceedings of the 25th International Conference on Field Programmable Logic and Applications, 2015

Fast and Scalable Thread Migration for Multi-core Architectures.
Proceedings of the 13th IEEE International Conference on Embedded and Ubiquitous Computing, 2015

2014
Stream Oriented Modular Architecture with Polymorphic Processing Engines.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing Workshop, 2014

Performance-Aware Task Management and Frequency Scaling in Embedded Systems.
Proceedings of the 26th IEEE International Symposium on Computer Architecture and High Performance Computing, 2014

Accelerating Phylogenetic Inference on GPUs: an OpenACC and CUDA comparison.
Proceedings of the International Work-Conference on Bioinformatics and Biomedical Engineering, 2014

Burrows-Wheeler Transform based indexed exact search on a multi-GPU OpenCL platform.
Proceedings of the International Conference on High Performance Computing & Simulation, 2014

Low-power vectorial VLIW architecture for maximum parallelism exploitation of dynamic programming algorithms.
Proceedings of the International Conference on High Performance Computing & Simulation, 2014

SchedMon: A Performance and Energy Monitoring Tool for Modern Multi-cores.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

GPU Accelerated Stochastic Inversion of Deep Water Seismic Data.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

Accelerating differential power analysis on heterogeneous systems.
Proceedings of the 9th Workshop on Embedded Systems Security, 2014

Finite-Difference in Time-Domain Scalable Implementations on CUDA and OpenCL.
Proceedings of the Numerical Computations with GPUs, 2014

2013
HotStream: Efficient Data Streaming of Complex Patterns to Multiple Accelerating Kernels.
Proceedings of the 25th International Symposium on Computer Architecture and High Performance Computing, 2013

Transparent Application Acceleration by Intelligent Scheduling of Shared Library Calls on Heterogeneous Systems.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

Monitoring Performance and Power for Application Characterization with the Cache-Aware Roofline Model.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

A flexible shared library profiler for early estimation of performance gains in heterogeneous systems.
Proceedings of the International Conference on High Performance Computing & Simulation, 2013

A comparison of computing architectures and parallelization frameworks based on a two-dimensional FDTD.
Proceedings of the International Conference on High Performance Computing & Simulation, 2013

Scalable and high throughput biosensing platform.
Proceedings of the 23rd International Conference on Field programmable Logic and Applications, 2013

BioBlaze: Multi-core SIMD ASIP for DNA sequence alignment.
Proceedings of the 24th International Conference on Application-Specific Systems, 2013

2012
Energy efficient stream-based configurable architecture for embedded platforms.
Proceedings of the 2012 International Conference on Embedded Computer Systems: Architectures, 2012

2010
A quantitative analysis of firing rate estimators: Unveiling bias sources.
Neurocomputing, 2010

Efficient Independent Component Analysis on a GPU.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009
A Feature Selection Algorithm for the Regularization of Neuron Models.
IEEE Trans. Instrum. Meas., 2009

Neural code metrics: Analysis and application to the assessment of neural models.
Neurocomputing, 2009

2008
Statistical Analysis of a Spike Train Distance in Poisson Models.
IEEE Signal Process. Lett., 2008

Towards a Unified Model for the Retina - Static vs Dynamic Integrate and Fire Models.
Proceedings of the First International Conference on Biomedical Electronics and Devices, 2008

2007
An Efficient Expectation-Maximisation Algorithm for Spike Classification.
Proceedings of the 15th International Conference on Digital Signal Processing, 2007

Stochastic integrate-and-fire model for the retina.
Proceedings of the 15th European Signal Processing Conference, 2007

2005
Visual neuroprosthesis: a non invasive system for stimulating the cortex.
IEEE Trans. Circuits Syst. I Regul. Pap., 2005

2003
An FPL Bioinspired Visual Encoding System to Stimulate Cortical Neurons in Real-Time.
Proceedings of the Field Programmable Logic and Application, 13th International Conference, 2003

