Mário P. Véstias

Orcid: 0000-0001-8556-4507

According to our database1, Mário P. Véstias authored at least 64 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Designing Deep Learning Models on FPGA with Multiple Heterogeneous Engines.
ACM Trans. Reconfigurable Technol. Syst., March, 2024

Enhancing Urban Intersection Efficiency: Visible Light Communication and Learning-Based Control for Traffic Signal Optimization and Vehicle Management.
Symmetry, February, 2024

LiDAR 3D Object Detection in FPGA with Low Bitwidth Quantization.
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2024

2023
Efficient Design of Low Bitwidth Convolutional Neural Networks on FPGA with Optimized Dot Product Units.
ACM Trans. Reconfigurable Technol. Syst., March, 2023

Energy-Efficient and Real-Time Wearable for Wellbeing-Monitoring IoT System Based on SoC-FPGA.
Algorithms, March, 2023

Smart Embedded System for Skin Cancer Classification.
Future Internet, February, 2023

2022
Onboard Processing of Synthetic Aperture Radar Backprojection Algorithm in FPGA.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2022

A Review of Synthetic-Aperture Radar Image Formation Algorithms and Implementations: A Computational Perspective.
Remote. Sens., 2022

2021
Efficient Design of Pruned Convolutional Neural Networks on FPGA.
J. Signal Process. Syst., 2021

Configurable Hardware Core for IoT Object Detection.
Future Internet, 2021

Decimal Multiplication in FPGA with a Novel Decimal Adder/Subtractor.
Algorithms, 2021

IOb-Cache: A High-Performance Configurable Open-Source Cache.
Algorithms, 2021

A Full Featured Configurable Accelerator for Object Detection With YOLO.
IEEE Access, 2021

2020
Hyperspectral Compressive Sensing With a System-On-Chip FPGA.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2020

A fast and scalable architecture to run convolutional neural networks in low density FPGAs.
Microprocess. Microsystems, 2020

Moving Deep Learning to the Edge.
Algorithms, 2020

A Configurable Architecture for Running Hybrid Convolutional Neural Networks in Low-Density FPGAs.
IEEE Access, 2020

Implementing CNNs Using a Linear Array of Full Mesh CGRAs.
Proceedings of the Applied Reconfigurable Computing. Architectures, Tools, and Applications, 2020

Processing Systems for Deep Learning Inference on Edge Devices.
Proceedings of the Convergence of Artificial Intelligence and the Internet of Things, 2020

2019
A Survey of Convolutional Neural Networks on Edge with Reconfigurable Computing.
Algorithms, 2019

Low Energy Heterogeneous Computing with Multiple RISC-V and CGRA Cores.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Hybrid Dot-Product Calculation for Convolutional Neural Networks in FPGA.
Proceedings of the 29th International Conference on Field Programmable Logic and Applications, 2019

Faster Convolutional Neural Networks in Low Density FPGAs Using Block Pruning.
Proceedings of the Applied Reconfigurable Computing - 15th International Symposium, 2019

Exploring Data Size to Run Convolutional Neural Networks in Low Density FPGAs.
Proceedings of the Applied Reconfigurable Computing - 15th International Symposium, 2019

2018
Improving the area of fast parallel decimal multipliers.
Microprocess. Microsystems, 2018

Lite-CNN: A High-Performance Architecture to Execute CNNs in Low Density FPGAs.
Proceedings of the 28th International Conference on Field Programmable Logic and Applications, 2018

2017
Decimal addition on FPGA based on a mixed BCD/excess-6 representation.
Microprocess. Microsystems, 2017

Parallel dot-products for deep learning on FPGA.
Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017

K-means clustering on CGRA.
Proceedings of the 27th International Conference on Field Programmable Logic and Applications, 2017

2016
XtokaxtikoX: A stochastic computing-based autonomous cyber-physical system.
Proceedings of the IEEE International Conference on Rebooting Computing, 2016

Multi-core for K-means clustering on FPGA.
Proceedings of the 26th International Conference on Field Programmable Logic and Applications, 2016

2015
Designing Hardware/Software Systems for Embedded High-Performance Computing.
CoRR, 2015

FPGA-based architecture for hyperspectral unmixing.
Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium, 2015

Enhancing stochastic computations via process variation.
Proceedings of the 25th International Conference on Field Programmable Logic and Applications, 2015

Using Dynamic Reconfiguration to Reduce the Area of a JPEG Decoder on FPGA.
Proceedings of the 2015 Euromicro Conference on Digital System Design, 2015

Sparse Matrix Multiplication on a Reconfigurable Many-Core Architecture.
Proceedings of the 2015 Euromicro Conference on Digital System Design, 2015

2014
A Many-Core Overlay for High-Performance Embedded Computing on FPGAs.
CoRR, 2014

Trends of CPU, GPU and FPGA for high-performance computing.
Proceedings of the 24th International Conference on Field Programmable Logic and Applications, 2014

Efficient implementation of a single-precision floating-point arithmetic unit on FPGA.
Proceedings of the 24th International Conference on Field Programmable Logic and Applications, 2014

2013
Very low resource table-based FPGA evaluation of elementary functions.
Proceedings of the 2012 International Conference on Reconfigurable Computing and FPGAs, 2013

Design of a multiband full-rate ultra-wideband receiver in FPGA.
Proceedings of the 4th IEEE Latin American Symposium on Circuits and Systems, 2013

Design of a massively parallel computing architecture for dense matrix multiplication.
Proceedings of the 4th IEEE Latin American Symposium on Circuits and Systems, 2013

A reconfigurable computing architecture using magnetic tunneling junction memories.
Proceedings of the 23rd International Conference on Field programmable Logic and Applications, 2013

Analysis of matrix multiplication on high density Virtex-7 FPGA.
Proceedings of the 23rd International Conference on Field programmable Logic and Applications, 2013

2012
A High-Performance Reconfigurable Computing architecture using a magnetic configuration memory.
Proceedings of the 2012 International Conference on Reconfigurable Computing and FPGAs, 2012

Non-volatile memory circuits for FIMS and TAS writing techniques on magnetic tunnelling junctions.
Proceedings of the 19th IEEE International Conference on Electronics, Circuits and Systems, 2012

Tradeoffs in the design of sliding block Viterbi decoders for MB-OFDM UWB systems.
Proceedings of the IEEE Second International Conference on Consumer Electronics - Berlin, 2012

Design of an IEEE 802.15.3c baseband processor in FPGA.
Proceedings of the IEEE Second International Conference on Consumer Electronics - Berlin, 2012

Design and test of a MIMO Receiver based on the Alamouti scheme in FPGA.
Proceedings of the IEEE Second International Conference on Consumer Electronics - Berlin, 2012

Sliding block Viterbi decoders in FPGA.
Proceedings of the 22nd International Conference on Field Programmable Logic and Applications (FPL), 2012

Design of High-Speed Viterbi Decoders on Virtex-6 FPGAs.
Proceedings of the 15th Euromicro Conference on Digital System Design, 2012

Parallel Decimal Multipliers and Squarers Using Karatsuba-Ofman's Algorithm.
Proceedings of the 15th Euromicro Conference on Digital System Design, 2012

2011
Revisiting the Newton-Raphson Iterative Method for Decimal Division.
Proceedings of the International Conference on Field Programmable Logic and Applications, 2011

2009
Run-Time Reconfigurable Array Using Magnetic RAM.
Proceedings of the 12th Euromicro Conference on Digital System Design, 2009

Double-precision Gauss-Jordan Algorithm with Partial Pivoting on FPGAs.
Proceedings of the 12th Euromicro Conference on Digital System Design, 2009

2008
Decimal multiplier on FPGA using embedded binary multipliers.
Proceedings of the FPL 2008, 2008

Multiplier-based double precision floating point divider according to the IEEE-754 standard.
Proceedings of the Reconfigurable Computing: Architectures, 2008

2007
Router Design for Application Specific Networks-on-Chip on Reconfigurable Systems.
Proceedings of the FPL 2007, 2007

2006
Area and performance optimization of a generic network-on-chip architecture.
Proceedings of the 19th Annual Symposium on Integrated Circuits and Systems Design, 2006

A Generic Network-on-Chip Architecture for Reconfigurable Systems: Implementation and Evaluation.
Proceedings of the 2006 International Conference on Field Programmable Logic and Applications (FPL), 2006

Co-synthesis of a configurable SoC platform based on a network on chip architecture.
Proceedings of the 2006 Conference on Asia South Pacific Design Automation: ASP-DAC 2006, 2006

Area/Performance Improvement of NoC Architectures.
Proceedings of the Reconfigurable Computing: Architectures and Applications, 2006

2003
DALI: A Methodology for the Co-Design of Dataflow Applications on Hardware/Software Architectures.
Proceedings of the 16th Annual Symposium on Integrated Circuits and Systems Design, 2003

2002
System-Level Co-Synthesis of Dataflow Dominated Applications on Reconfigurable Hardware/Software Architectures.
Proceedings of the 13th IEEE International Workshop on Rapid System Prototyping (RSP 2002), 2002


  Loading...