Davide Rossi

CoRR, 2024

Occamy: A 432-Core 28.1 DP-GFLOP/s/W 83% FPU Utilization Dual-Chiplet, Dual-HBM2E RISC-V-Based Accelerator for Stencil and Sparse Linear Algebra Computations with 8-to-64-bit Floating-Point Support in 12nm FinFET.

[BibT_eX]

[DOI]

Gianna Paulin

Paul Scheffler

Thomas Benz

Matheus A. Cavalcante

Proceedings of the IEEE Symposium on VLSI Technology and Circuits 2024, 2024

TitanCFI: Toward Enforcing Control-Flow Integrity in the Root -of- Trust.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2024

Assessing the Performance of OpenTitan as Cryptographic Accelerator in Secure Open-Hardware System-on-Chips.

[BibT_eX]

[DOI]

Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024

A Gigabit, DMA-enhanced Open-Source Ethernet Controller for Mixed-Criticality Systems.

[BibT_eX]

[DOI]

Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024

QR-PULP: Streamlining QR Decomposition for RISC-V Parallel Ultra-Low-Power Platforms.

[BibT_eX]

[DOI]

Amirhossein Kiamarzi

Giuseppe Tagliavini

Proceedings of the 21st ACM International Conference on Computing Frontiers, 2024

Spatzformer: An Efficient Reconfigurable Dual-Core RISC-V V Cluster for Mixed Scalar-Vector Workloads.

[BibT_eX]

[DOI]

Matteo Perotti

Michele Raeber

Mattia Sinigaglia

Matheus A. Cavalcante

Proceedings of the 35th IEEE International Conference on Application-specific Systems, 2024

2023

RedMule: A mixed-precision matrix-matrix operation engine for flexible and energy-efficient on-chip linear algebra and TinyML training acceleration.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., December, 2023

CVA6 RISC-V Virtualization: Architecture, Microarchitecture, and Design Space Exploration.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., November, 2023

Graphene-Based Wireless Agile Interconnects for Massive Heterogeneous Multi-Chip Processors.

[BibT_eX]

[DOI]

IEEE Wirel. Commun., August, 2023

Scalable Hierarchical Instruction Cache for Ultralow-Power Processors Clusters.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., April, 2023

Energy Efficiency of Opportunistic Refreshing for Gain-Cell Embedded DRAM.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., April, 2023

Dustin: A 16-Cores Parallel Ultra-Low-Power Cluster With 2b-to-32b Fully Flexible Bit-Precision and Vector Lockstep Execution Mode.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. I Regul. Pap., 2023

Scalable Hierarchical Instruction Cache for Ultra-Low-Power Processors Clusters.

[BibT_eX]

[DOI]

CoRR, 2023

Marsellus: A Heterogeneous RISC-V AI-IoT End-Node SoC with 2-to-8b DNN Acceleration and 30%-Boost Adaptive Body Biasing.

[BibT_eX]

[DOI]

CoRR, 2023

Echoes: a 200 GOPS/W Frequency Domain SoC with FFT Processor and I2S DSP for Flexible Data Acquisition from Microphone Arrays.

[BibT_eX]

[DOI]

CoRR, 2023

DARKSIDE: A Heterogeneous RISC-V Compute Cluster for Extreme-Edge On-Chip DNN Inference and Training.

[BibT_eX]

[DOI]

CoRR, 2023

A 3 TOPS/W RISC-V Parallel Cluster for Inference of Fine-Grain Mixed-Precision Quantized Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2023

A 12.4TOPS/W @ 136GOPS AI-IoT System-on-Chip with 16 RISC-V, 2-to-8b Precision-Scalable DNN Acceleration and 30%-Boost Adaptive Body Biasing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid- State Circuits Conference, 2023

ECHOES: a 200 GOPS/W Frequency Domain SoC with FFT Processor and I<sup>2</sup>S DSP for Flexible Data Acquisition from Microphone Arrays.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

Cyber Security aboard Micro Aerial Vehicles: An OpenTitan-based Visual Communication Use Case.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

Shaheen: An Open, Secure, and Scalable RV64 SoC for Autonomous Nano-UAVs.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE Hot Chips Symposium, 2023

Siracusa: A Low-Power On-Sensor RISC-V SoC for Extended Reality Visual Processing in 16nm CMOS.

[BibT_eX]

[DOI]

Proceedings of the 49th IEEE European Solid State Circuits Conference, 2023

Reducing Load-Use Dependency-Induced Performance Penalty in the Open-Source RISC-V CVA6 CPU.

[BibT_eX]

[DOI]

Proceedings of the 26th Euromicro Conference on Digital System Design, 2023

HULK-V: a Heterogeneous Ultra-low-power Linux capable RISC-V SoC.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

TransLib: A Library to Explore Transprecision Floating-Point Arithmetic on Multi-Core IoT End-Nodes.

[BibT_eX]

[DOI]

Seyed Ahmad Mirsalari

Giuseppe Tagliavini

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

End-to-End DNN Inference on a Massively Parallel Analog In Memory Computing Architecture.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

PATRONoC: Parallel AXI Transport Reducing Overhead for Networks-on-Chip targeting Multi-Accelerator DNN Platforms at the Edge.

[BibT_eX]

[DOI]

Vikram Jain

Matheus A. Cavalcante

Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022

A Low-Power Transprecision Floating-Point Cluster for Efficient Near-Sensor Data Analytics.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2022

Vega: A Ten-Core SoC for IoT Endnodes With DNN Acceleration and Cognitive Wake-Up From MRAM-Based State-Retentive Sleep Mode.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2022

A Heterogeneous In-Memory Computing Cluster for Flexible End-to-End Inference of Real-World Deep Neural Networks.

[BibT_eX]

[DOI]

IEEE J. Emerg. Sel. Topics Circuits Syst., 2022

ControlPULP: A RISC-V Power Controller for HPC Processors with Parallel Control-Law Computation Acceleration.

[BibT_eX]

[DOI]

Proceedings of the Embedded Computer Systems: Architectures, Modeling, and Simulation, 2022

Kraken: A Direct Event/Frame-Based Multi-sensor Fusion SoC for Ultra-Efficient Visual Processing in Nano-UAVs.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE Hot Chips 34 Symposium, 2022

Darkside: 2.6GFLOPS, 8.7mW Heterogeneous RISC-V Cluster for Extreme-Edge On-Chip DNN Inference and Training.

[BibT_eX]

[DOI]

Proceedings of the 48th IEEE European Solid State Circuits Conference, 2022

RedMulE: A Compact FP16 Matrix-Multiplication Accelerator for Adaptive Deep Learning on RISC-V-Based Ultra-Low-Power SoCs.

[BibT_eX]

[DOI]

Proceedings of the 2022 Design, Automation & Test in Europe Conference & Exhibition, 2022

Scale up your In-Memory Accelerator: Leveraging Wireless-on-Chip Communication for AIMC-based CNN Inference.

[BibT_eX]

[DOI]

Albert Cabellos-Aparicio

Proceedings of the 4th IEEE International Conference on Artificial Intelligence Circuits and Systems, 2022

2021

Arnold: An eFPGA-Augmented RISC-V SoC for Flexible and Low-Power IoT End Nodes.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2021

Analytical Modeling of Jitter in Bang-Bang CDR Circuits Featuring Phase Interpolation.

[BibT_eX]

[DOI]

Francesco Brandonisio

Andrea Bandiziol

Roberto Nonis

IEEE Trans. Very Large Scale Integr. Syst., 2021

A Fully Integrated 5-mW, 0.8-Gbps Energy-Efficient Chip-to-Chip Data Link for Ultralow-Power IoT End-Nodes in 65-nm CMOS.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2021

Energy-Efficient Hardware-Accelerated Synchronization for Shared-L1-Memory Multiprocessor Clusters.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2021

A 0.5GHz 0.35mW LDO-Powered Constant-Slope Phase Interpolator With 0.22% INL.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2021

DORY: Automatic End-to-End Deployment of Real-World DNNs on Low-Cost IoT MCUs.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2021

Vega: A 10-Core SoC for IoT End-Nodes with DNN Acceleration and Cognitive Wake-Up From MRAM-Based State-Retentive Sleep Mode.

[BibT_eX]

[DOI]

CoRR, 2021

A Fully-Integrated 5mW, 0.8Gbps Energy-Efficient Chip-to-Chip Data Link for Ultra-Low-Power IoT End-Nodes in 65-nm CMOS.

[BibT_eX]

[DOI]

CoRR, 2021

Hardware-In-The Loop Emulation for Agile Co-Design of Parallel Ultra-Low Power IoT Processors.

[BibT_eX]

[DOI]

Luca Valente

Proceedings of the 29th IFIP/IEEE International Conference on Very Large Scale Integration, 2021

4.4 A 1.3TOPS/W @ 32GOPS Fully Integrated 10-Core SoC for IoT End-Nodes with 1.7μW Cognitive Wake-Up From MRAM-Based State-Retentive Sleep Mode.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Solid-State Circuits Conference, 2021

GVSoC: A Highly Configurable, Fast and Accurate Full-Platform Simulator for RISC-V based IoT Processors.

[BibT_eX]

[DOI]

Proceedings of the 39th IEEE International Conference on Computer Design, 2021

A 1.15 TOPS/W, 16-Cores Parallel Ultra-Low Power Cluster with 2b-to-32b Fully Flexible Bit-Precision and Vector Lockstep Execution Mode.

[BibT_eX]

[DOI]

Proceedings of the 47th ESSCIRC 2021, 2021

Architecting more than Moore: wireless plasticity for massive heterogeneous computer architectures (WiPLASH).

[BibT_eX]

[DOI]

Proceedings of the CF '21: Computing Frontiers Conference, 2021

XpulpNN: Enabling Energy Efficient and Flexible Inference of Quantized Neural Networks on RISC-V based IoT End Nodes.

[BibT_eX]

[DOI]

Proceedings of the 28th IEEE Symposium on Computer Arithmetic, 2021

Streamlining the OpenMP Programming Model on Ultra-Low-Power Multi-core MCUs.

[BibT_eX]

[DOI]

Proceedings of the Architecture of Computing Systems - 34th International Conference, 2021

End-to-end 100-TOPS/W Inference With Analog In-Memory Computing: Are We There Yet?

[BibT_eX]

[DOI]

Proceedings of the 3rd IEEE International Conference on Artificial Intelligence Circuits and Systems, 2021

2020

Always-On 674μ W@4GOP/s Error Resilient Binary Neural Networks With Aggressive SRAM Voltage Scaling on a 22-nm IoT End-Node.

[BibT_eX]

[DOI]

Alfio Di Mauro

Francesco Conti

IEEE Trans. Circuits Syst., 2020

Modular Design and Optimization of Biomedical Applications for Ultralow Power Heterogeneous Platforms.

[BibT_eX]

[DOI]

Elisabetta De Giovanni

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

A custom processor for protocol-independent packet parsing.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2020

Performance-aware predictive-model-based on-chip body-bias regulation strategy for an ULP multi-core cluster in 28 nm UTBB FD-SOI.

[BibT_eX]

[DOI]

Integr., 2020

Exploring NEURAghe: A Customizable Template for APSoC-Based CNN Inference at the Edge.

[BibT_eX]

[DOI]

IEEE Embed. Syst. Lett., 2020

Impact of Memory Voltage Scaling on Accuracy and Resilience of Deep Learning Based Edge Devices.

[BibT_eX]

[DOI]

IEEE Des. Test, 2020

XpulpNN: Enabling Energy Efficient and Flexible Inference of Quantized Neural Network on RISC-V based IoT End Nodes.

[BibT_eX]

[DOI]

CoRR, 2020

Graphene-based Wireless Agile Interconnects for Massive Heterogeneous Multi-chip Processors.

[BibT_eX]

[DOI]

CoRR, 2020

A transprecision floating-point cluster for efficient near-sensor data analytics.

[BibT_eX]

[DOI]

CoRR, 2020

Performance-Aware Predictive-Model-Based On-Chip Body-Bias Regulation Strategy for an ULP Multi-Core Cluster in 28nm UTBB FD-SOI.

[BibT_eX]

[DOI]

CoRR, 2020

Always-On 674uW @ 4GOP/s Error Resilient Binary Neural Networks with Aggressive SRAM Voltage Scaling on a 22nm IoT End-Node.

[BibT_eX]

[DOI]

Alfio Di Mauro

Francesco Conti

CoRR, 2020

Flexible Software-Defined Packet Processing Using Low-Area Hardware.

[BibT_eX]

[DOI]

IEEE Access, 2020

A Mixed-Precision RISC-V Processor for Extreme-Edge DNN Inference.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE Computer Society Annual Symposium on VLSI, 2020

An Energy-Efficient Low-Voltage Swing Transceiver for mW-Range IoT End-Nodes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2020

TRANSPIRE: An energy-efficient TRANSprecision floating-point Programmable archItectuRE.

[BibT_eX]

[DOI]

Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

Energy-Efficient Two-level Instruction Cache Design for an Ultra-Low-Power Multi-core Cluster.

[BibT_eX]

[DOI]

Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

XpulpNN: Accelerating Quantized Neural Networks on RISC-V Processors Through ISA Extensions.

[BibT_eX]

[DOI]

Proceedings of the 2020 Design, Automation & Test in Europe Conference & Exhibition, 2020

Enabling mixed-precision quantized neural networks in extreme-edge devices.

[BibT_eX]

[DOI]

Proceedings of the 17th ACM International Conference on Computing Frontiers, 2020

Neuro-PULP: A Paradigm Shift Towards Fully Programmable Platforms for Neural Interfaces.

[BibT_eX]

[DOI]

Timothy G. Constandinou

Proceedings of the 2nd IEEE International Conference on Artificial Intelligence Circuits and Systems, 2020

2019

An Energy-Efficient Integrated Programmable Array Accelerator and Compilation Flow for Near-Sensor Ultralow Power Processing.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2019

BioWolf: A Sub-10-mW 8-Channel Advanced Brain-Computer Interface Platform With a Nine-Core Processor and BLE Connectivity.

[BibT_eX]

[DOI]

IEEE Trans. Biomed. Circuits Syst., 2019

Online Learning and Classification of EMG-Based Gestures on a Parallel Ultra-Low Power Platform Using Hyperdimensional Computing.

[BibT_eX]

[DOI]

Simone Benatti

Fabio Montagna

Abbas Rahimi

IEEE Trans. Biomed. Circuits Syst., 2019

Mr.Wolf: An Energy-Precision Scalable Parallel Ultra Low Power SoC for IoT Edge Processing.

[BibT_eX]

[DOI]

IEEE J. Solid State Circuits, 2019

Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine.

[BibT_eX]

[DOI]

IEEE J. Emerg. Sel. Topics Circuits Syst., 2019

PULP-NN: Accelerating Quantized Neural Networks on Parallel Ultra-Low-Power RISC-V Processors.

[BibT_eX]

[DOI]

CoRR, 2019

An Explicitly Parallel Architecture for Packet Processing in Software Defined Networks.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Nordic Circuits and Systems Conference, 2019

PULP-NN: A Computing Library for Quantized Neural Network inference at the edge on RISC-V Based Parallel Ultra Low Power Clusters.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Conference on Electronics, Circuits and Systems, 2019

A PULP-based Parallel Power Controller for Future Exascale Systems.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Conference on Electronics, Circuits and Systems, 2019

Reducing Crossbar Costs in the Match-Action Pipeline.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Conference on High Performance Switching and Routing, 2019

Design and Evaluation of SmallFloat SIMD extensions to the RISC-V ISA.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

Hardware-Accelerated Energy-Efficient Synchronization and Communication for Ultra-Low-Power Tightly Coupled Clusters.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2019

DORY: Lightweight memory hierarchy management for deep NN inference on IoT endnodes: work-in-progress.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Hardware/Software Codesign and System Synthesis Companion, 2019

2018

NEURAghe: Exploiting CPU-FPGA Synergies for Efficient and Flexible CNN Inference Acceleration on Zynq SoCs.

[BibT_eX]

[DOI]

ACM Trans. Reconfigurable Technol. Syst., 2018

Neurostream: Scalable and Energy Efficient Deep Learning with Smart Memory Cubes.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2018

The Quest for Energy-Efficient I$ Design in Ultra-Low-Power Clustered Many-Cores.

[BibT_eX]

[DOI]

IEEE Trans. Multi Scale Comput. Syst., 2018

A Heterogeneous Multicore System on Chip for Energy Efficient Brain Inspired Computing.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. II Express Briefs, 2018

Synergistic HW/SW Approximation Techniques for Ultralow-Power Parallel Computing.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018

YodaNN: An Architecture for Ultralow Power Binary-Weight CNN Acceleration.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018

Power mitigation of a heterogeneous multicore architecture on FPGA/ASIC by DFS/DVFS techniques.

[BibT_eX]

[DOI]

Sajjad Nouri

Microprocess. Microsystems, 2018

A sensor fusion approach for drowsiness detection in wearable ultra-low-power systems.

[BibT_eX]

[DOI]

Simone Benatti

Inf. Fusion, 2018

Low-latency Packet Parsing in Software Defined Networks.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Nordic Circuits and Systems Conference, 2018

Hyperdrive: A Systolically Scalable Binary-Weight CNN Inference Engine for mW IoT End-Nodes.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Computer Society Annual Symposium on VLSI, 2018

Live Demonstration: Body-Bias Based Performance Monitoring and Compensation for a Near-Threshold Multi-Core Cluster in 28nm FD-SOI Technology.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

A Transprecision Floating-Point Architecture for Energy-Efficient Embedded Computing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Sub-mW multi-Gbps chip-to-chip communication Links for Ultra-Low Power IoT end-nodes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

A Heterogeneous Cluster with Reconfigurable Accelerator for Energy Efficient Near-Sensor Data Analytics.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Compressed Sensing Based Seizure Detection for an Ultra Low Power Multi-core Architecture.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

A Fully Programmable eFPGA-Augmented SoC for Smart-Power Applications.

[BibT_eX]

[DOI]

Francesco Renzini

Eleonora Franchi Scarselli

Claudio Mucci

Roberto Canegallo

Proceedings of the 25th IEEE International Conference on Electronics, Circuits and Systems, 2018

Mr. Wolf: A 1 GFLOP/s Energy-Proportional Parallel Ultra Low Power SoC for IOT Edge Processing.

[BibT_eX]

[DOI]

Proceedings of the 44th IEEE European Solid State Circuits Conference, 2018

A transprecision floating-point platform for ultra-low power computing.

[BibT_eX]

[DOI]

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

Energy proportionality in near-threshold computing servers and cloud data centers: Consolidating or Not?

[BibT_eX]

[DOI]

Ali Pahlevan

Yasir Mahmood Qureshi

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

PULP-HD: accelerating brain-inspired high-dimensional computing on a parallel ultra-low power platform.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Design Automation Conference, 2018

Always-ON visual node with a hardware-software event-based binarized neural network inference engine.

[BibT_eX]

[DOI]

Proceedings of the 15th ACM International Conference on Computing Frontiers, 2018

An Explicitly Parallel Architecture for Packet Parsing in Software Defined Networks.

[BibT_eX]

[DOI]

Proceedings of the 29th IEEE International Conference on Application-specific Systems, 2018

GAP-8: A RISC-V SoC for AI at the Edge of the IoT.

[BibT_eX]

[DOI]

Proceedings of the 29th IEEE International Conference on Application-specific Systems, 2018

2017

Near-Threshold RISC-V Core With DSP Extensions for Scalable IoT Endpoint Devices.

[BibT_eX]

[DOI]

Michael Gautschi

IEEE Trans. Very Large Scale Integr. Syst., 2017

Logic-Base Interconnect Design for Near Memory Computing in the Smart Memory Cube.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2017

An IoT Endpoint System-on-Chip for Secure and Energy-Efficient Near-Sensor Analytics.

[BibT_eX]

[DOI]

Francesco Conti

Robert Schilling

Antonio Pullini

Frank Kagan Gürkaynak

Michael Muehlberghuber

IEEE Trans. Circuits Syst. I Regul. Pap., 2017

Energy-Efficient Near-Threshold Parallel Computing: The PULPv2 Cluster.

[BibT_eX]

[DOI]

Frank Kagan Gürkaynak

IEEE Micro, 2017

Increasing the energy efficiency of microcontroller platforms with low-design margin co-processors.

[BibT_eX]

[DOI]

Microprocess. Microsystems, 2017

A Sub-mW IoT-Endnode for Always-On Visual Monitoring and Smart Triggering.

[BibT_eX]

[DOI]

IEEE Internet Things J., 2017

A Self-Aware Architecture for PVT Compensation and Power Nap in Near Threshold Processors.

[BibT_eX]

[DOI]

Igor Loi

Antonio Pullini

Thomas Christoph Müller

IEEE Des. Test, 2017

Slow and steady wins the race? A comparison of ultra-low-power RISC-V cores for Internet-of-Things applications.

[BibT_eX]

[DOI]

Proceedings of the 27th International Symposium on Power and Timing Modeling, 2017

μDMA: An autonomous I/O subsystem for IoT end-nodes.

[BibT_eX]

[DOI]

Proceedings of the 27th International Symposium on Power and Timing Modeling, 2017

Temperature and process-aware performance monitoring and compensation for an ULP multi-core cluster in 28nm UTBB FD-SOI technology.

[BibT_eX]

[DOI]

Proceedings of the 27th International Symposium on Power and Timing Modeling, 2017

A wearable EEG-based drowsiness detection system with blink duration and alpha waves analysis.

[BibT_eX]

[DOI]

Simone Benatti

Proceedings of the 8th International IEEE/EMBS Conference on Neural Engineering, 2017

A 142MOPS/mW integrated programmable array accelerator for smart visual processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Efficient mapping of CDFG onto coarse-grained reconfigurable array architectures.

[BibT_eX]

[DOI]

Proceedings of the 22nd Asia and South Pacific Design Automation Conference, 2017

2016

PULP: A Ultra-Low Power Parallel Accelerator for Energy-Efficient and Flexible Embedded Vision.

[BibT_eX]

[DOI]

J. Signal Process. Syst., 2016

Power, Area, and Performance Optimization of Standard Cell Memory Arrays Through Controlled Placement.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2016

YodaNN: An Ultra-Low Power Convolutional Neural Network Accelerator Based on Binary Weights.

[BibT_eX]

[DOI]

Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2016

A heterogeneous multi-core system-on-chip for energy efficient brain inspired vision.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

Energy-efficient design of an always-on smart visual trigger.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Smart Cities Conference, 2016

Always-on motion detection with application-level error control on a near-threshold approximate computing platform.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Electronics, Circuits and Systems, 2016

A 2 MS/s 10A Hall current sensor SoC with digital compressive sensing encoder in 0.16 µm BCD.

[BibT_eX]

[DOI]

Proceedings of the ESSCIRC Conference 2016: 42<sup>nd</sup> European Solid-State Circuits Conference, 2016

Towards near-threshold server processors.

[BibT_eX]

[DOI]

Ali Pahlevan

Javier Picorel

Arash Pourhabibi Zarandi

Marina Zapater

Andrea Bartolini

Pablo García Del Valle

David Atienza

Babak Falsafi

Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016

Enabling the heterogeneous accelerator model on ultra-low power microcontroller platforms.

[BibT_eX]

[DOI]

Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition, 2016

193 MOPS/mW @ 162 MOPS, 0.32V to 1.15V voltage range multi-core accelerator for energy efficient parallel and sequential digital processing.

[BibT_eX]

[DOI]

Frank Kagan Gürkaynak

Proceedings of the 2016 IEEE Symposium in Low-Power and High-Speed Chips, 2016

Scalable EEG seizure detection on an ultra low power multi-core architecture.

[BibT_eX]

[DOI]

Proceedings of the IEEE Biomedical Circuits and Systems Conference, 2016

Design and Evaluation of a Processing-in-Memory Architecture for the Smart Memory Cube.

[BibT_eX]

[DOI]

Proceedings of the Architecture of Computing Systems - ARCS 2016, 2016

2015

A Modular Shared L2 Memory Design for 3-D Integration.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2015

Synergistic Architecture and Programming Model Support for Approximate Micropower Computing.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Computer Society Annual Symposium on VLSI, 2015

PULP: A parallel ultra low power platform for next generation IoT applications.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Hot Chips 27 Symposium (HCS), 2015

Reducing energy consumption in microcontroller-based platforms with low design margin co-processors.

[BibT_eX]

[DOI]

Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

High performance AXI-4.0 based interconnect for extensible smart memory cubes.

[BibT_eX]

[DOI]

Proceedings of the 2015 Design, Automation & Test in Europe Conference & Exhibition, 2015

Exploring multi-banked shared-L1 program cache on ultra-low power, tightly coupled processor clusters.

[BibT_eX]

[DOI]

Proceedings of the 12th ACM International Conference on Computing Frontiers, 2015

Controlled placement of standard cell memory arrays for high density and low power in 28nm FD-SOI.

[BibT_eX]

[DOI]

Adam Teman

Pascal Andreas Meinerzhagen

Andreas Peter Burg

Proceedings of the 20th Asia and South Pacific Design Automation Conference, 2015

2014

Multicore Signal Processing Platform With Heterogeneous Configurable Hardware Accelerators.

[BibT_eX]

[DOI]

IEEE Trans. Very Large Scale Integr. Syst., 2014

Energy-efficient vision on the PULP platform for ultra-low power parallel computing.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Workshop on Signal Processing Systems, 2014

Customizing an open source processor to fit in an ultra-low power cluster with a shared L1 memory.

[BibT_eX]

[DOI]

Michael Gautschi