Michael Ferdman

Proceedings of the 2026 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2026

2025

A VM-HDL Co-Simulation Framework for Systems with PCIe-Connected FPGAs.

[BibT_eX]

[DOI]

CoRR, January, 2025

What Gets Measured Gets Managed: Mitigating Supply Chain Attacks with a Link Integrity Management System.

[BibT_eX]

[DOI]

Johnny So

Nick Nikiforakis

Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security, 2025

2024

Server Architecture From Enterprise to Post-Moore.

[BibT_eX]

[DOI]

Boris Grot

IEEE Micro, 2024

A Case for Hardware Memoization in Server CPUs.

[BibT_eX]

[DOI]

Farid Samandi

Natheesan Ratnasegar

IEEE Comput. Archit. Lett., 2024

Infrastructure for Exploring SIMT Architecture in General-Purpose Processors.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2024

NUCAlloc: Fine-Grained Block Placement in Hashed Last-Level NUCA Caches.

[BibT_eX]

[DOI]

Raveendra Soori

Shreyas Prabhu

Harpreet Singh Chawla

Proceedings of the 38th ACM International Conference on Supercomputing, 2024

Ready or Not, Here I Come: Characterizing the Security of Prematurely-public Web Applications.

[BibT_eX]

[DOI]

Brian Kondracki

Nick Nikiforakis

Proceedings of the Annual Computer Security Applications Conference, 2024

2023

The More Things Change, the More They Stay the Same: Integrity of Modern JavaScript.

[BibT_eX]

[DOI]

Johnny So

Amogha Udupa Shankaranarayana Gopal

Nick Nikiforakis

Proceedings of the ACM Web Conference 2023, 2023

TAILCHECK: A Lightweight Heap Overflow Detection Mechanism with Page Protection and Tagged Pointers.

[BibT_eX]

[DOI]

Raveendra Soori

Dongyoon Lee

Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Waverunner: An Elegant Approach to Hardware Acceleration of State Machine Replication.

[BibT_eX]

[DOI]

Mohammadreza Alimadadi

Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

2022

Guest Editorial: IEEE TC Special Issue: Hardware Acceleration of Machine Learning.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2022

An incrementally updatable and scalable system for large-scale sequence search using the Bentley-Saxe transformation.

[BibT_eX]

[DOI]

Bioinform., 2022

Domains Do Change Their Spots: Quantifying Potential Abuse of Residual Trust.

[BibT_eX]

[DOI]

Proceedings of the 43rd IEEE Symposium on Security and Privacy, 2022

AppBastion: Protection from Untrusted Apps and OSes on ARM.

[BibT_eX]

[DOI]

Darius Suciu

Radu Sion

Proceedings of the Computer Security - ESORICS 2022, 2022

2021

Practical Model Checking on FPGAs.

[BibT_eX]

[DOI]

ACM Trans. Reconfigurable Technol. Syst., 2021

On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in Transformers.

[BibT_eX]

[DOI]

Niranjan Balasubramanian

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

An Efficient, Scalable, and Exact Representation of High-Dimensional Color Information Enabled Using de Bruijn Graph Search.

[BibT_eX]

[DOI]

J. Comput. Biol., 2020

Flick: Fast and Lightweight ISA-Crossing Call for Heterogeneous-ISA Environments.

[BibT_eX]

[DOI]

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

A Scheduling Approach to Incremental Maintenance of Datalog Programs.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

FPGA-Accelerated Samplesort for Large Data Sets.

[BibT_eX]

[DOI]

Proceedings of the FPGA '20: The 2020 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2020

2019

Argus: An End-to-End Framework for Accelerating CNNs on FPGAs.

[BibT_eX]

[DOI]

IEEE Micro, 2019

Massively Parallel Server Processors.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2019

x86-64 instruction usage among C/C++ applications.

[BibT_eX]

[DOI]

Proceedings of the 12th ACM International Conference on Systems and Storage, 2019

Swarm Model Checking on the GPU.

[BibT_eX]

[DOI]

Proceedings of the Model Checking Software - 26th International Symposium, 2019

An Efficient, Scalable and Exact Representation of High-Dimensional Color Information Enabled via de Bruijn Graph Search.

[BibT_eX]

[DOI]

Proceedings of the Research in Computational Molecular Biology, 2019

Runtime-Programmable Pipelines for Model Checkers on FPGAs.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on Field Programmable Logic and Applications, 2019

Sorting Large Data Sets with FPGA-Accelerated Samplesort.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2019

2018

Panning for gold.com: Understanding the Dynamics of Domain Dropcatching.

[BibT_eX]

[DOI]

Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Mantis: A Fast, Small, and Exact Large-Scale Sequence-Search Index.

[BibT_eX]

[DOI]

Proceedings of the Research in Computational Molecular Biology, 2018

Taming the Killer Microsecond.

[BibT_eX]

[DOI]

Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, 2018

Impact of Device Performance on Mobile Internet QoE.

[BibT_eX]

[DOI]

Mallesham Dasari

Santiago Vargas

Arani Bhattacharya

Aruna Balasubramanian

Samir R. Das

Proceedings of the Internet Measurement Conference 2018, 2018

Medusa: A Scalable Interconnect for Many-Port DNN Accelerators and Wide DRAM Controller Interfaces.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Field Programmable Logic and Applications, 2018

FPGASwarm: High Throughput Model Checking on FPGAs.

[BibT_eX]

[DOI]

Shenghsun Cho

Proceedings of the 28th International Conference on Field Programmable Logic and Applications, 2018

A Full-System VM-HDL Co-Simulation Framework for Servers with PCIe-Connected FPGAs.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2018

2017

Maximizing CNN Accelerator Efficiency Through Resource Partitioning.

[BibT_eX]

[DOI]

Proceedings of the 44th Annual International Symposium on Computer Architecture, 2017

Storage-Efficient Batching for Minimizing Bandwidth of Fully-Connected Neural Network Layers (Abstract Only).

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2017

Escher: A CNN Accelerator with Flexible Buffering to Minimize Off-Chip Transfer.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2017

2016

Fused-layer CNN accelerators.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Demystifying cloud benchmarking.

[BibT_eX]

[DOI]

Tapti Palit

Proceedings of the 2016 IEEE International Symposium on Performance Analysis of Systems and Software, 2016

Overcoming resource underutilization in spatial CNN accelerators.

[BibT_eX]

[DOI]

Proceedings of the 26th International Conference on Field Programmable Logic and Applications, 2016

2015

A Comprehensive Implementation and Evaluation of Direct Interrupt Delivery.

[BibT_eX]

[DOI]

Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, 2015

Architectural Support for Dynamic Linking.

[BibT_eX]

[DOI]

Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, 2015

2014

A Case for Specialized Processors for Scale-Out Workloads.

[BibT_eX]

[DOI]

Adrian Daniel Popescu

Anastasia Ailamaki

IEEE Micro, 2014

DIMMer: A case for turning off DIMMs in clouds.

[BibT_eX]

[DOI]

Proceedings of the ACM Symposium on Cloud Computing, 2014

2012

Quantifying the Mismatch between Emerging Scale-Out Applications and Modern Processors.

[BibT_eX]

[DOI]

Adrian Daniel Popescu

Anastasia Ailamaki

ACM Trans. Comput. Syst., 2012

Scale-out processors.

[BibT_eX]

[DOI]

Proceedings of the 39th International Symposium on Computer Architecture (ISCA 2012), 2012

Clearing the clouds: a study of emerging scale-out workloads on modern hardware.

[BibT_eX]

[DOI]

Adrian Daniel Popescu

Anastasia Ailamaki

Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, 2012

2011

Toward Dark Silicon in Servers.

[BibT_eX]

[DOI]

IEEE Micro, 2011

Spatial Memory Streaming.

[BibT_eX]

[DOI]

J. Instr. Level Parallelism, 2011

Proactive instruction fetch.

[BibT_eX]

[DOI]

Cansu Kaynak

Proceedings of the 44rd Annual IEEE/ACM International Symposium on Microarchitecture, 2011

Cuckoo directory: A scalable directory for many-core systems.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on High-Performance Computer Architecture (HPCA-17 2011), 2011

2010

Making Address-Correlated Prefetching Practical.

[BibT_eX]

[DOI]

IEEE Micro, 2010

Near-Optimal Cache Block Placement with Reactive Nonuniform Cache Architectures.

[BibT_eX]

[DOI]

IEEE Micro, 2010

TurboTag: lookup filtering to reduce coherence directory power.

[BibT_eX]

[DOI]

Proceedings of the 2010 International Symposium on Low Power Electronics and Design, 2010

2009

Reactive NUCA: near-optimal block placement and replication in distributed caches.

[BibT_eX]

[DOI]

Proceedings of the 36th International Symposium on Computer Architecture (ISCA 2009), 2009

Practical off-chip meta-data for temporal memory streaming.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on High-Performance Computer Architecture (HPCA-15 2009), 2009

2008

Cache bursts: A new approach for eliminating dead blocks and increasing cache efficiency.

[BibT_eX]

[DOI]

Proceedings of the 41st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-41 2008), 2008

Temporal instruction fetch streaming.

[BibT_eX]

[DOI]

Proceedings of the 41st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-41 2008), 2008

Temporal streams in commercial server applications.

[BibT_eX]

[DOI]

Proceedings of the 4th International Symposium on Workload Characterization (IISWC 2008), 2008

2007

Last-Touch Correlated Data Streaming.

[BibT_eX]

[DOI]