Paul Gratz

Proceedings of the 20th ACM International Conference on Distributed and Event-based Systems, 2026

Machine Learning-Driven Early Performance Prediction Framework for Accelerated Microarchitecture Simulation.

[BibT_eX]

[DOI]

Jered Dominguez-Trujillo

Galen M. Shipman

Kevin Sheridan

Proceedings of the Design, Automation & Test in Europe Conference, 2026

2025

Workload Characterization for Branch Predictability.

[BibT_eX]

[DOI]

FNU Vikas

CoRR, December, 2025

Targeted Wearout Attacks in Microprocessor Cores.

[BibT_eX]

[DOI]

CoRR, August, 2025

KiSS: A Novel Container Size-Aware Memory Management Policy for Serverless in Edge-Cloud Continuum.

[BibT_eX]

[DOI]

Sabyasachi Gupta

John Lusher

CoRR, February, 2025

R-Max: A Method for Approximating the Benefit of Ideal Prefetching and Replacement Policy.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2025

Estimating CPI Stacks From Multiplexed Performance Counter Data Using Machine Learning.

[BibT_eX]

[DOI]

Jered Dominguez-Trujillo

Kevin Sheridan

IEEE Comput. Archit. Lett., 2025

Correct Wrong Path.

[BibT_eX]

[DOI]

Bhargav Reddy Godala

Sankara Prasad Ramesh

IEEE Comput. Archit. Lett., 2025

Benchmarking 3D Gaussian Splatting Rendering.

[BibT_eX]

[DOI]

Saichand Samudrala

Sushant Kondguli

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2025

Light-weight Cache Replacement for Instruction Heavy Workloads.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

Skia: Exposing Shadow Branches.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024

Coherence Attacks and Countermeasures in Interposer-based Chiplet Systems.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., June, 2024

Exposing Shadow Branches.

[BibT_eX]

[DOI]

CoRR, 2024

Correct Wrong Path.

[BibT_eX]

[DOI]

Bhargav Reddy Godala

Sankara Prasad Ramesh

CoRR, 2024

Aiding Microprocessor Performance Validation with Machine Learning.

[BibT_eX]

[DOI]

Erick Carvajal Barboza

Mahesh Ketkar

Jiang Hu

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2024

Flow Correlator: A Flow Table Cache Management Strategy.

[BibT_eX]

[DOI]

Luke McHale

Alex Sprintson

Proceedings of the 33rd International Conference on Computer Communications and Networks, 2024

2023

KVRangeDB: Range Queries for a Hash-based Key-Value Device.

[BibT_eX]

[DOI]

Qing Zheng

Jason Lee

Bradley W. Settlemyer

ACM Trans. Storage, August, 2023

Machine Learning for Microprocessor Performance Bug Localization.

[BibT_eX]

[DOI]

Erick Carvajal Barboza

CoRR, 2023

Last-Level Cache Insertion and Promotion Policy in the Presence of Aggressive Prefetching.

[BibT_eX]

[DOI]

Elvira Teran

IEEE Comput. Archit. Lett., 2023

A Characterization of the Effects of Software Instruction Prefetching on an Aggressive Front-end.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2023

2022

Software Hint-Driven Data Management for Hybrid Memory in Mobile Systems.

[BibT_eX]

[DOI]

ACM Trans. Embed. Comput. Syst., 2022

SIMD-Matcher: A SIMD-based Arbitrary Matching Framework.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2022

Reducing Minor Page Fault Overheads through Enhanced Page Walker.

[BibT_eX]

[DOI]

Chandrahas Tirumalasetty

Ayman Abouelwafa

ACM Trans. Archit. Code Optim., 2022

The Championship Simulator: Architectural Simulation for Education and Competition.

[BibT_eX]

[DOI]

CoRR, 2022

Hardware Trojan Threats to Cache Coherence in Modern 2.5D Chiplet Systems.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2022

Page Size Aware Cache Prefetching.

[BibT_eX]

[DOI]

Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

Composite Instruction Prefetching.

[BibT_eX]

[DOI]

Samira Mirbagher Ajorpaz

Proceedings of the IEEE 40th International Conference on Computer Design, 2022

SLAP-CC: Set-Level Adaptive Prefetching for Compressed Caches.

[BibT_eX]

[DOI]

Laith M. AlBarakat

Proceedings of the IEEE 40th International Conference on Computer Design, 2022

Stay in your Lane: A NoC with Low-overhead Multi-packet Bypassing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2022

2021

Interposer-Based Root of Trust.

[BibT_eX]

[DOI]

CoRR, 2021

KVRAID: high performance, write efficient, update friendly erasure coding scheme for KV-SSDs.

[BibT_eX]

[DOI]

Rekha Pitchumani

Yang-Seok Ki

Proceedings of the SYSTOR '21: The 14th ACM International Systems and Storage Conference, 2021

SEEC: stochastic escape express channel.

[BibT_eX]

[DOI]

Mayank Parasar

Joshua San Miguel

Proceedings of the International Conference for High Performance Computing, 2021

Pitstop: Enabling a Virtual Network Free Network-on-Chip.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

Automatic Microprocessor Performance Bug Detection.

[BibT_eX]

[DOI]

Erick Carvajal Barboza

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2021

An FPGA-based Hybrid Memory Emulation System.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Field-Programmable Logic and Applications, 2021

CMRC: Comprehensive Microarchitectural Register Coalescing for GPGPUs.

[BibT_eX]

[DOI]

Ahmad M. Radaideh

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

OpenMem: Hardware/Software Cooperative Management for Mobile Memory System.

[BibT_eX]

[DOI]

Proceedings of the 58th ACM/IEEE Design Automation Conference, 2021

2020

Hardware Memory Management for Future Mobile Hybrid Memory Systems.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2020

FPGA-based Hyrbid Memory Emulation System.

[BibT_eX]

[DOI]

CoRR, 2020

Virtualize and share non-volatile memories in user space.

[BibT_eX]

[DOI]

Jaemin Jung

Doug Voigt

CCF Trans. High Perform. Comput., 2020

SB-Fetch: synchronization aware hardware prefetching for chip multiprocessors.

[BibT_eX]

[DOI]

Laith M. AlBarakat

Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

DRAIN: Deadlock Removal for Arbitrary Irregular Networks.

[BibT_eX]

[DOI]

Mayank Parasar

Hossein Farrokhbakht

Joshua San Miguel

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020

Exploiting Zero Data to Reduce Register File and Execution Unit Dynamic Power Consumption in GPGPUs.

[BibT_eX]

[DOI]

Ahmad M. Radaideh

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

A Generic FPGA Accelerator for Minimum Storage Regenerating Codes.

[BibT_eX]

[DOI]

Proceedings of the 25th Asia and South Pacific Design Automation Conference, 2020

2019

GenMatcher: A Generic Clustering-Based Arbitrary Matching Framework.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2019

Optimizing Post-Copy Live Migration with System-Level Checkpoint Using Fabric-Attached Memory.

[BibT_eX]

[DOI]

Yuan Chen

Dejan S. Milojicic

Proceedings of the 2019 IEEE/ACM Workshop on Memory Centric High Performance Computing, 2019

vNVML: An Efficient User Space Library for Virtualizing and Sharing Non-Volatile Memories.

[BibT_eX]

[DOI]

Jaemin Jung

Doug Voigt

Proceedings of the 35th Symposium on Mass Storage Systems and Technologies, 2019

SWAP: Synchronized Weaving of Adjacent Packets for Network Deadlock Resolution.

[BibT_eX]

[DOI]

Mayank Parasar

Joshua San Miguel

Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture, 2019

Perceptron-based prefetch filtering.

[BibT_eX]

[DOI]

Proceedings of the 46th International Symposium on Computer Architecture, 2019

SpecLock: Speculative Lock Forwarding.

[BibT_eX]

[DOI]

Pooria M. Yaghini

George Michelogiannakis

Proceedings of the 37th IEEE International Conference on Computer Design, 2019

The Best of IEEE Computer Architecture Letters in 2018.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

2018

Guest Editorial: Emerging Technologies and Architectures for Manycore Computing Part 1: Hardware Techniques.

[BibT_eX]

[DOI]

Sébastien Le Beux

Ian O'Connor

IEEE Trans. Multi Scale Comput. Syst., 2018

SDPR: Improving Latency and Bandwidth in On-Chip Interconnect Through Simultaneous Dual-Path Routing.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2018

MTB-Fetch: Multithreading Aware Hardware Prefetching for Chip Multiprocessors.

[BibT_eX]

[DOI]

Laith M. AlBarakat

IEEE Comput. Archit. Lett., 2018

Synchronized Progress in Interconnection Networks (SPIN): A New Theory for Deadlock Freedom.

[BibT_eX]

[DOI]

Aniruddh Ramrakhyani

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

2017

Speculative paging for future NVM storage.

[BibT_eX]

[DOI]

Jinchun Kim

Proceedings of the International Symposium on Memory Systems, 2017

Minimal exercise vector generation for reliability improvement.

[BibT_eX]

[DOI]

P. Madhukar Reddy

Stavros Hadjitheophanous

Maria K. Michael

Proceedings of the 23rd IEEE International Symposium on On-Line Testing and Robust System Design, 2017

Kill the Program Counter: Reconstructing Program Behavior in the Processor Cache Hierarchy.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, 2017

2016

GCA: Global Congestion Awareness for Load Balance in Networks-on-Chip.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2016

Resource Sharing Centric Dynamic Voltage and Frequency Scaling for CMP Cores, Uncore, and Memory.

[BibT_eX]

[DOI]

ACM Trans. Design Autom. Electr. Syst., 2016

Path confidence based lookahead prefetching.

[BibT_eX]

[DOI]

Jinchun Kim

Seth H. Pugsley

Chris Wilkerson

Zeshan Chishti

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

2015

Use It or Lose It: Proactive, Deterministic Longevity in Future Chip Multiprocessors.

[BibT_eX]

[DOI]

Hyungjun Kim

Siva Bhanu Krishna Boga

Arseniy Vitkovskiy

Stavros Hadjitheophanous

Maria K. Michael

ACM Trans. Design Autom. Electr. Syst., 2015

Wear-Aware Adaptive Routing for Networks-on-Chips.

[BibT_eX]

[DOI]

Arseni Vitkovski

Proceedings of the 9th International Symposium on Networks-on-Chip, 2015

Dynamic Memory Pressure Aware Ballooning.

[BibT_eX]

[DOI]

Jinchun Kim

Proceedings of the 2015 International Symposium on Memory Systems, 2015

Shared Last-Level Caches and The Case for Longer Timeslices.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Symposium on Memory Systems, 2015

Having your cake and eating it too: Energy savings without performance loss through resource sharing driven power management.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM International Symposium on Low Power Electronics and Design, 2015

Clotho: Proactive wearout deceleration in Chip-Multiprocessor interconnects.

[BibT_eX]

[DOI]

Arseniy Vitkovskiy

Proceedings of the 33rd IEEE International Conference on Computer Design, 2015

Energy-efficient implementations of GF (p) and GF(2m) elliptic curve cryptography.

[BibT_eX]

[DOI]

Proceedings of the 33rd IEEE International Conference on Computer Design, 2015

A control-theoretic approach for energy efficient CPU-GPU subsystem in mobile platforms.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual Design Automation Conference, 2015

Bandwidth-efficient on-chip interconnect designs for GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual Design Automation Conference, 2015

2014

LumiNOC: A Power-Efficient, High-Performance, Photonic Network-on-Chip.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2014

Spatial Locality Speculation to Reduce Energy in Chip-Multiprocessor Networks-on-Chip.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2014

Towards platform level power management in mobile systems.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International System-on-Chip Conference, 2014

STORM: A Simple Traffic-Optimized Router Microarchitecture for Networks-on-Chip.

[BibT_eX]

[DOI]

Proceedings of the Eighth IEEE/ACM International Symposium on Networks-on-Chip, 2014

B-Fetch: Branch Prediction Directed Prefetching for Chip-Multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 47th Annual IEEE/ACM International Symposium on Microarchitecture, 2014

The design space of ultra-low energy asymmetric cryptography.

[BibT_eX]

[DOI]

Andrew D. Targhetta

Donald E. Owen

Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software, 2014

Stochastic Pre-classification for SDN Data Plane Matching.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Conference on Network Protocols, 2014

Up by their bootstraps: Online learning in Artificial Neural Networks for CMP uncore power management.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014

ILP and TLP in shared memory applications: a limit study.

[BibT_eX]

[DOI]

Ehsan Fatehi

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

ARI: Adaptive LLC-memory traffic management.

[BibT_eX]

[DOI]

Sheng Qiu

ACM Trans. Archit. Code Optim., 2013

LumiNOC: A low-latency, high-bandwidth per Watt, photonic Network-on-Chip.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE International Workshop on System Level Interconnect Prediction, 2013

GCA: Global congestion awareness for load balance in Networks-on-Chip.

[BibT_eX]

[DOI]

Mukund Ramakrishna

Alexander Sprintson

Proceedings of the 2013 Seventh IEEE/ACM International Symposium on Networks-on-Chip (NoCS), 2013

Use it or lose it: wear-out and lifetime in future chip multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture, 2013

Bidirectional interconnect design for low latency high bandwidth NoC.

[BibT_eX]

[DOI]

Proceedings of 2013 International Conference on IC Design & Technology, 2013

Power gating with block migration in chip-multiprocessor last-level caches.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE 31st International Conference on Computer Design, 2013

Stochastic Pre-Classification for Software Defined Firewalls.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Computer Communication and Networks, 2013

Dynamic voltage and frequency scaling for shared resources in multicore processor designs.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual Design Automation Conference 2013, 2013

2012

B-Fetch: Branch Prediction Directed Prefetching for In-Order Processors.

[BibT_eX]

[DOI]

Reena Panda

IEEE Comput. Archit. Lett., 2012

In-network Monitoring and Control Policy for DVFS of CMP Networks-on-Chip and Last Level Caches.

[BibT_eX]

[DOI]

Proceedings of the 2012 Sixth IEEE/ACM International Symposium on Networks-on-Chip (NoCS), 2012

Exploiting path diversity for low-latency and high-bandwidth with the dual-path NoC router.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Symposium on Circuits and Systems, 2012

WaveSync: A low-latency source synchronous bypass network-on-chip architecture.

[BibT_eX]

[DOI]

Proceedings of the 30th International IEEE Conference on Computer Design, 2012

LumiNOC: a power-efficient, high-performance, photonic network-on-chip for future parallel architectures.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011

Asynchronous Bypass Channels for Multi-Synchronous NoCs: A Router Microarchitecture, Topology, and Routing Algorithm.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2011

AcENoCs: A Configurable HW/SW Platform for FPGA Accelerated NoC Emulation.

[BibT_eX]

[DOI]

Swapnil Lotlikar

Vinayak Pai

Proceedings of the VLSI Design 2011: 24th International Conference on VLSI Design, 2011

Reducing network-on-chip energy consumption through spatial locality speculation.

[BibT_eX]

[DOI]

Proceedings of the NOCS 2011, 2011

2010

Leveraging Unused Cache Block Words to Reduce Power in CMP Interconnect.

[BibT_eX]

[DOI]

Hyungjun Kim

IEEE Comput. Archit. Lett., 2010

Asynchronous Bypass Channels: Improving Performance for Multi-synchronous NoCs.

[BibT_eX]

[DOI]

Proceedings of the NOCS 2010, 2010

2009

An evaluation of the TRIPS computer system.

[BibT_eX]

[DOI]

Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems, 2009

2008

Regional congestion awareness for load balance in networks-on-chip.

[BibT_eX]

[DOI]

Boris Grot

Proceedings of the 14th International Conference on High-Performance Computer Architecture (HPCA-14 2008), 2008

2007

On-Chip Interconnection Networks of the TRIPS Chip.

[BibT_eX]

[DOI]

Karthikeyan Sankaralingam

Changkyu Kim

Heather Hanson

Premkishore Shivakumar

Doug Burger

IEEE Micro, 2007

Implementation and Evaluation of a Dynamically Routed Processor Operand Network.

[BibT_eX]

[DOI]

Karthikeyan Sankaralingam

Heather Hanson

Premkishore Shivakumar

Robert G. McDonald

Karthikeyan Sankaralingam

Doug Burger

Proceedings of the First International Symposium on Networks-on-Chips, 2007

2006

Distributed Microarchitectural Protocols in the TRIPS Prototype Processor.

[BibT_eX]

[DOI]

Premkishore Shivakumar