Omer Khan

Mohsin Shan

Akif Rehman

Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

IRONHIDE: A Secure Multicore that Efficiently Mitigates Microarchitecture State Attacks for Interactive Applications.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2020

2019

Guest Editors Introduction: Special Section on Emerging Technologies in Computer Design.

[BibT_eX]

[DOI]

Ozgur Sinanoglu

IEEE Trans. Emerg. Top. Comput., 2019

Advancing the State-of-the-Art in Hardware Trojans Detection.

[BibT_eX]

[DOI]

Syed Kamran Haider

Chenglu Jin

Devu Manikantan Shila

Marten van Dijk

IEEE Trans. Dependable Secur. Comput., 2019

Accelerating Synchronization Using Moving Compute to Data Model at 1, 000-core Multicore Scale.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2019

IRONHIDE: A Secure Multicore Architecture that Leverages Hardware Isolation Against Microarchitecture State Attacks.

[BibT_eX]

[DOI]

CoRR, 2019

HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators.

[BibT_eX]

[DOI]

Halit Dogan

Christopher J. Michael

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2019

POSTER: Exploiting Multi-Level Task Dependencies to Prune Redundant Work in Relax-Ordered Task-Parallel Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018

Guest Editorial: Special Section on Defect and Fault Tolerance in VLSI and Nanotechnology.

[BibT_eX]

[DOI]

Maria K. Michael

Salvatore Pontarelli

IEEE Trans. Emerg. Top. Comput., 2018

Declarative Resilience: A Holistic Soft-Error Resilient Multicore Architecture that Trades off Program Accuracy for Efficiency.

[BibT_eX]

[DOI]

ACM Trans. Embed. Comput. Syst., 2018

Multicore Resource Isolation for Deterministic, Resilient and Secure Concurrent Execution of Safety-Critical Applications.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2018

Software-Hardware Managed Last-level Cache Allocation Scheme for Large-Scale NVRAM-Based Multicores Executing Parallel Data Analytics Applications.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Breaking the Oblivious-RAM Bandwidth Wall.

[BibT_eX]

[DOI]

Proceedings of the 36th IEEE International Conference on Computer Design, 2018

Accelerating Synchronization in Graph Analytics Using Moving Compute to Data Model on Tilera TILE-Gx72.

[BibT_eX]

[DOI]

Proceedings of the 36th IEEE International Conference on Computer Design, 2018

2017

Efficient Situational Scheduling of Graph Workloads on Single-Chip Multicores and GPUs.

[BibT_eX]

[DOI]

Christopher J. Michael

IEEE Micro, 2017

Exploiting the Tradeoff between Program Accuracy and Soft-error Resiliency Overhead for Machine Learning Workloads.

[BibT_eX]

[DOI]

CoRR, 2017

Accelerating Graph and Machine Learning Workloads Using a Shared Memory Multicore Architecture with Auxiliary Support for In-hardware Explicit Messaging.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

GraphTuner: An Input Dependence Aware Loop Perforation Scheme for Efficient Execution of Approximated Graph Algorithms.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Design, 2017

2016

Locality-aware data replication in the last-level cache for large scale multicores.

[BibT_eX]

[DOI]

J. Supercomput., 2016

LDAC: Locality-Aware Data Access Control for Large-Scale Multicore Cache Hierarchies.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2016

Efficient Error-Detection and Recovery Mechanisms for Reliability and Resiliency of Multicores.

[BibT_eX]

[DOI]

Proceedings of the 29th International Conference on VLSI Design and 15th International Conference on Embedded Systems, 2016

GPU concurrency choices in graph analytics.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Symposium on Workload Characterization, 2016

Foreword.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems, 2016

2015

The Execution Migration Machine: Directoryless Shared-Memory Architecture.

[BibT_eX]

[DOI]

Computer, 2015

A Cross-Layer Multicore Architecture to Tradeoff Program Accuracy and Resilience Overheads.

[BibT_eX]

[DOI]

Henry Hoffmann

IEEE Comput. Archit. Lett., 2015

Efficient parallel packet processing using a shared memory many-core processor with hardware support to accelerate communication.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Conference on Networking, 2015

Exploring the performance implications of memory safety primitives in many-core processors executing multi-threaded workloads.

[BibT_eX]

[DOI]

Proceedings of the Fourth Workshop on Hardware and Architectural Support for Security and Privacy, 2015

CRONO: A Benchmark Suite for Multithreaded Graph Algorithms Executing on Futuristic Multicores.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Workload Characterization, 2015

M-MAP: Multi-factor memory authentication for secure embedded processors.

[BibT_eX]

[DOI]

Proceedings of the 33rd IEEE International Conference on Computer Design, 2015

Efficient parallelization of path planning workload on single-chip shared-memory multicores.

[BibT_eX]

[DOI]

Kartik Lakshminarasimhan

Proceedings of the 2015 IEEE High Performance Extreme Computing Conference, 2015

OSPREY: Implementation of Memory Consistency Models for Cache Coherence Protocols involving Invalidation-Free Data Access.

[BibT_eX]

[DOI]

Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014

NUCA-L1: A Non-Uniform Access Latency Level-1 Cache Architecture for Multicores Operating at Near-Threshold Voltages.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2014

HaTCh: Hardware Trojan Catcher.

[BibT_eX]

[DOI]

Syed Kamran Haider

Chenglu Jin

Devu Manikantan Shila

Marten van Dijk

IACR Cryptol. ePrint Arch., 2014

Thread Migration Prediction for Distributed Shared Caches.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2014

Locality-aware data replication in the Last-Level Cache.

[BibT_eX]

[DOI]

George Kurian

Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014

Suppressing the Oblivious RAM timing channel while making information leakage and program efficiency trade-offs.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014

2013

Toward Holistic Soft-Error-Resilient Shared-Memory Multicores.

[BibT_eX]

[DOI]

Computer, 2013

A framework to accelerate sequential programs on homogeneous multicores.

[BibT_eX]

[DOI]

Rachael Harding

Proceedings of the 21st IEEE/IFIP International Conference on VLSI and System-on-Chip, 2013

The locality-aware adaptive cache coherence protocol.

[BibT_eX]

[DOI]

George Kurian

Proceedings of the 40th Annual International Symposium on Computer Architecture, 2013

Towards efficient dynamic data placement in NoC-based multicores.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE 31st International Conference on Computer Design, 2013

A private level-1 cache architecture to exploit the latency and capacity tradeoffs in multicores operating at near-threshold voltages.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE 31st International Conference on Computer Design, 2013

MARTHA: architecture for control and emulation of power electronics and smart grid systems.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation and Test in Europe, 2013

2012

HORNET: A Cycle-Level Multicore Simulator.

[BibT_eX]

[DOI]

Nanning Zheng

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2012

Empirical model for cooperative resizing of processor structures to exploit power-performance efficiency at runtime.

[BibT_eX]

[DOI]

IET Circuits Devices Syst., 2012

Low-Latency Mechanisms for Near-Threshold Operation of Private Caches in Shared Memory Multicores.

[BibT_eX]

[DOI]

Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012

A low-overhead dynamic optimization framework for multicores.

[BibT_eX]

[DOI]

Rachael Harding

Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2012

2011

Microvisor: A Runtime Architecture for Thermal Management in Chip Multiprocessors.

[BibT_eX]

[DOI]

Trans. High Perform. Embed. Archit. Compil., 2011

Hardware/Software Codesign Architecture for Online Testing in Chip Multiprocessors.

[BibT_eX]

[DOI]

IEEE Trans. Dependable Secur. Comput., 2011

DCC: A Dependable Cache Coherence Multicore Architecture.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2011

Brief announcement: distributed shared memory based on computation migration.

[BibT_eX]

[DOI]

Mieszko Lis

Keun Sup Shim

Myong Hyon Cho

Proceedings of the SPAA 2011: Proceedings of the 23rd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2011

Time-Predictable Computer Architecture for Cyber-Physical Systems: Digital Emulation of Power Electronics Systems.

[BibT_eX]

[DOI]

Proceedings of the 32nd IEEE Real-Time Systems Symposium, 2011

Deadlock-free fine-grained thread migration.

[BibT_eX]

[DOI]

Proceedings of the NOCS 2011, 2011

Scalable, accurate multicore simulation in the 1000-core era.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2011

ARCc: A case for an architecturally redundant cache-coherence architecture for large multicores.

[BibT_eX]

[DOI]

Proceedings of the IEEE 29th International Conference on Computer Design, 2011

Performance Per Watt Benefits of Dynamic Core Morphing in Asymmetric Multicores.

[BibT_eX]

[DOI]

Rance Rodrigues

Arunachalam Annamalai

Israel Koren

Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010

Thread Relocation: A Runtime Architecture for Tolerating Hard Errors in Chip Multiprocessors.

[BibT_eX]

[DOI]

IEEE Trans. Computers, 2010

Shadow checker (SC): A low-cost hardware scheme for online detection of faults in small memory structures of a microprocessor.

[BibT_eX]

[DOI]

Rance Rodrigues

Proceedings of the 2011 IEEE International Test Conference, 2010

A self-adaptive scheduler for asymmetric multi-cores.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Great Lakes Symposium on VLSI 2009, 2010

A model to exploit power-performance efficiency in superscalar processors via structure resizing.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Great Lakes Symposium on VLSI 2009, 2010

2009

Predictive Thermal Management for Chip Multiprocessors Using Co-designed Virtual Machines.

[BibT_eX]

[DOI]

Proceedings of the High Performance Embedded Architectures and Compilers, 2009

Improving yield and reliability of chip multiprocessors.

[BibT_eX]

[DOI]

Abhisek Pan

Proceedings of the Design, Automation and Test in Europe, 2009

Hardware/software co-design architecture for thermal management of chip multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation and Test in Europe, 2009

A self-adaptive system architecture to address transistor aging.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation and Test in Europe, 2009

2008

A framework for predictive dynamic temperature management of microprocessor systems.

[BibT_eX]

[DOI]