Vijay Nagarajan

Antonios Katsarakis

CoRR, 2021

Avocado: A Secure In-Memory Distributed Storage System.

[BibT_eX]

[DOI]

Proceedings of the 2021 USENIX Annual Technical Conference, 2021

Dvé: Improving DRAM Reliability and Performance On-Demand via Coherent Replication.

[BibT_eX]

[DOI]

Adarsh Patil

Rajeev Balasubramonian

Nicolai Oswald

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

Towards the Synthesis of Coherence/Replication Protocols from Consistency Models via Real-Time Orderings.

[BibT_eX]

[DOI]

Panagiota Fatourou

Proceedings of the PaPoC@EuroSys 2021, 2021

Odyssey: the impact of modern hardware on strongly-consistent replication protocols.

[BibT_eX]

[DOI]

Antonios Katsarakis

Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

2020

A Primer on Memory Consistency and Cache Coherence, Second Edition

[BibT_eX]

[DOI]

Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers, ISBN: 978-3-031-01764-3, 2020

Kite: efficient and available release consistency for the datacenter.

[BibT_eX]

[DOI]

Proceedings of the PPoPP '20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

HieraGen: Automated Generation of Concurrent, Hierarchical Cache Coherence Protocols.

[BibT_eX]

[DOI]

Nicolai Oswald

Daniel J. Sorin

Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture, 2020

Hermes: A Fast, Fault-Tolerant and Linearizable Replication Protocol.

[BibT_eX]

[DOI]

Antonios Katsarakis

M. R. Siavash Katebzadeh

Arpit Joshi

Aleksandar Dragojevic

Boris Grot

Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

Lazy Release Persistency.

[BibT_eX]

[DOI]

Proceedings of the ASPLOS '20: Architectural Support for Programming Languages and Operating Systems, 2020

2019

Poise: Balancing Thread-Level Parallelism and Memory System Performance in GPUs Using Machine Learning.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

2018

Solving the task variant allocation problem in distributed robotics.

[BibT_eX]

[DOI]

Auton. Robots, 2018

ProtoGen: Automatically Generating Directory Cache Coherence Protocols from Atomic Specifications.

[BibT_eX]

[DOI]

Nicolai Oswald

Daniel J. Sorin

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

DHTM: Durable Hardware Transactional Memory.

[BibT_eX]

[DOI]

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

Automatic Parameter Tuning of Motion Planning Algorithms.

[BibT_eX]

[DOI]

Michael F. P. O'Boyle

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Scale-out ccNUMA: exploiting skew with strongly consistent caching.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth EuroSys Conference, 2018

VerC3: A library for explicit state synthesis of concurrent systems.

[BibT_eX]

[DOI]

Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

Blasting through the Front-End Bottleneck with Shotgun.

[BibT_eX]

[DOI]

Rakesh Kumar

Boris Grot

Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems, 2018

2017

Evaluating and mitigating bandwidth bottlenecks across the memory hierarchy in GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Symposium on Performance Analysis of Systems and Software, 2017

ATOM: Atomic Durability in Non-volatile Memory through Hardware Logging.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

Boomerang: A Metadata-Free Architecture for Control Flow Delivery.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

Verification of a lazy cache coherence protocol against a weak memory model.

[BibT_eX]

[DOI]

Proceedings of the 2017 Formal Methods in Computer Aided Design, 2017

2016

Fence Placement for Legacy Data-Race-Free Programs via Synchronization Read Detection.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2016

Cooperative Caching for GPUs.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2016

DCA: a DRAM-cache-aware DRAM controller.

[BibT_eX]

[DOI]

Cheng-Chieh Huang

Arpit Joshi

Proceedings of the International Conference for High Performance Computing, 2016

Task Variant Allocation in Distributed Robotics.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems XII, University of Michigan, Ann Arbor, Michigan, USA, June 18, 2016

C<sup>3</sup>D: Mitigating the NUMA bottleneck via coherent DRAM caches.

[BibT_eX]

[DOI]

Proceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016

Automatic configuration of ROS applications for near-optimal performance.

[BibT_eX]

[DOI]

José Cano

Alejandro Bordallo

Subramanian Ramamoorthy

Sethu Vijayakumar

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Characterizing memory bottlenecks in GPGPU workloads.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Symposium on Workload Characterization, 2016

McVerSi: A test generation framework for fast memory consistency verification in simulation.

[BibT_eX]

[DOI]

Marco Elver

Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016

2015

Understanding the Effects of Data Corruption on Application Behavior Based on Data Characteristics.

[BibT_eX]

[DOI]

Georgios Stefanakis

Marcelo Cintra

Proceedings of the Computer Safety, Reliability, and Security, 2015

Efficient persist barriers for multicores.

[BibT_eX]

[DOI]

Proceedings of the 48th International Symposium on Microarchitecture, 2015

Dynamic process migration in heterogeneous ROS-based environments.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Advanced Robotics, 2015

RC3: Consistency Directed Cache Coherence for x86-64 with RC Extensions.

[BibT_eX]

[DOI]

Marco Elver

Proceedings of the 2015 International Conference on Parallel Architectures and Compilation, 2015

2014

Erratum: A system for debugging via online tracing and dynamic slicing.

[BibT_eX]

[DOI]

Dennis Jeffrey

Softw. Pract. Exp., 2014

Fence Scoping.

[BibT_eX]

[DOI]

Changhui Lin

Proceedings of the International Conference for High Performance Computing, 2014

Static Approximation of MPI Communication Graphs for Optimized Process Placement.

[BibT_eX]

[DOI]

Andrew J. McPherson

Marcelo Cintra

Proceedings of the Languages and Compilers for Parallel Computing, 2014

Increasing cache capacity via critical-words-only cache.

[BibT_eX]

[DOI]

Cheng-Chieh Huang

Proceedings of the 32nd IEEE International Conference on Computer Design, 2014

TSO-CC: Consistency directed cache coherence for TSO.

[BibT_eX]

[DOI]

Marco Elver

Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014

ATCache: reducing DRAM cache latency via a small SRAM tag cache.

[BibT_eX]

[DOI]

Cheng-Chieh Huang

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

Fast RMWs for TSO: semantics and implementation.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation, 2013

Address-aware fences.

[BibT_eX]

[DOI]

Changhui Lin

Proceedings of the International Conference on Supercomputing, 2013

2012

Erratum: A system for debugging via online tracing and dynamic slicing.

[BibT_eX]

[DOI]

Softw. Pract. Exp., 2012

A system for debugging via online tracing and dynamic slicing.

[BibT_eX]

[DOI]

Softw. Pract. Exp., 2012

Efficient Sequential Consistency Using Conditional Fences.

[BibT_eX]

[DOI]

Changhui Lin

Int. J. Parallel Program., 2012

SuperCoP: a general, correct, and performance-efficient supervised memory system.

[BibT_eX]

[DOI]

Proceedings of the Computing Frontiers Conference, CF'12, 2012

Efficient sequential consistency via conflict ordering.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Architectural Support for Programming Languages and Operating Systems, 2012

2010

Execution suppression: An automated iterative technique for locating memory errors.

[BibT_eX]

[DOI]

ACM Trans. Program. Lang. Syst., 2010

2009

IMPRESS: Improving Multicore Performance and Reliability via Efficient Software Support for Monitoring.

[BibT_eX]

[DOI]

Vijayanand Nagarajan

PhD thesis, 2009

Compiler-Assisted Memory Encryption for Embedded Processors.

[BibT_eX]

[DOI]

Arvind Krishnaswamy

Trans. High Perform. Embed. Archit. Compil., 2009

Automated dynamic detection of busy-wait synchronizations.

[BibT_eX]

[DOI]

Softw. Pract. Exp., 2009

Runtime monitoring on multicores via OASES.

[BibT_eX]

[DOI]

ACM SIGOPS Oper. Syst. Rev., 2009

Speculative Parallelization of Sequential Loops on Multicores.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2009

Architectural support for shadow memory in multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Virtual Execution Environments, 2009

Speculative Optimizations for Parallel Programs on Multicores.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2009

Self-recovery in server programs.

[BibT_eX]

[DOI]

Dennis Jeffrey

Proceedings of the 8th International Symposium on Memory Management, 2009

ECMon: exposing cache events for monitoring.

[BibT_eX]

[DOI]

Proceedings of the 36th International Symposium on Computer Architecture (ISCA 2009), 2009

2008

Copy or Discard execution model for speculative parallelization on multicores.

[BibT_eX]

[DOI]

Proceedings of the 41st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-41 2008), 2008

Dynamic recognition of synchronization operations for improved data race detection.

[BibT_eX]

[DOI]

Proceedings of the ACM/SIGSOFT International Symposium on Software Testing and Analysis, 2008

Support for symmetric shadow memory in multiprocessors.

[BibT_eX]

[DOI]

Proceedings of the 6th Workshop on Parallel and Distributed Systems: Testing, 2008

Scalable dynamic information flow tracking and its applications.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2007

High-throughput VLSI Implementations of Iterative Decoders and Related Code Construction Problems.

[BibT_eX]

[DOI]

J. VLSI Signal Process., 2007

ONTRAC: A system for efficient ONline TRACing for debugging.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Conference on Software Maintenance (ICSM 2007), 2007

Matching Control Flow of Program Versions.

[BibT_eX]

[DOI]

Proceedings of the 23rd IEEE International Conference on Software Maintenance (ICSM 2007), 2007

2004

The effect of channel side information at transmitter on coding complexity.

[BibT_eX]

[DOI]