Nuwan Jayasena

ORCID: 0009-0005-2973-9479

According to our database, Nuwan Jayasena authored at least 45 papers between 2000 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2026

The qs Inequality: Quantifying the Double Penalty of Mixture-of-Experts at Inference.

Vignesh Adhinarayanan

Nuwan Jayasena

CoRR, March, 2026

RAPID-Serve: Resource-efficient and Accelerated P/D Intra-GPU Disaggregation.

Amna Masood

Pratishtha Gaur

Nuwan Jayasena

CoRR, January, 2026

2025

GOLDYLOC: Global Optimizations & Lightweight Dynamic Logic for Concurrency.

ACM Trans. Archit. Code Optim., June, 2025

Concurrent PIM and Load/Store Servicing in PIM-Enabled Memory.

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2025

2024

Global Optimizations & Lightweight Dynamic Logic for Concurrency.

CoRR, 2024

PIM-Potential: Broadening the Acceleration Reach of PIM Architectures.

Johnathan Alsop

Shaizeen Aga

Mohamed Assem Ibrahim

Mahzabeen Islam

Nuwan Jayasena

Andrew McCrabb

Proceedings of the International Symposium on Memory Systems, 2024

T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute & Collectives.

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

Inclusive-PIM: Hardware-Software Co-design for Broad Acceleration on Commercial PIM Architectures.

Johnathan Alsop

Shaizeen Aga

Mohamed Assem Ibrahim

Mahzabeen Islam

Andrew McCrabb

Nuwan Jayasena

CoRR, 2023

Computation vs. Communication Scaling for Future Transformers on Future Hardware.

CoRR, 2023

A Research Retrospective on AMD's Exascale Computing Journey.

Gabriel H. Loh

Michael J. Schulte

Mike Ignatowski

Vignesh Adhinarayanan

Kishore Punniyamurthy

Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware.

Proceedings of the IEEE International Symposium on Workload Characterization, 2023

2022

Demystifying BERT: System Design Implications.

Proceedings of the IEEE International Symposium on Workload Characterization, 2022

2021

Demystifying BERT: Implications for Accelerator Design.

CoRR, 2021

2020

Morton filters: fast, compressed sparse cuckoo filters.

Alex D. Breslow

Nuwan Jayasena

VLDB J., 2020

Memory Performance Optimization.

Nuwan Jayasena

Proceedings of the 10th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2020

SeqPoint: Identifying Representative Iterations of Sequence-Based Neural Networks.

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2020

2019

Co-ML: a case for collaborative ML acceleration using near-data processing.

Shaizeen Aga

Nuwan Jayasena

Mike Ignatowski

Proceedings of the International Symposium on Memory Systems, 2019

2018

CODA: Enabling Co-location of Computation and Data for Multiple GPU Systems.

ACM Trans. Archit. Code Optim., 2018

Morton Filters: Faster, Space-Efficient Cuckoo Filters via Biasing, Compression, and Decoupled Logical Sparsity.

Alexander Dodd Breslow

Nuwan Jayasena

Proc. VLDB Endow., 2018

RegMutex: Inter-Warp GPU Register Time-Sharing.

Farzad Khorasani

Hodjat Asghari Esfeden

Amin Farmahini Farahani

Nuwan Jayasena

Vivek Sarkar

Proceedings of the 45th ACM/IEEE Annual International Symposium on Computer Architecture, 2018

2017

Exploring the Processing-in-Memory design space.

J. Syst. Archit., 2017

CODA: Enabling Co-location of Computation and Data for Near-Data Processing.

CoRR, 2017

MemPod: A Clustered Architecture for Efficient and Scalable Migration in Flat Address Space Multi-level Memories.

Proceedings of the 2017 IEEE International Symposium on High Performance Computer Architecture, 2017

DVFS Space Exploration in Power Constrained Processing-in-Memory Systems.

Proceedings of the Architecture of Computing Systems - ARCS 2017, 2017

HBM-Resident Prefetching for Heterogeneous Memory System.

Proceedings of the Architecture of Computing Systems - ARCS 2017, 2017

2016

Near-Memory Data Services.

Karthikeyan Sankaralingam

Cristian Estan

IEEE Micro, 2016

Horton Tables: Fast Hash Tables for In-Memory Data-Intensive Computing.

Proceedings of the 2016 USENIX Annual Technical Conference, 2016

Analytical Study on Bandwidth Efficiency of Heterogeneous Memory Systems.

Amin Farmahini Farahani

David Roberts

Nuwan Jayasena

Proceedings of the Second International Symposium on Memory Systems, 2016

Fine-Grained Task Migration for Graph Algorithms Using Processing in Memory.

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

HADM: Hybrid Analysis for Detection of Malware.

Proceedings of SAI Intelligent Systems Conference (IntelliSys) 2016, 2016

Prefetching Techniques for Near-memory Throughput Processors.

Proceedings of the 2016 International Conference on Supercomputing, 2016

2015

Achieving Exascale Capabilities through Heterogeneous Computing.

IEEE Micro, 2015

GPGPU performance and power estimation using machine learning.

Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Understanding idle behavior and power gating mechanisms in the context of modern benchmarks on CPU-GPU Integrated systems.

Proceedings of the 21st IEEE International Symposium on High Performance Computer Architecture, 2015

Processing-in-Memory: Exploring the Design Space.

Proceedings of the Architecture of Computing Systems - ARCS 2015, 2015

2014

A comparison of core power gating strategies implemented in modern hardware.

Proceedings of the ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems, 2014

Managing DRAM Latency Divergence in Irregular GPGPU Applications.

Rajeev Balasubramonian

Proceedings of the International Conference for High Performance Computing, 2014

TOP-PIM: throughput-oriented programmable processing in memory.

Proceedings of the 23rd International Symposium on High-Performance Parallel and Distributed Computing, 2014

Improving Node-Level MapReduce Performance Using Processing-in-Memory Technologies.

Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013

A new perspective on processing-in-memory architecture design.

Proceedings of the ACM SIGPLAN Workshop on Memory Systems Performance and Correctness, 2013

Load balancing in a changing world: dealing with heterogeneity and performance variability.

Proceedings of the Computing Frontiers Conference, 2013

2005

Fault Tolerance Techniques for the Merrimac Streaming Supercomputer.

Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

2004

Stream Register Files with Indexed Access.

Proceedings of the 10th International Conference on High-Performance Computer Architecture (HPCA-10 2004), 2004

2003

Merrimac: Supercomputing with Streams.

Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

2000

Smart Memories: a modular reconfigurable architecture.

Proceedings of the 27th International Symposium on Computer Architecture (ISCA 2000), 2000
