Daniel Wong

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

2022

PowerMorph: QoS-Aware Server Power Reshaping for Data Center Regulation Service.

[BibT_eX]

[DOI]

Ali Jahanshahi

Nanpeng Yu

ACM Trans. Archit. Code Optim., 2022

ScaleServe: a scalable multi-GPU machine learning inference system and benchmarking suite.

[BibT_eX]

[DOI]

Ali Jahanshahi

Marcus Chow

Proceedings of the GPGPU@PPoPP 2022: Proceedings of the 14th Workshop on General Purpose Processing Using GPU, 2022

GPUCalorie: Floorplan Estimation for GPU Thermal Evaluation.

[BibT_eX]

[DOI]

Proceedings of the International IEEE Symposium on Performance Analysis of Systems and Software, 2022

2021

PAVER: Locality Graph-Based Thread Block Scheduling for GPUs.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., 2021

MAPA: multi-accelerator pattern allocation policy for multi-tenant GPU servers.

[BibT_eX]

[DOI]

Kiran Ranganath

Joshua D. Suetterlein

Joseph B. Manzano

Shuaiwen Leon Song

Proceedings of the International Conference for High Performance Computing, 2021

Energy Efficient Task Graph Execution Using Compute Unit Masking in GPUs.

[BibT_eX]

[DOI]

Proceedings of the IEEE/ACM Redefining Scalability for Diversely Heterogeneous Architectures Workshop, 2021

LocalityGuru: A PTX Analyzer for Extracting Thread Block-level Locality in GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2021

ICAP: Designing Inrush Current Aware Power Gating Switch for GPGPU.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Networking, Architecture and Storage, 2021

LC-MEMENTO: A Memory Model for Accelerated Architectures.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2021

BlockMaestro: Enabling Programmer-Transparent Task-based Execution in GPU Systems.

[BibT_eX]

[DOI]

AmirAli Abdolrashidi

Mangpo Phitchaya Phothilimtha

Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

2020

GPU-NEST: Characterizing Energy Efficiency of Multi-GPU Inference Servers.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2020

Transferable Graph Optimizers for ML Compilers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

BOW: Breathing Operand Windows to Exploit Bypassing in GPUs.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

High-Performance Parallel Radix Sort on FPGA.

[BibT_eX]

[DOI]

Evangelos E. Papalexakis

Vassilis J. Tsotras

Walid A. Najjar

Proceedings of the 28th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2020

2019

Speeding up Collective Communications Through Inter-GPU Re-Routing.

[BibT_eX]

[DOI]

IEEE Comput. Archit. Lett., 2019

Locality-Aware GPU Register File.

[BibT_eX]

[DOI]

Hyeran Jeon

Nael B. Abu-Ghazaleh

Sindhuja Elango

IEEE Comput. Archit. Lett., 2019

Long-Term Reliability Management For Multitasking GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Synthesis, 2019

μDPM: Dynamic Power Management for the Microsecond Era.

[BibT_eX]

[DOI]

Chih-Hsun Chou

Laxmi N. Bhuyan

Proceedings of the 25th IEEE International Symposium on High Performance Computer Architecture, 2019

CORF: Coalescing Operand Register File for GPUs.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019

2018

Load-Triggered Warp Approximation on GPU.

[BibT_eX]

[DOI]

Zhenhong Liu

Nam Sung Kim

Proceedings of the International Symposium on Low Power Electronics and Design, 2018

Joint Server and Network Energy Saving in Data Centers for Latency-Sensitive Applications.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

2017

Wireframe: supporting data-dependent parallelism through dependency graph execution in GPUs.

[BibT_eX]

[DOI]

AmirAli Abdolrashidi

Devashree Tripathy

Mehmet Esat Belviranli

Laxmi Narayan Bhuyan

Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017

2016

Squeezing Energy Savings Out Of Similar Data and Computation in GPGPUs.

[BibT_eX]

[DOI]

Tiny Trans. Comput. Sci., 2016

STOMP: Statistical Techniques for Optimizing and Modeling Performance of Blocked Sparse Matrix Vector Multiplication.

[BibT_eX]

[DOI]

Steena Monteiro

Forrest N. Iandola

Proceedings of the 28th International Symposium on Computer Architecture and High Performance Computing, 2016

DynSleep: Fine-grained Power Management for a Latency-Critical Data Center Application.

[BibT_eX]

[DOI]

Chih-Hsun Chou

Laxmi N. Bhuyan

Proceedings of the 2016 International Symposium on Low Power Electronics and Design, 2016

Peak Efficiency Aware Scheduling for Highly Energy Proportional Servers.

[BibT_eX]

[DOI]

Proceedings of the 43rd ACM/IEEE Annual International Symposium on Computer Architecture, 2016

Origami: Folding Warps for Energy Efficient GPUs.

[BibT_eX]

[DOI]

Mohammad Abdel-Majeed

Justin Kuang

Proceedings of the 2016 International Conference on Supercomputing, 2016

Approximating warps with intra-warp operand value similarity.

[BibT_eX]

[DOI]

Nam Sung Kim

Proceedings of the 2016 IEEE International Symposium on High Performance Computer Architecture, 2016

Invited - Cross-layer modeling and optimization for electromigration induced reliability.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual Design Automation Conference, 2016

2015

A Retrospective Look Back on the Road Towards Energy Proportionality.

[BibT_eX]

[DOI]

Julia Chen

Proceedings of the 2015 IEEE International Symposium on Workload Characterization, 2015

2014

Implications of high energy proportional servers on cluster-wide energy proportionality.

[BibT_eX]

[DOI]

Proceedings of the 20th IEEE International Symposium on High Performance Computer Architecture, 2014

2013

Scaling the Energy Proportionality Wall with KnightShift.

[BibT_eX]

[DOI]

IEEE Micro, 2013

Warped gates: gating aware scheduling and power gating for GPGPUs.

[BibT_eX]

[DOI]

Mohammad Abdel-Majeed

Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture, 2013

2012

KnightShift: Scaling the Energy Proportionality Wall through Server-Level Heterogeneity.

[BibT_eX]

[DOI]

Proceedings of the 45th Annual IEEE/ACM International Symposium on Microarchitecture, 2012

2010

Adaptive and Speculative Slack Simulations of CMPs on CMPs.

[BibT_eX]

[DOI]

Jianwei Chen

Lakshmi Kumar Dabbiru

Michel Dubois

Proceedings of the 43rd Annual IEEE/ACM International Symposium on Microarchitecture, 2010

Implementing games on pinball machines.

[BibT_eX]

[DOI]

Proceedings of the International Conference on the Foundations of Digital Games, 2010

Teaching Artificial Intelligence and Robotics Via Games.

[BibT_eX]

[DOI]