Sunita Chandrasekaran

Proceedings of the AMIA 2020, 2020

2019

<i>pointerchain: </i> Tracing pointers to their roots - A case study in molecular dynamics simulations.

[BibT_eX]

[DOI]

Parallel Comput., 2019

Analysis of OpenMP 4.5 Offloading in Implementations: Correctness and Overhead.

[BibT_eX]

[DOI]

Parallel Comput., 2019

Creating a portable, high-level graph analytics paradigm for compute and data-intensive applications.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Netw., 2019

MPI + OpenACC: Accelerating radiation transport mini-application, minisweep, on heterogeneous systems.

[BibT_eX]

[DOI]

Robert Searles

Wayne Joubert

Comput. Phys. Commun., 2019

Assessing Performance Implications of Deep Copy Operations via Microbenchmarking.

[BibT_eX]

[DOI]

CoRR, 2019

Studying the Impact of Power Capping on MapReduce-based, Data-intensive Mini-applications on Intel KNL and KNM Architectures.

[BibT_eX]

[DOI]

Joshua Hoke Davis

Tao Gao

Michela Taufer

CoRR, 2019

Correction to: Computational approaches for cancer 2017 workshop overview.

[BibT_eX]

[DOI]

Eric Stahlberg

BMC Bioinform., 2019

Gecko: Hierarchical Distributed View of Heterogeneous Shared Memory Architectures.

[BibT_eX]

[DOI]

Proceedings of the 10th International Workshop on Programming Models and Applications for Multicores and Manycores, 2019

Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPI.

[BibT_eX]

[DOI]

Joshua Hoke Davis

Tao Gao

Proceedings of the Parallel Computing: Technology Trends, 2019

2018

The OpenACC data model: Preliminary study on its major challenges and implementations.

[BibT_eX]

[DOI]

Parallel Comput., 2018

Special issue on applications for the heterogeneous computing era 2017.

[BibT_eX]

[DOI]

Antonio J. Peña

Parallel Comput., 2018

Best Practices in Running Collaborative GPU Hackathons: Advancing Scientific Applications with a Sustained Impact.

[BibT_eX]

[DOI]

Comput. Sci. Eng., 2018

Power and Energy-efficiency Roofline Model for GPUs.

[BibT_eX]

[DOI]

Jeff Larkin

Larry Shi

CoRR, 2018

Computational approaches for Cancer 2017 workshop overview.

[BibT_eX]

[DOI]

Eric Stahlberg

BMC Bioinform., 2018

Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures.

[BibT_eX]

[DOI]

Robert Searles

Wayne Joubert

Proceedings of the Platform for Advanced Scientific Computing Conference, 2018

OpenMP 4.5 Validation and Verification Suite for Device Offload.

[BibT_eX]

[DOI]

Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

Introduction to AsHES 2018.

[BibT_eX]

[DOI]

Antonio J. Peña

Min Si

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Evaluating Support for OpenMP Offload Features.

[BibT_eX]

[DOI]

Proceedings of the 47th International Conference on Parallel Processing, 2018

2017

Special Issue on Topics on Heterogeneous Computing.

[BibT_eX]

[DOI]

Antonio J. Peña

Parallel Comput., 2017

OpenACC 2.5 Validation Testsuite Targeting Multiple Architectures.

[BibT_eX]

[DOI]

Kyle Friedline

M. Graham Lopez

Proceedings of the High Performance Computing, 2017

An Efficient Data Layout Transformation Algorithm for Locality-Aware Parallel Sparse FFT.

[BibT_eX]

[DOI]

Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

Implementing the OpenACC Data Model.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Exploring Translation of OpenMP to OpenACC 2.5: Lessons Learned.

[BibT_eX]

[DOI]

Sergio Pino

Lori L. Pollock

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Introduction to AsHES Workshop.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016

Compiler transformation of nested loops for general purpose GPUs.

[BibT_eX]

[DOI]

Deepak Eachempati

Concurr. Comput. Pract. Exp., 2016

An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling.

[BibT_eX]

[DOI]

Verónica G. Vergara Larrea

Proceedings of the High Performance Computing - 31st International Conference, 2016

From Describing to Prescribing Parallelism: Translating the SPEC ACCEL OpenACC Suite to OpenMP Target Directives.

[BibT_eX]

[DOI]

Sandra Wienke

Alexander Bobyr

William C. Brantley

Proceedings of the High Performance Computing, 2016

A Portable, High-Level Graph Analytics Framework Targeting Distributed, Heterogeneous Systems.

[BibT_eX]

[DOI]

Robert Searles

Stephen Herbein

Proceedings of the Third Workshop on Accelerator Programming Using Directives, 2016

cusFFT: A High-Performance Sparse Fast Fourier Transform Algorithm on GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Exploring Task Parallelism for Heterogeneous Systems Using Multicore Task Management API.

[BibT_eX]

[DOI]

Suyang Zhu

Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

2015

Multi-GPU Support on Single Node Using Directive-Based Programming Model.

[BibT_eX]

[DOI]

Sci. Program., 2015

Programming Models, Languages, and Compilers for Manycore and Heterogeneous Architectures.

[BibT_eX]

[DOI]

Xinmin Tian

Sci. Program., 2015

OpenMP-MCA: Leveraging Multiprocessor Embedded Systems Using Industry Standards.

[BibT_eX]

[DOI]

Peng Sun

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

PLC Introduction and Committees.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Deploying OpenMP Task Parallelism on Multicore Embedded Systems with MCA Task APIs.

[BibT_eX]

[DOI]

Peng Sun

Suyang Zhu

Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

2014

Accelerating Kirchhoff migration on GPU using directives.

[BibT_eX]

[DOI]

Maxime R. Hugues

Henri Calandra

Proceedings of the First Workshop on Accelerator Programming using Directives, 2014

SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

Reduction Operations in Parallel Loops for GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the 2014 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2014

NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2014

A Validation Testsuite for OpenACC 1.0.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013

C2FPGA - A dependency-timing graph design methodology.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2013

Parallel sparse FFT.

[BibT_eX]

[DOI]

Mauricio Araya-Polo

Amik St.-Cyr

Detlef Hohl

Proceedings of the 3rd Workshop on Irregular Applications - Architectures and Algorithms, 2013

libEOMP: a portable OpenMP runtime library based on MCA APIs for embedded systems.

[BibT_eX]

[DOI]

Jim Holt

Proceedings of the 2013 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2013

Portable mapping of openMP to multicore embedded systems using MCA APIs.

[BibT_eX]

[DOI]

Peng Sun

Jim Holt

Proceedings of the SIGPLAN/SIGBED Conference on Languages, 2013

Compiling a High-Level Directive-Based Programming Model for GPGPUs.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2013

Exploring Programming Multi-GPUs Using OpenMP and OpenACC-Based Hybrid Model.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Statistical modeling of power/energy of scientific kernels on a multi-GPU system.

[BibT_eX]

[DOI]

Sayan Ghosh

Proceedings of the International Green Computing Conference, 2013

2012

Tools and algorithms for high-level algorithm mapping to FPGA

[BibT_eX]

[DOI]

PhD thesis, 2012

Poster: Statistical Power and Energy Modeling of Multi-GPU Kernels.

[BibT_eX]

[DOI]

Sayan Ghosh

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

An OpenMP 3.1 Validation Testsuite.

[BibT_eX]

[DOI]

Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

2010

A dependency graph based methodology for parallelizing HLL applications on FPGA (abstract only).

[BibT_eX]

[DOI]

Shilpa Shanbagh

Douglas L. Maskell

Proceedings of the ACM/SIGDA 18th International Symposium on Field Programmable Gate Arrays, 2010

2008

Capturing performance knowledge for automated analysis.

[BibT_eX]

[DOI]

Kevin A. Huck

Van Bui