Sunita Chandrasekaran

According to our database1, Sunita Chandrasekaran authored at least 75 papers between 2008 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
EZ: An efficient, charge conserving current deposition algorithm for electromagnetic particle-in-cell simulations.
Comput. Phys. Commun., October, 2023

Special issue on new trends in high-performance computing: Software systems and applications.
Softw. Pract. Exp., 2023

LLM4VV: Developing LLM-Driven Testsuite for Compiler Validation.
CoRR, 2023

Analysis of MURaM, a Solar Physics Application, for Scalability, Performance and Portability.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023


Hardware-Agnostic Interactive Exascale In Situ Visualization of Particle-In-Cell Simulations.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2023

Implementing OpenMP's SIMD Directive in LLVM's GPU Runtime.
Proceedings of the 52nd International Conference on Parallel Processing, 2023


2022
Metrics and Design of an Instruction Roofline Model for AMD GPUs.
ACM Trans. Parallel Comput., 2022

OpenACC Acceleration of an Agent-Based Biological Simulation Framework.
Comput. Sci. Eng., 2022

Early Application Experiences on a Modern GPU-Accelerated Arm-based HPC Platform.
CoRR, 2022

Analysis of Validating and Verifying OpenACC Compilers 3.0 and Above.
CoRR, 2022


Analysis of Validating and Verifying OpenACC Compilers 3.0 and Above.
Proceedings of the 9th Workshop on Accelerator Programming Using Directives, 2022

ECP SOLLVE: Validation and Verification Testsuite Status Update and Compiler Insight for OpenMP.
Proceedings of the IEEE/ACM International Workshop on Performance, 2022

First Experiences in Performance Benchmarking with the New SPEChpc 2021 Suites.
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

2021
Challenges Porting a C++ Template-Metaprogramming Abstraction Layer to Directive-Based Offloading.
Proceedings of the Accelerator Programming Using Directives - 8th International Workshop, 2021

Refactoring the MPS/University of Chicago Radiative MHD (MURaM) model for GPU/CPU performance portability using OpenACC directives.
Proceedings of the PASC '21: Platform for Advanced Scientific Computing Conference, 2021

2020
Accelerating prediction of chemical shift of protein structures on GPUs: Using OpenACC.
PLoS Comput. Biol., 2020

Future Directions of the Cyberinfrastructure for Sustained Scientific Innovation (CSSI) Program.
CoRR, 2020

Performance Assessment of OpenMP Compilers Targeting NVIDIA V100 GPUs.
Proceedings of the Accelerator Programming Using Directives - 7th International Workshop, 2020

Towards a portable hierarchical view of distributed shared memory systems: challenges and solutions.
Proceedings of the PMAM@PPoPP '20: Eleventh International Workshop on Programming Models and Applications for Multicores and Manycores colocated with the 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2020

Proposing a Machine Learning Framework for Classification of Patient Cohorts Using Genomics Data.
Proceedings of the AMIA 2020, 2020

2019
<i>pointerchain: </i> Tracing pointers to their roots - A case study in molecular dynamics simulations.
Parallel Comput., 2019

Analysis of OpenMP 4.5 Offloading in Implementations: Correctness and Overhead.
Parallel Comput., 2019

Creating a portable, high-level graph analytics paradigm for compute and data-intensive applications.
Int. J. High Perform. Comput. Netw., 2019

MPI + OpenACC: Accelerating radiation transport mini-application, minisweep, on heterogeneous systems.
Comput. Phys. Commun., 2019

Assessing Performance Implications of Deep Copy Operations via Microbenchmarking.
CoRR, 2019

Studying the Impact of Power Capping on MapReduce-based, Data-intensive Mini-applications on Intel KNL and KNM Architectures.
CoRR, 2019

Correction to: Computational approaches for cancer 2017 workshop overview.
BMC Bioinform., 2019

Gecko: Hierarchical Distributed View of Heterogeneous Shared Memory Architectures.
Proceedings of the 10th International Workshop on Programming Models and Applications for Multicores and Manycores, 2019

Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPI.
Proceedings of the Parallel Computing: Technology Trends, 2019

2018
The OpenACC data model: Preliminary study on its major challenges and implementations.
Parallel Comput., 2018

Special issue on applications for the heterogeneous computing era 2017.
Parallel Comput., 2018

Best Practices in Running Collaborative GPU Hackathons: Advancing Scientific Applications with a Sustained Impact.
Comput. Sci. Eng., 2018

Power and Energy-efficiency Roofline Model for GPUs.
CoRR, 2018

Computational approaches for Cancer 2017 workshop overview.
BMC Bioinform., 2018

Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2018

OpenMP 4.5 Validation and Verification Suite for Device Offload.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

Introduction to AsHES 2018.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Evaluating Support for OpenMP Offload Features.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
Special Issue on Topics on Heterogeneous Computing.
Parallel Comput., 2017

OpenACC 2.5 Validation Testsuite Targeting Multiple Architectures.
Proceedings of the High Performance Computing, 2017

An Efficient Data Layout Transformation Algorithm for Locality-Aware Parallel Sparse FFT.
Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

Implementing the OpenACC Data Model.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Exploring Translation of OpenMP to OpenACC 2.5: Lessons Learned.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Introduction to AsHES Workshop.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016
Compiler transformation of nested loops for general purpose GPUs.
Concurr. Comput. Pract. Exp., 2016

An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling.
Proceedings of the High Performance Computing - 31st International Conference, 2016


A Portable, High-Level Graph Analytics Framework Targeting Distributed, Heterogeneous Systems.
Proceedings of the Third Workshop on Accelerator Programming Using Directives, 2016

cusFFT: A High-Performance Sparse Fast Fourier Transform Algorithm on GPUs.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Exploring Task Parallelism for Heterogeneous Systems Using Multicore Task Management API.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

2015
Multi-GPU Support on Single Node Using Directive-Based Programming Model.
Sci. Program., 2015

Programming Models, Languages, and Compilers for Manycore and Heterogeneous Architectures.
Sci. Program., 2015

OpenMP-MCA: Leveraging Multiprocessor Embedded Systems Using Industry Standards.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

PLC Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Deploying OpenMP Task Parallelism on Multicore Embedded Systems with MCA Task APIs.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

2014
Accelerating Kirchhoff migration on GPU using directives.
Proceedings of the First Workshop on Accelerator Programming using Directives, 2014

SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

Reduction Operations in Parallel Loops for GPGPUs.
Proceedings of the 2014 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2014

NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model.
Proceedings of the Languages and Compilers for Parallel Computing, 2014

A Validation Testsuite for OpenACC 1.0.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013
C2FPGA - A dependency-timing graph design methodology.
J. Parallel Distributed Comput., 2013

Parallel sparse FFT.
Proceedings of the 3rd Workshop on Irregular Applications - Architectures and Algorithms, 2013

libEOMP: a portable OpenMP runtime library based on MCA APIs for embedded systems.
Proceedings of the 2013 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2013

Portable mapping of openMP to multicore embedded systems using MCA APIs.
Proceedings of the SIGPLAN/SIGBED Conference on Languages, 2013

Compiling a High-Level Directive-Based Programming Model for GPGPUs.
Proceedings of the Languages and Compilers for Parallel Computing, 2013

Exploring Programming Multi-GPUs Using OpenMP and OpenACC-Based Hybrid Model.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Statistical modeling of power/energy of scientific kernels on a multi-GPU system.
Proceedings of the International Green Computing Conference, 2013

2012
Tools and algorithms for high-level algorithm mapping to FPGA
PhD thesis, 2012

Poster: Statistical Power and Energy Modeling of Multi-GPU Kernels.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

An OpenMP 3.1 Validation Testsuite.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

2010
A dependency graph based methodology for parallelizing HLL applications on FPGA (abstract only).
Proceedings of the ACM/SIGDA 18th International Symposium on Field Programmable Gate Arrays, 2010

2008
Capturing performance knowledge for automated analysis.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008


  Loading...