Vivek Kale

Orcid: 0000-0003-4687-1226

According to our database1, Vivek Kale authored at least 18 papers between 2010 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
CPU-GPU Tuning for Modern Scientific Applications using Node-Level Heterogeneity.
Proceedings of the 30th IEEE International Conference on High Performance Computing, 2023

2022
OpenMP application experiences: Porting to accelerated nodes.
Parallel Comput., 2022

OpenMP's Asynchronous Offloading for All-pairs Shortest Path Graph Algorithms on GPUs.
Proceedings of the 2022 IEEE/ACM International Workshop on Hierarchical Parallelism for Exascale Computing (HiPar), 2022

2021


Addressing Load Imbalance in Bioinformatics and Biomedical Applications: Efficient Scheduling across Multiple GPUs.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020
Toward Supporting Multi-GPU Targets via Taskloop and User-Defined Schedules.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

2019
Toward a Standard Interface for User-Defined Scheduling in OpenMP.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

2015
Low-overhead scheduling for improving performance of scientific applications
PhD thesis, 2015

Composing Low-Overhead Scheduling Strategies for Improving Performance of Scientific Applications.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

2014
Locality-Optimized Mixed Static/Dynamic Scheduling for Improving Load Balancing on SMPs.
Proceedings of the 21st European MPI Users' Group Meeting, 2014

2013
MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory.
Computing, 2013

Performance Analysis of the Lattice Boltzmann Model Beyond Navier-Stokes.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013

2012
Abstract: Slack-Conscious Lightweight Loop Scheduling for Improving Scalability of Bulk-synchronous MPI Applications.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Leveraging MPI's One-Sided Communication Interface for Shared-Memory Programming.
Proceedings of the Recent Advances in the Message Passing Interface, 2012

Hybrid Static/dynamic Scheduling for Already Optimized Dense Matrix Factorization.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2011
Weighted locality-sensitive scheduling for mitigating noise on multi-core clusters.
Proceedings of the 18th International Conference on High Performance Computing, 2011

2010
Load Balancing for Regular Meshes on SMPs with MPI.
Proceedings of the Recent Advances in the Message Passing Interface, 2010


  Loading...