Shintaro Iwasaki

Orcid: 0000-0002-4748-8459

According to our database1, Shintaro Iwasaki authored at least 14 papers between 2016 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Preparing MPICH for exascale.
Int. J. High Perform. Comput. Appl., 2025

2024
AsHES 2024 Preface and Committee List.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale.
CoRR, 2023

2022
OpenMP application experiences: Porting to accelerated nodes.
Parallel Comput., 2022

2021
Lightweight preemptive user-level threads.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

2020
Analyzing the Performance Trade-Off in Implementing User-Level Threads.
IEEE Trans. Parallel Distributed Syst., 2020

Implementing Flexible Threading Support in Open MPI.
Proceedings of the Workshop on Exascale MPI, 2020

2019
TP-PARSEC: A Task Parallel PARSEC Benchmark Suite.
J. Inf. Process., 2019

Software combining to mitigate multithreaded MPI contention.
Proceedings of the ACM International Conference on Supercomputing, 2019

BOLT: Optimizing OpenMP Parallel Regions with User-Level Threads.
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019

2018
Argobots: A Lightweight Low-Level Threading and Tasking Framework.
IEEE Trans. Parallel Distributed Syst., 2018

Lessons learned from analyzing dynamic promotion for user-level threading.
Proceedings of the International Conference for High Performance Computing, 2018

2016
Autotuning of a Cut-Off for Task Parallel Programs.
Proceedings of the 10th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2016

A Static Cut-off for Task Parallel Programs.
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016


  Loading...