Toru Nagai

Orcid: 0000-0002-5827-1220

According to our database1, Toru Nagai authored at least 15 papers between 1995 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Auto-Tuning Mixed-Precision Computation by Specifying Multiple Regions.
Concurr. Comput. Pract. Exp., January, 2025

Performance Evaluation of Loop Body Splitting for Fast Modal Filtering in SCALE-DG on A64FX.
Proceedings of the 2025 International Conference on High Performance Computing in Asia-Pacific Region Workshops, 2025

2024
Performance Evaluation of CMOS Annealing with Support Vector Machine.
Proceedings of the 17th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2024

Adaptation of XAI to Auto-tuning for Numerical Libraries.
Proceedings of the 17th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2024

Implementing Fast Modal Filtering of SCALE-DG.
Proceedings of the IEEE International Conference on Cluster Computing, 2024

2023
Autotuning by Changing Directives and Number of Threads in OpenMP using ppOpen-AT.
CoRR, 2023

Implementation of Radio Wave Propagation using RT Cores and Consideration of Programming Models.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

2022
Autotuning Power Consumption and Computation Accuracy using ppOpen-AT.
Proceedings of the 15th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2022

2021
Parallelization of GKV benchmark using OpenACC.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

An Auto-tuning with Adaptation of A64 Scalable Vector Extension for SPIRAL.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

2020
Performance Evaluation of Accurate Matrix-Matrix Multiplication on GPU Using Sparse Matrix Multiplications.
Proceedings of the Eighth International Symposium on Computing and Networking Workshops, 2020

2018
Threaded Accurate Matrix-Matrix Multiplications with Sparse Matrix-Vector Multiplications.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Preconditioner Auto-Tuning Using Deep Learning for Sparse Iterative Algorithms.
Proceedings of the Sixth International Symposium on Computing and Networking, 2018

Optimizing Forward Computation in Adjoint Method via Multi-level Blocking.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2018

1995
Benchmarking Fortran Intrinsic Functions.
Int. J. High Speed Comput., 1995


  Loading...