Naohito Nakasato

Orcid: 0000-0003-1195-5710

According to our database, Naohito Nakasato authored at least 28 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Evaluation of POSIT Arithmetic with Accelerators.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2024

2023
Accelerating 128-bit Floating-Point Matrix Multiplication on FPGAs.
Proceedings of the 31st IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2023

2021
Acceleration of Gravitation Field Analysis for Asteroids by GPU Computation.
Proceedings of the 14th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2021

2019
Effectiveness of performance tuning techniques for general matrix multiplication on the PEZY-SC2.
Proceedings of the 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, 2019

Performance Evaluation of Tsunami Simulation Exploiting Temporal Parallelism on FPGAs using OpenCL.
Proceedings of the 10th International Symposium on Highly-Efficient Accelerators and Reconfigurable Technologies, 2019

2018
Evaluations of OpenCL-written tsunami simulation on FPGA and comparison with GPU implementation.
J. Supercomput., 2018

High Performance High-Precision Floating-Point Operations on FPGAs Using OpenCL.
Proceedings of the International Conference on Field-Programmable Technology, 2018

Introduction of MNSTbot.
Proceedings of the International Conference on Field-Programmable Technology, 2018

2017
FPGA-based tsunami simulation: Performance comparison with GPUs, and roofline model for scalability analysis.
J. Parallel Distributed Comput., 2017

Performance Evaluation of Tsunami Simulation Using OpenCL on GPU and FPGA.
Proceedings of the 11th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2017

2016
Parallelism for High-Performance Tsunami Simulation with FPGA: Spatial or Temporal?
Proceedings of the 24th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2016

2015
Stream Computation of Shallow Water Equation Solver for FPGA-based 1D Tsunami Simulation.
SIGARCH Comput. Archit. News, 2015

Application of GRAPE9-MPX for High Precision Calculation in Particle Physics and Performance Results.
Proceedings of the International Conference on Computational Science, 2015

2014
GPU Accelerated Hybrid Tree Algorithm for Collision Less N-body Simulations.
SIGARCH Comput. Archit. News, 2014

2012
Implementation of a parallel tree method on a GPU.
J. Comput. Sci., 2012

Blocked United Algorithm for the All-Pairs Shortest Paths Problem on Hybrid CPU-GPU Systems.
IEICE Trans. Inf. Syst., 2012

Astrophysical Particle Simulations on Heterogeneous CPU-GPU Systems.
CoRR, 2012

Performance Tuning of Matrix Multiplication in OpenCL on Different GPUs and CPUs.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

GRAPE-MPs: Implementation of an SIMD for Quadruple/Hexuple/Octuple-Precision Arithmetic Operation on a Structured ASIC and an FPGA.
Proceedings of the IEEE 6th International Symposium on Embedded Multicore/Manycore SoCs, 2012

Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU.
Proceedings of the IEEE 6th International Symposium on Embedded Multicore/Manycore SoCs, 2012

2011
A fast GEMM implementation on the cypress GPU.
SIGMETRICS Perform. Evaluation Rev., 2011

Multi-level Optimization of Matrix Multiplication for GPU-equipped Systems.
Proceedings of the International Conference on Computational Science, 2011

GRAPE-MP: An SIMD Accelerator Board for Multi-precision Arithmetic.
Proceedings of the International Conference on Computational Science, 2011

Blocked All-Pairs Shortest Paths Algorithm for Hybrid CPU-GPU System.
Proceedings of the 13th IEEE International Conference on High Performance Computing & Communication, 2011

2009
A compiler for high performance computing with many-core accelerators.
Proceedings of the 2009 IEEE International Conference on Cluster Computing, August 31, 2009

2005
PGR: A Software Package for Reconfigurable Super-Computing.
Proceedings of the 2005 International Conference on Field Programmable Logic and Applications (FPL), 2005

Astrophysical Hydrodynamics Simulations on a Reconfigurable System.
Proceedings of the 13th IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2005), 2005

Massively Parallel Processors Generator for Reconfigurable System.
Proceedings of the 13th IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2005), 2005
