Kengo Nakajima

Orcid: 0000-0001-9751-5649

According to our database1, Kengo Nakajima authored at least 74 papers between 1997 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MPI-Adapter2: An Automatic ABI Translation Library Builder for MPI Application Binary Portability.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops, 2024

2023
Dynamic Core Binding for Load Balancing of Applications Parallelized with MPI/OpenMP.
Proceedings of the Computational Science - ICCS 2023, 2023

2022
IEEE Special Issue on Innovative R&D Toward the Exascale Era.
IEEE Trans. Parallel Distributed Syst., 2022

A System-Wide Communication to Couple Multiple MPI Programs for Heterogeneous Computing.
Proceedings of the Parallel and Distributed Computing, Applications and Technologies, 2022

Acceleration of Optimized Coarse-Grid Operators by Spatial Redistribution for Multigrid Reduction in Time.
Proceedings of the Computational Science - ICCS 2022, 2022

Assignment of idle processors to spatial redistributed domains on coarse levels in multigrid reduction in time.
Proceedings of the HPC Asia 2022: International Conference on High Performance Computing in Asia-Pacific Region, Virtual Event, Japan, January 12, 2022

A Process Management Runtime with Dynamic Reconfiguration.
Proceedings of the HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops, Virtual Event Japan, January 11, 2022

Communication-Computation Overlapping for Preconditioned Parallel Iterative Solvers with Dynamic Loop Scheduling.
Proceedings of the HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops, Virtual Event Japan, January 11, 2022

Low/Adaptive Precision Computation in Preconditioned Iterative Solvers for Ill-Conditioned Problems.
Proceedings of the HPC Asia 2022: International Conference on High Performance Computing in Asia-Pacific Region, Virtual Event, Japan, January 12, 2022

Exploring Communication-Computation Overlap in Parallel Iterative Solvers on Manycore CPUs using Asynchronous Progress Control.
Proceedings of the HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops, Virtual Event Japan, January 11, 2022

Development of a coupler h3-Open-UTIL/MP.
Proceedings of the HPC Asia 2022: International Conference on High Performance Computing in Asia-Pacific Region, Virtual Event, Japan, January 12, 2022

2021
Optimized Cascadic Multigrid Parareal Method for Explicit Time-Marching Schemes.
Proceedings of the 12th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2021

Efficient Parallel Multigrid Methods on Manycore Clusters with Double/Single Precision Computing.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

2020

Development of element-by-element kernel algorithms in unstructured finite-element solvers for many-core wide-SIMD CPUs: Application to earthquake simulation.
J. Comput. Sci., 2020

The Effectiveness of Low-Precision Floating Arithmetic on Numerical Codes: A Case Study on Power Consumption.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2020

Multiplicative Schwartz-Type Block Multi-Color Gauss-Seidel Smoother for Algebraic Multigrid Methods.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2020

2019
Parallel Multigrid Methods on Manycore Clusters with IHK/McKernel.
Proceedings of the 10th IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2019

Combining Algorithmic Rethinking and AVX-512 Intrinsics for Efficient Simulation of Subcellular Calcium Signaling.
Proceedings of the Computational Science - ICCS 2019, 2019

Development of Element-by-Element Kernel Algorithms in Unstructured Implicit Low-Order Finite-Element Earthquake Simulation for Many-Core Wide-SIMD CPUs.
Proceedings of the Computational Science - ICCS 2019, 2019

2018
Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs.
Proceedings of the High Performance Computing - 33rd International Conference, 2018

A fast scalable implicit solver for nonlinear time-evolution earthquake city problem on low-ordered unstructured finite elements with artificial intelligence and transprecision computing.
Proceedings of the International Conference for High Performance Computing, 2018

Algebraic Multigrid Solver Using Coarse Grid Aggregation with Independent Aggregation.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

A Fast Scalable Implicit Solver with Concentrated Computation for Nonlinear Time-Evolution Problems on Low-Order Unstructured Finite Elements.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Performance and Scalability of Lightweight Multi-kernel Based Operating Systems.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Design of Parallel BEM Analyses Framework for SIMD Processors.
Proceedings of the Computational Science - ICCS 2018, 2018

Wave Propagation Simulation of Complex Multi-Material Problems with Fast Low-Order Unstructured Finite-Element Meshing and Analysis.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2018

Load-Balancing-Aware Parallel Algorithms of H-Matrices with Adaptive Cross Approximation for GPUs.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017
Implicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation Using OpenACC.
Proceedings of the Accelerator Programming Using Directives - 4th International Workshop, 2017

Low-profile and small-sized spiral-shaped microstrip line antenna with multi-band operation in UHF frequency band.
Proceedings of the 2017 IEEE Radio and Wireless Symposium, 2017

Communication-Computation Overlapping with Dynamic Loop Scheduling for Preconditioned Parallel Iterative Solvers on Multicore and Manycore Clusters.
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017

Hierarchical Parallelization of Multi-coloring Algorithms for Block IC Preconditioners.
Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

2016
Performance Analysis of SA-AMG Method by Setting Extracted Near-Kernel Vectors.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016

Parallel Iterative Solvers for Ill-conditioned Problems with Heterogeneous Material Properties.
Proceedings of the International Conference on Computational Science 2016, 2016

From FLOPS to BYTES: disruptive change in high-performance computing towards the post-moore era.
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2015
Multi-scale Coupling Simulation of Seismic Waves and Building Vibrations Using ppOpen-HPC.
Proceedings of the International Conference on Computational Science, 2015

2014
Optimization of serial and parallel communications for parallel geometric multigrid method.
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Implementation and Evaluation of an AMR Framework for FDM Applications.
Proceedings of the International Conference on Computational Science, 2014

2012
Numerical Simulation of Long-Term Fate of CO<sub>2</sub> Stored in Deep Reservoir Rocks on Massively Parallel Vector Supercomputer.
Proceedings of the High Performance Computing for Computational Science, 2012

Implementation and Evaluation of 3D Finite Element Method Application for CUDA.
Proceedings of the High Performance Computing for Computational Science, 2012

Automatic Tuning of Parallel Multigrid Solvers Using OpenMP/MPI Hybrid Parallel Programming Models.
Proceedings of the High Performance Computing for Computational Science, 2012

Control Formats for Unsymmetric and Symmetric Sparse Matrix-Vector Multiplications on OpenMP Implementations.
Proceedings of the High Performance Computing for Computational Science, 2012

OpenMP/MPI Hybrid Parallel ILU(k) Preconditioner for FEM Based on Extended Hierarchical Interface Decomposition for Multi-core Clusters.
Proceedings of the High Performance Computing for Computational Science, 2012

Revisiting Persistent Communication in MPI.
Proceedings of the Recent Advances in the Message Passing Interface, 2012

New strategy for coarse grid solvers in parallel multigrid methods using OpenMP/MPI hybrid programming models.
Proceedings of the 2012 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2012

Topic 15: High Performance and Scientific Applications.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

OpenMP/MPI Hybrid Parallel Multigrid Method on Fujitsu FX10 Supercomputer System.
Proceedings of the 2012 IEEE International Conference on Cluster Computing Workshops, 2012

2011
First International Workshop on Advances in High-Performance Computational Earth Sciences: Applications and Frameworks (IHPCES).
Proceedings of the International Conference on Computational Science, 2011

Introduction.
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010
Parallel Multigrid Solvers Using OpenMP/MPI Hybrid Programming Models on Multi-Core/Multi-Socket Clusters.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

A Multi-Scale Heart Simulation on Massively Parallel Computers.
Proceedings of the Conference on High Performance Computing Networking, 2010

2009
Participatory Simulation Environment gumonji/Q: A Network Game Empowered by Agents.
Proceedings of the Principles of Practice in Multi-Agent Systems, 2009

Parallel Multistage Preconditioners by Extended Hierarchical Interface Decomposition for Ill-Conditioned Problems.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Flat MPI vs. Hybrid: Evaluation of Parallel Programming Models for Preconditioned Iterative Solvers on .
Proceedings of the ICPPW 2009, 2009

2008
Participatory Simulation Platform Using Network Games.
Proceedings of the Intelligent Agents and Multi-Agent Systems, 2008

Parallel multistage preconditioners by Hierarchical Interface Decomposition on "T2K Open Super Computer (Todai Combined Cluster)" with Hybrid parallel programming models.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

2007
Parallel Preconditioning Methods with Selective Fill-Ins and Selective Overlapping for Ill-Conditioned Problems in Finite-Element Methods.
Proceedings of the Computational Science - ICCS 2007, 7th International Conference, Beijing, China, May 27, 2007

Parallel Multistage Preconditioners Based on a Hierarchical Graph Decomposition for SMP Cluster Architectures with a Hybrid Parallel Programming Model.
Proceedings of the High Performance Computing and Communications, 2007

2006
The Impact of Parallel Programming Models on the Performance of Iterative Linear Solvers for Finite Element Applications.
Proceedings of the High Performance Computing for Computational Science, 2006

2005
Parallel iterative solvers for finite-element methods using an OpenMP/MPI hybrid programming model on the Earth Simulator.
Parallel Comput., 2005

2004
Parallel iterative solvers with selective blocking preconditioning for simulations of fault-zone contact.
Numer. Linear Algebra Appl., 2004

2003
Optimizing parallel performance of unstructured volume rendering for the Earth Simulator.
Parallel Comput., 2003

Global and localized parallel preconditioning techniques for large scale solid Earth simulations.
Future Gener. Comput. Syst., 2003

Parallel Iterative Solvers of GeoFEM with Selective Blocking Preconditioning for Nonlinear Contact Problems on the Earth Simulator.
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

OpenMP / MPI Hybrid vs. Flat MPI on the Earth Simulator: Parallel Iterative Solvers for Finite Element Method.
Proceedings of the High Performance Computing, 5th International Symposium, 2003

Parallel Finite Element Analysis Platform for the Earth Simulator: GeoFEM.
Proceedings of the Computational Science - ICCS 2003, 2003

2002
Parallel iterative solvers for unstructured grids using a directive/MPI hybrid programming model for the GeoFEM platform on SMP cluster architectures.
Concurr. Comput. Pract. Exp., 2002

Parallel multilevel iterative linear solvers with unstructured adaptive grids for simulations in earth science.
Concurr. Comput. Pract. Exp., 2002

Parallel Iterative Solvers for Unstructured Grids Using an OpenMP/MPI Hybrid Programming Model for the GeoFEM Platform on SMP Cluster Architectures.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

Parallel performance optimization of large-scale unstructured data visualization for the earth simulator.
Proceedings of the 4th Eurographics Workshop on Parallel Graphics and Visualization, 2002

2001
Parallel 3D Adaptive Compressible Navier-Stokes Solver in GeoFEM with Dynamic Load-Balancing by DRAMA Library.
Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

1999
GeoFEM: High-Performance Parallel FEM for Geophysical Applications.
Proceedings of the High Performance Computing, Second International Symposium, 1999

1998
Highly Stable Localized ILU Preconditioning for Unstructured Grids.
Proceedings of the High-Performance Computing and Networking, 1998

1997
Parallel Iterative Solvers with Localized ILU Preconditioning.
Proceedings of the High-Performance Computing and Networking, 1997


  Loading...