Proceedings of the HPC Asia 2022: International Conference on High Performance Computing in Asia-Pacific Region, Virtual Event, Japan, January 12, 2022

A Process Management Runtime with Dynamic Reconfiguration.

Shinji Sumimoto

Toshihiro Hanawa

Kengo Nakajima

Proceedings of the HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops, Virtual Event, Japan, January 11, 2022

Communication-Computation Overlapping for Preconditioned Parallel Iterative Solvers with Dynamic Loop Scheduling.

Proceedings of the HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops, Virtual Event, Japan, January 11, 2022

Low/Adaptive Precision Computation in Preconditioned Iterative Solvers for Ill-Conditioned Problems.

Masatoshi Kawai

Kengo Nakajima

Proceedings of the HPC Asia 2022: International Conference on High Performance Computing in Asia-Pacific Region, Virtual Event, Japan, January 12, 2022

Exploring Communication-Computation Overlap in Parallel Iterative Solvers on Manycore CPUs using Asynchronous Progress Control.

Proceedings of the HPCAsia 2022 Workshop: International Conference on High Performance Computing in Asia-Pacific Region Workshops, Virtual Event, Japan, January 11, 2022

Development of a coupler h3-Open-UTIL/MP.

Takashi Arakawa

Hisashi Yashiro

Kengo Nakajima

Proceedings of the HPC Asia 2022: International Conference on High Performance Computing in Asia-Pacific Region, Virtual Event, Japan, January 12, 2022

2021

Optimized Cascadic Multigrid Parareal Method for Explicit Time-Marching Schemes.

Yen-Chen Chen

Kengo Nakajima

Proceedings of the 12th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2021

Efficient Parallel Multigrid Methods on Manycore Clusters with Double/Single Precision Computing.

Kengo Nakajima

Takeshi Ogita

Masatoshi Kawai

Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

2020

ESSEX: Equipping Sparse Solvers For Exascale.

Melven Röhrig-Zöllner

Proceedings of the Software for Exascale Computing - SPPEXA 2016-2019, 2020

Development of element-by-element kernel algorithms in unstructured finite-element solvers for many-core wide-SIMD CPUs: Application to earthquake simulation.

J. Comput. Sci., 2020

The Effectiveness of Low-Precision Floating Arithmetic on Numerical Codes: A Case Study on Power Consumption.

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2020

Multiplicative Schwartz-Type Block Multi-Color Gauss-Seidel Smoother for Algebraic Multigrid Methods.

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2020

2019

Parallel Multigrid Methods on Manycore Clusters with IHK/McKernel.

Proceedings of the 10th IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2019

Combining Algorithmic Rethinking and AVX-512 Intrinsics for Efficient Simulation of Subcellular Calcium Signaling.

Proceedings of the Computational Science - ICCS 2019, 2019

Development of Element-by-Element Kernel Algorithms in Unstructured Implicit Low-Order Finite-Element Earthquake Simulation for Many-Core Wide-SIMD CPUs.

Proceedings of the Computational Science - ICCS 2019, 2019

2018

Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs.

Proceedings of the High Performance Computing - 33rd International Conference, 2018

A fast scalable implicit solver for nonlinear time-evolution earthquake city problem on low-ordered unstructured finite elements with artificial intelligence and transprecision computing.

Proceedings of the International Conference for High Performance Computing, 2018

Algebraic Multigrid Solver Using Coarse Grid Aggregation with Independent Aggregation.

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

A Fast Scalable Implicit Solver with Concentrated Computation for Nonlinear Time-Evolution Problems on Low-Order Unstructured Finite Elements.

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Performance and Scalability of Lightweight Multi-kernel Based Operating Systems.

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Design of Parallel BEM Analyses Framework for SIMD Processors.

Proceedings of the Computational Science - ICCS 2018, 2018

Wave Propagation Simulation of Complex Multi-Material Problems with Fast Low-Order Unstructured Finite-Element Meshing and Analysis.

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2018

Load-Balancing-Aware Parallel Algorithms of H-Matrices with Adaptive Cross Approximation for GPUs.

Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017

Implicit Low-Order Unstructured Finite-Element Multiple Simulation Enhanced by Dense Computation Using OpenACC.

Proceedings of the Accelerator Programming Using Directives - 4th International Workshop, 2017

Low-profile and small-sized spiral-shaped microstrip line antenna with multi-band operation in UHF frequency band.

Proceedings of the 2017 IEEE Radio and Wireless Symposium, 2017

Communication-Computation Overlapping with Dynamic Loop Scheduling for Preconditioned Parallel Iterative Solvers on Multicore and Manycore Clusters.

Kengo Nakajima

Toshihiro Hanawa

Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017

Hierarchical Parallelization of Multi-coloring Algorithms for Block IC Preconditioners.

Masatoshi Kawai

Akihiro Ida

Kengo Nakajima

Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

2016

Performance Analysis of SA-AMG Method by Setting Extracted Near-Kernel Vectors.

Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016

Parallel Iterative Solvers for Ill-conditioned Problems with Heterogeneous Material Properties.

Kengo Nakajima

Proceedings of the International Conference on Computational Science 2016, 2016

From FLOPS to BYTES: disruptive change in high-performance computing towards the post-Moore era.

Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2015

Multi-scale Coupling Simulation of Seismic Waves and Building Vibrations Using ppOpen-HPC.

Proceedings of the International Conference on Computational Science, 2015

2014

Optimization of serial and parallel communications for parallel geometric multigrid method.

Kengo Nakajima

Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Implementation and Evaluation of an AMR Framework for FDM Applications.

Proceedings of the International Conference on Computational Science, 2014

2012

Numerical Simulation of Long-Term Fate of CO2 Stored in Deep Reservoir Rocks on Massively Parallel Vector Supercomputer.

Proceedings of the High Performance Computing for Computational Science, 2012

Implementation and Evaluation of 3D Finite Element Method Application for CUDA.

Proceedings of the High Performance Computing for Computational Science, 2012

Automatic Tuning of Parallel Multigrid Solvers Using OpenMP/MPI Hybrid Parallel Programming Models.

Kengo Nakajima

Proceedings of the High Performance Computing for Computational Science, 2012

Control Formats for Unsymmetric and Symmetric Sparse Matrix-Vector Multiplications on OpenMP Implementations.

Proceedings of the High Performance Computing for Computational Science, 2012

OpenMP/MPI Hybrid Parallel ILU(k) Preconditioner for FEM Based on Extended Hierarchical Interface Decomposition for Multi-core Clusters.

Masae Hayashi

Kengo Nakajima

Proceedings of the High Performance Computing for Computational Science, 2012

Revisiting Persistent Communication in MPI.

Yutaka Ishikawa

Kengo Nakajima

Atsushi Hori

Proceedings of the Recent Advances in the Message Passing Interface, 2012

New strategy for coarse grid solvers in parallel multigrid methods using OpenMP/MPI hybrid programming models.

Kengo Nakajima

Proceedings of the 2012 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2012

Topic 15: High Performance and Scientific Applications.

Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

OpenMP/MPI Hybrid Parallel Multigrid Method on Fujitsu FX10 Supercomputer System.

Kengo Nakajima

Proceedings of the 2012 IEEE International Conference on Cluster Computing Workshops, 2012

2011

First International Workshop on Advances in High-Performance Computational Earth Sciences: Applications and Frameworks (IHPCES).

Takashi Furumura

Kengo Nakajima

Masaki Satoh

Proceedings of the International Conference on Computational Science, 2011

Introduction.

Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

2010

Parallel Multigrid Solvers Using OpenMP/MPI Hybrid Programming Models on Multi-Core/Multi-Socket Clusters.

Kengo Nakajima

Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

A Multi-Scale Heart Simulation on Massively Parallel Computers.

Proceedings of the Conference on High Performance Computing Networking, 2010

2009

Participatory Simulation Environment gumonji/Q: A Network Game Empowered by Agents.

Proceedings of the Principles of Practice in Multi-Agent Systems, 2009

Parallel Multistage Preconditioners by Extended Hierarchical Interface Decomposition for Ill-Conditioned Problems.

Kengo Nakajima

Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Flat MPI vs. Hybrid: Evaluation of Parallel Programming Models for Preconditioned Iterative Solvers on .

Kengo Nakajima

Proceedings of the ICPPW 2009, 2009

2008

Participatory Simulation Platform Using Network Games.

Proceedings of the Intelligent Agents and Multi-Agent Systems, 2008

Parallel multistage preconditioners by Hierarchical Interface Decomposition on "T2K Open Super Computer (Todai Combined Cluster)" with Hybrid parallel programming models.

Kengo Nakajima

Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

2007

Parallel Preconditioning Methods with Selective Fill-Ins and Selective Overlapping for Ill-Conditioned Problems in Finite-Element Methods.

Kengo Nakajima

Proceedings of the Computational Science - ICCS 2007, 7th International Conference, Beijing, China, May 27, 2007

Parallel Multistage Preconditioners Based on a Hierarchical Graph Decomposition for SMP Cluster Architectures with a Hybrid Parallel Programming Model.

Kengo Nakajima

Proceedings of the High Performance Computing and Communications, 2007

2006

The Impact of Parallel Programming Models on the Performance of Iterative Linear Solvers for Finite Element Applications.

Kengo Nakajima

Proceedings of the High Performance Computing for Computational Science, 2006

2005

Parallel iterative solvers for finite-element methods using an OpenMP/MPI hybrid programming model on the Earth Simulator.

Kengo Nakajima

Parallel Comput., 2005

2004

Parallel iterative solvers with selective blocking preconditioning for simulations of fault-zone contact.

Kengo Nakajima

Hiroshi Okuda

Numer. Linear Algebra Appl., 2004

2003

Optimizing parallel performance of unstructured volume rendering for the Earth Simulator.

Li Chen

Issei Fujishiro

Kengo Nakajima

Parallel Comput., 2003

Parallel Iterative Solvers of GeoFEM with Selective Blocking Preconditioning for Nonlinear Contact Problems on the Earth Simulator.

Kengo Nakajima

Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

OpenMP / MPI Hybrid vs. Flat MPI on the Earth Simulator: Parallel Iterative Solvers for Finite Element Method.

Kengo Nakajima

Proceedings of the High Performance Computing, 5th International Symposium, 2003

Global and Localized Parallel Preconditioning Techniques for Large Scale Solid Earth Simulations.

Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Parallel Finite Element Analysis Platform for the Earth Simulator: GeoFEM.

Proceedings of the Computational Science - ICCS 2003, 2003

2002

Parallel iterative solvers for unstructured grids using a directive/MPI hybrid programming model for the GeoFEM platform on SMP cluster architectures.

Kengo Nakajima

Hiroshi Okuda

Concurr. Comput. Pract. Exp., 2002

Parallel multilevel iterative linear solvers with unstructured adaptive grids for simulations in earth science.

Kengo Nakajima

Concurr. Comput. Pract. Exp., 2002

Parallel Iterative Solvers for Unstructured Grids Using an OpenMP/MPI Hybrid Programming Model for the GeoFEM Platform on SMP Cluster Architectures.
