Bálint Joó

ORCID: 0000-0002-4229-7960

According to our database, Bálint Joó authored at least 25 papers between 2004 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023

Physics guided machine learning for multi-material decomposition of tissues from dual-energy CT scans of simulated breast models with calcifications.
Proceedings of the High Performance Computing for Imaging 2023, 2023

2022
Early Application Experiences on a Modern GPU-Accelerated Arm-based HPC Platform.
CoRR, 2022

2019
Performance Portability of a Wilson Dslash Stencil Operator Mini-App Using Kokkos and SYCL.
Proceedings of the 2019 IEEE/ACM International Workshop on Performance, 2019

2018
A per-cent-level determination of the nucleon axial coupling from quantum chromodynamics.
Nat., 2018

Simulating the weak death of the neutron in a femtoscale universe with near-Exascale computing.
CoRR, 2018

Lessons Learned from Optimizing Kernels for Adaptive Aggregation Multi-grid Solvers in Lattice QCD.
Proceedings of the High Performance Computing, 2018

Simulating the weak death of the Neutron in a femtoscale universe with near-exascale computing.
Proceedings of the International Conference for High Performance Computing, 2018

2016
Optimizing a Multiple Right-Hand Side Dslash Kernel for Intel Knights Corner.
Proceedings of the High Performance Computing, 2016

Optimizing Wilson-Dirac Operator and Linear Solvers for Intel® KNL.
Proceedings of the High Performance Computing, 2016

Accelerating lattice QCD multigrid on GPUs using fine-grained parallelization.
Proceedings of the International Conference for High Performance Computing, 2016

2015
Improving concurrency and asynchrony in multithreaded MPI applications using software offloading.
Proceedings of the International Conference for High Performance Computing, 2015

2014
Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors.
Proceedings of the International Conference for High Performance Computing, 2014

A Framework for Lattice QCD Calculations on GPUs.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Improving Communication Performance and Scalability of Native Applications on Intel Xeon Phi Coprocessor Clusters.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

2013
Lattice QCD on Intel® Xeon Phi™ Coprocessors.
Proceedings of the Supercomputing - 28th International Supercomputing Conference, 2013

2012
Lattice QCD on GPU clusters, using the QUDA library and the Chroma software system.
Int. J. High Perform. Comput. Appl., 2012

Automatic Offloading C++ Expression Templates to CUDA Enabled GPUs.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

2011
Building the International Lattice Data Grid.
Comput. Phys. Commun., 2011

High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach.
Proceedings of the Conference on High Performance Computing Networking, 2011

Scaling lattice QCD beyond 100 GPUs.
Proceedings of the Conference on High Performance Computing Networking, 2011

2010
Parallelizing the QUDA Library for Multi-GPU Calculations in Lattice Quantum Chromodynamics.
Proceedings of the Conference on High Performance Computing Networking, 2010

2008
Continuing Progress on a Lattice QCD Software Infrastructure.
CoRR, 2008

2005
Overview of the QCDSP and QCDOC computers.
IBM J. Res. Dev., 2005

2004
QCDOC: A 10 Teraflops Computer for Tightly-Coupled Calculations.
Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004

