Tetsuya Hoshino

Orcid: 0009-0004-5349-6852

According to our database1, Tetsuya Hoshino authored at least 28 papers between 2004 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Layer-wise MoE Routing Locality under Shared-Prefix Code Generation: Token-Identity Decomposition and Compile-Equivalent Fork Redundancy.
CoRR, April, 2026

Improving HPC Code Generation Capability of LLMs via Online Reinforcement Learning with Real-Machine Benchmark Rewards.
CoRR, February, 2026

Learning-Augmented Performance Model for Tensor Product Factorization in High-Order FEM.
IEEE Access, 2026

Tensor-Core-Optimized Strategies for BLR × Tall-Skinny Matrix Multiplication in BEM.
Proceedings of the Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region, 2026

Evaluating Claude Code's Coding and Test Automation for GPU Acceleration ofa Legacy Fortran Application: A GeoFEM Case Study.
Proceedings of the Supercomputing Asia and International Conference on High Performance Computing in Asia Pacific Region Workshops, 2026

2025
3Dify: a Framework for Procedural 3D-CG Generation Assisted by LLMs Using MCP and RAG.
CoRR, October, 2025

VibeCodeHPC: An Agent-Based Iterative Prompting Auto-Tuner for HPC Code Generation Using LLMs.
CoRR, October, 2025

Towards Generalized Parameter Tuning in Coherent Ising Machines: A Portfolio-Based Approach.
CoRR, July, 2025

Performance Evaluation of General Purpose Large Language Models for Basic Linear Algebra Subprograms Code Generation.
CoRR, July, 2025

Performance Evaluation of Loop Body Splitting for Fast Modal Filtering in SCALE-DG on A64FX.
Proceedings of the 2025 International Conference on High Performance Computing in Asia-Pacific Region Workshops, 2025

An Algorithm Portfolio Approach for Parameter Tuning in Coherent Ising Machines.
Proceedings of the Thirteenth International Symposium on Computing and Networking, CANDAR 2025, 2025

2024
Performance Evaluation of CMOS Annealing with Support Vector Machine.
Proceedings of the 17th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2024

Adaptation of XAI to Auto-tuning for Numerical Libraries.
Proceedings of the 17th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2024

Optimize Efficiency of Utilizing Systems by Dynamic Core Binding.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops, 2024

Implementing Fast Modal Filtering of SCALE-DG.
Proceedings of the IEEE International Conference on Cluster Computing, 2024

2023
Implementation of Radio Wave Propagation using RT Cores and Consideration of Programming Models.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Auto-tuning Mixed-precision Computation by Specifying Multiple Regions.
Proceedings of the Eleventh International Symposium on Computing and Networking, CANDAR 2023, Matsue, Japan, November 28, 2023

2022
Optimizations of H-matrix-vector Multiplication for Modern Multi-core Processors.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2020
Development of training environment for deep learning with medical images on supercomputer system based on asynchronous parallel Bayesian optimization.
J. Supercomput., 2020

Repeated coordination with private learning.
J. Econ. Theory, 2020

2018
Design of Parallel BEM Analyses Framework for SIMD Processors.
Proceedings of the Computational Science - ICCS 2018, 2018

Load-Balancing-Aware Parallel Algorithms of H-Matrices with Adaptive Cross Approximation for GPUs.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

2016
A Directive-Based Data Layout Abstraction for Performance Portability of OpenACC Applications.
Proceedings of the 18th IEEE International Conference on High Performance Computing and Communications; 14th IEEE International Conference on Smart City; 2nd IEEE International Conference on Data Science and Systems, 2016

2014
An OpenACC extension for data layout transformation.
Proceedings of the First Workshop on Accelerator Programming using Directives, 2014

2013
CUDA vs OpenACC: Performance Case Studies with Kernel Benchmarks and a Memory-Bound CFD Application.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013

2008
Low-load server crawler: design and evaluation.
Proceedings of the 17th International Conference on World Wide Web, 2008

2006
Geographic locations of web servers.
Proceedings of the 15th international conference on World Wide Web, 2006

2004
Evoked-cerebral blood oxygenation changes in false-negative activations in BOLD contrast functional MRI of patients with brain tumors.
NeuroImage, 2004


  Loading...