Kenjiro Taura

Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2018

2017

SDAC: Porting Scientific Data to Spark RDDs.

Tian Yang

Liu Chao

Proceedings of the Network and Parallel Computing, 2017

Autonomic Resource Management for Program Orchestration in Large-Scale Data Analysis.

Masahiro Tanaka

Kentaro Torisawa

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Cache Friendly Parallelization of Neural Encoder-Decoder Models Without Padding on Multi-core Architecture.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Delay Spotter: A Tool for Spotting Scheduler-Caused Delays in Task Parallel Runtime Systems.

An Huynh

Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016

Scalable Work Stealing of Native Threads on an x86-64 Infiniband Cluster.

Shigeki Akiyama

J. Inf. Process., 2016

Fragmented BWT: An Extended BWT for Full-Text Indexing.

Masaru Ito

Hiroshi Inoue

Proceedings of the String Processing and Information Retrieval, 2016

Autotuning of a Cut-Off for Task Parallel Programs.

Shintaro Iwasaki

Proceedings of the 10th IEEE International Symposium on Embedded Multicore/Many-core Systems-on-Chip, 2016

Scaling FMM with Data-Driven OpenMP Tasks on Multicore Architectures.

Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

Tapas: An Implicitly Parallel Programming Framework for Hierarchical N-Body Algorithms.

Proceedings of the 22nd IEEE International Conference on Parallel and Distributed Systems, 2016

A Quest for Unified, Global View Parallel Programming Models for Our Future.

Proceedings of the 6th International Workshop on Runtime and Operating Systems for Supercomputers, 2016

From FLOPS to BYTES: disruptive change in high-performance computing towards the post-Moore era.

Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

Low Latency and Resource-Aware Program Composition for Large-Scale Data Analysis.

Masahiro Tanaka

Kentaro Torisawa

Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

A Static Cut-off for Task Parallel Programs.

Shintaro Iwasaki

Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015

SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures.

Hiroshi Inoue

Proc. VLDB Endow., 2015

DAGViz: a DAG visualization tool for analyzing task-parallel program traces.

Proceedings of the 2nd Workshop on Visual Performance Analysis, 2015

Scalable Task-Parallel SGD on Matrix Factorization in Multicore Architectures.

Yusuke Nishioka

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Uni-Address Threads: Scalable Thread Management for RDMA-Based Work Stealing.

Shigeki Akiyama

Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

2014

Faster Set Intersection with SIMD instructions by Reducing Branch Mispredictions.

Hiroshi Inoue

Moriyoshi Ohara

Proc. VLDB Endow., 2014

ParaLite: A Parallel Database System for Data-Intensive Workflows.

IEICE Trans. Inf. Syst., 2014

Scalable analysis of multicore data reuse and sharing.

Miquel Pericàs

Proceedings of the 2014 International Conference on Supercomputing, 2014

MassiveThreads: A Thread Library for High Productivity Languages.

Jun Nakashima

Proceedings of the Concurrent Objects and Beyond, 2014

2013

Fork-Join and Data-Driven Execution Models on Multi-core Architectures: Case Study of the FMM.

Proceedings of the Supercomputing - 28th International Supercomputing Conference, 2013

Analysis of Data Reuse in Task-Parallel Runtimes.

Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

Design and implementation of a customizable work stealing scheduler.

Jun Nakashima

Sho Nakatani

Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, 2013

Parallel and memory-efficient Burrows-Wheeler transform.

Shinya Hayashi

Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

A selective checkpointing mechanism for query plans in a parallel database system.

Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012

Design and Implementation of a High Productivity Language with Communication Aggregation for Parallel Scientific Computation.

Katsuaki Ikegami

Inf. Media Technol., 2012

Parallel Computational Reconfiguration Based on a PGAS Model.

Kentaro Hara

Inf. Media Technol., 2012

Half-process: A Process Partially Sharing Its Address Space with Other Processes.

Kentaro Hara

Inf. Media Technol., 2012

A Task Parallel Implementation of Fast Multipole Methods.

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Acceleration of Data-Intensive Workflow Applications by Using File Access History.

Miki Horiuchi

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

A Comparative Study of Data Processing Approaches for Text Processing Workflows.

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

An Empirical Performance Study of Chapel Programming Language.

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

ParaLite: Supporting Collective Queries in Database System to Parallelize User-Defined Executable.

Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012

2011

Performance Evaluation of a Distributed File System with Locality-Aware Metadata Lookups.

Inf. Media Technol., 2011

2010

Easy and instantaneous processing for data-intensive workflows.

Proceedings of the 3rd Workshop on Many-Task Computing on Grids and Supercomputers, 2010

File-access patterns of data-intensive workflow applications and their implications to distributed filesystems.

Takeshi Shibata

SungJun Choi

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

A global address space framework for irregular applications.

Kentaro Hara

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

ParaTrac: a fine-grained profiler for data-intensive workflows.

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

Design and Implementation of GXP Make - A Workflow System Based on Make.

Proceedings of the Sixth International Conference on e-Science, 2010

File-Access Characteristics of Data-Intensive Workflow Applications.

Takeshi Shibata

SungJun Choi

Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

Fine-Grained Profiling for Data-Intensive Workflows.

Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

2009

High performance wide-area overlay using deadlock-free routing.

Ken Hironaka

Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, 2009

GMount: An Ad Hoc and Locality-Aware Distributed File System by Using SSH and FUSE.

Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 2009

2008

gluepy: A Simple Distributed Python Programming Framework for Complex Grid Environments.

Proceedings of the Languages and Compilers for Parallel Computing, 2008

A scalable high-performance communication library for wide-area environments.

Ken Hironaka

Proceedings of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan, September 29, 2008

GMount: Build your grid file system on the fly.

Proceedings of the 9th IEEE/ACM International Conference on Grid Computing (Grid 2008), Tsukuba, Japan, September 29, 2008

A Stable Broadcast Algorithm.

Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

Scalable Data Gathering for Real-Time Monitoring Systems on Distributed Computing.

Yoshikazu Kamoshida

Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

2007

A Low-stretch Object Migration Scheme for Wide-area Environments.

Ken Hironaka

Inf. Media Technol., 2007

Locality-aware connection management and rank assignment for wide-area MPI.

Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2007

A fast topology inference: a building block for network-aware parallel processing.

Tatsuya Shirai

Proceedings of the 16th International Symposium on High-Performance Distributed Computing (HPDC-16 2007), 2007

Locality-aware Connection Management and Rank Assignment for Wide-area MPI.

Proceedings of the Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 2007

2006

Autonomous Collaborative Environment for Project Based Learning.

Proceedings of the Intelligent Autonomous Systems 9, 2006

Monte Carlo Go Has a Way to Go.

Proceedings, 2006

2005

Worldwide computing: Adaptive middleware and programming technology for dynamic Grid environments.

Carlos A. Varela

Paolo Ciancarini

Sci. Program., 2005

An Adaptive File Distribution Algorithm for Wide Area Network.

Takashi Hoshino

Scalable Comput. Pract. Exp., 2005

Collective operations for wide-area message passing systems using adaptive spanning trees.

Proceedings of the 6th IEEE/ACM International Conference on Grid Computing (GRID 2005), 2005

A scalable and efficient self-organizing failure detector for grid applications.

Yuuki Horita

Proceedings of the 6th IEEE/ACM International Conference on Grid Computing (GRID 2005), 2005

Highly latency tolerant Gaussian elimination.

Proceedings of the 6th IEEE/ACM International Conference on Grid Computing (GRID 2005), 2005

2004

Routing and resource discovery in Phoenix Grid-enabled message passing library.

Kenji Kaneda

Proceedings of the 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2004), 2004

High performance LU factorization for non-dedicated clusters.

Proceedings of the 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2004), 2004

2003

Phoenix: a parallel programming model for accommodating dynamically joining/leaving resources.

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2003

2002

Reducing pause time of conservative collectors.

Proceedings of The Workshop on Memory Systems Performance (MSP 2002), 2002

AnZenMail: A Secure and Certified E-mail System.

Proceedings of the Software Security -- Theories and Systems, 2002

Virtual Private Grid: A Command Shell for Utilizing Hundreds of Machines Efficiently.

Kenji Kaneda

Proceedings of the 2nd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2002), 2002

2001

Fusion of Concurrent Invocations of Exclusive Methods.

Yoshihiro Oyama

Proceedings of the Parallel Computing Technologies, 2001

Predicting Scalability of Parallel Garbage Collectors on Shared Memory Multiprocessors.

Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

2000

Extending Java virtual machine with integer-reference conversion.

Yutaka Oiwa

Concurr. Pract. Exp., 2000

The MicroGrid: a Scientific Tool for Modeling Computational Grids.

Proceedings of Supercomputing 2000, 2000

Performance Evaluation of OpenMP Applications with Nested Parallelism.

Proceedings of the Languages, 2000

Online Computation of Critical Paths for Multithreaded Languages.

Yoshihiro Oyama

Proceedings of the Parallel and Distributed Processing, 2000

A Heuristic Algorithm for Mapping Communicating Tasks on Heterogeneous Resources.

Andrew A. Chien

Proceedings of the 9th Heterogeneous Computing Workshop, 2000

1999

StackThreads/MP: Integrating Futures into Calling Standards.

Kunio Tabata

Proceedings of the 1999 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPOPP'99), 1999

1998

Comparing Reference Counting and Global Mark-and-Sweep on Parallel Computers.

Hirotaka Yamamoto

Proceedings of the Languages, 1998

1997

A Scalable Mark-Sweep Garbage Collector on Large-Scale Shared-Memory Machines.

Proceedings of the ACM/IEEE Conference on Supercomputing, 1997

An Effective Garbage Collection Strategy for Parallel Programming Languages on Large Scale Distributed-Memory Machines.

Proceedings of the Sixth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1997

Fine-grain Multithreading with Minimal Compiler Support - A Cost Effective Approach to Implementing Efficient Multithreading Languages.

Proceedings of the ACM SIGPLAN '97 Conference on Programming Language Design and Implementation (PLDI), 1997

An Efficient Compilation Framework for Languages Based on a Concurrent Process Calculus.

Yoshihiro Oyama

Proceedings of the Euro-Par '97 Parallel Processing, 1997

1996

Visualization of RNA secondary structures using highly parallel computers.

Comput. Appl. Biosci., 1996

1995

Schematic: A Concurrent Object-Oriented Extension to Scheme.

Proceedings of the Object-Based Parallel and Distributed Computation, 1995

1994

StackThreads: An Abstract Machine for Scheduling Fine-Grain Threads on Stock CPUs.

Proceedings of the Theory and Practice of Parallel Programming, 1994

ABCL/f: A Future-Based Polymorphic Typed Concurrent Object-Oriented Language - Its Design and Implementation.

Proceedings of the Specification of Parallel Algorithms, 1994

1993

Implementing concurrent object-oriented languages on multicomputers.

IEEE Parallel Distributed Technol. Syst. Appl., 1993

Highly Efficient and Encapsulated Re-use of Synchronization Code in Concurrent Object-Oriented Languages.

Proceedings of the Eighth Annual Conference on Object-Oriented Programming Systems, 1993

1992

An Efficient Implementation Scheme of Concurrent Object-Oriented Languages on Stock Multicomputers.
