Yasushi Negishi

According to our database1, Yasushi Negishi authored at least 20 papers between 1996 and 2020.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2020
Compiling ONNX Neural Network Models Using MLIR.
CoRR, 2020

Acceleration of Large Deep Learning Training with Hybrid GPU Memory Management of Swapping and Re-computing.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019
Profiling based Out-of-core Hybrid Method for Large Neural Networks.
CoRR, 2019

Profiling based out-of-core hybrid method for large neural networks: poster.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

High Resolution Medical Image Segmentation Using Data-Swapping Method.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Automatic GPU memory management for large neural models in TensorFlow.
Proceedings of the 2019 ACM SIGPLAN International Symposium on Memory Management, 2019

2018
Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method.
CoRR, 2018

TFLMS: Large Model Support in TensorFlow by Graph Rewriting.
CoRR, 2018

Involving CPUs into Multi-GPU Deep Learning.
Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering, 2018

2013
An event-processing system alerting analytically to networked vehicles.
Proceedings of the 16th International IEEE Conference on Intelligent Transportation Systems, 2013

2012
A Systematic Approach toward Automated Performance Analysis and Tuning.
IEEE Trans. Parallel Distributed Syst., 2012

A static analysis tool using a three-step approach for data races in HPC programs.
Proceedings of the 10th Workshop on Parallel and Distributed Systems: Testing, 2012

An Efficient Framework for Multi-dimensional Tuning of High Performance Computing Applications.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Tool-assisted Optimization of Shared-memory Accesses in UPC Applications.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

2010
Overlapping Methods of All-to-All Communication and FFT Algorithms for Torus-Connected Massively Parallel Supercomputers.
Proceedings of the Conference on High Performance Computing Networking, 2010

Application tuning through bottleneck-driven refactoring.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009
A proposal of operation history management system for source-to-source optimization of HPC programs.
Proceedings of the 7th Workshop on Parallel and Distributed Systems: Testing, 2009

A Holistic Approach towards Automated Performance Analysis and Tuning.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2000
Efficient Error Correction Code Configurations for Quasi-Nonvolatile Data Retention by DRAMs.
Proceedings of the 15th IEEE International Symposium on Defect and Fault-Tolerance in VLSI Systems (DFT 2000), 2000

1996
A Portable Communication System for Video-On-Demand Applications Using the Existing Infrastructure.
Proceedings of the Proceedings IEEE INFOCOM '96, 1996


  Loading...