Nan Ding

Orcid: 0000-0001-9624-9449

Affiliations:
  • Lawrence Berkeley National Laboratory, Computational Research Division, Berkeley, CA, USA
  • Tsinghua University, Department of Computer Science and Technology, Beijing, China (PhD 2018)


According to our database1, Nan Ding authored at least 18 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Evaluating the Potential of Disaggregated Memory Systems for HPC applications.
CoRR, 2023

Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters.
Proceedings of the International Conference for High Performance Computing, 2023

Evaluating the Performance of One-sided Communication on CPUs and GPUs.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

2022
Instruction Roofline: An insightful visual performance model for GPUs.
Concurr. Comput. Pract. Exp., 2022

A Methodology for Evaluating Tightly-integrated and Disaggregated Accelerated Architectures.
Proceedings of the IEEE/ACM International Workshop on Performance Modeling, 2022

2021
Accelerating large scale <i>de novo</i> metagenome assembly using GPUs.
Proceedings of the International Conference for High Performance Computing, 2021

Evaluating Performance and Portability of a core bioinformatics kernel on multiple vendor GPUs.
Proceedings of the International Workshop on Performance, 2021

A Message-Driven, Multi-GPU Parallel Sparse Triangular Solver.
Proceedings of the 2021 SIAM Conference on Applied and Computational Discrete Algorithms, 2021

2020
APMT: an automatic hardware counter-based performance modeling tool for HPC applications.
CCF Trans. High Perform. Comput., 2020

Leveraging One-Sided Communication for Sparse Triangular Solvers.
Proceedings of the 2020 SIAM Conference on Parallel Processing for Scientific Computing, 2020

LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

GPU accelerated partial order multiple sequence alignment for long reads self-correction.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Using hardware counter-based performance model to diagnose scaling issues of HPC applications.
Neural Comput. Appl., 2019

An automatic performance model-based scheduling tool for coupled climate system models.
J. Parallel Distributed Comput., 2019

An Instruction Roofline Model for GPUs.
Proceedings of the 2019 IEEE/ACM Performance Modeling, 2019

2017
Redesigning CAM-SE for peta-scale climate modeling performance and ultra-high resolution on Sunway TaihuLight.
Proceedings of the International Conference for High Performance Computing, 2017

2016
Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer.
Proceedings of the International Conference for High Performance Computing, 2016

2014
CESMTuner: An Auto-tuning Framework for the Community Earth System Model.
Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014


  Loading...