Zhenkun Cai

Orcid: 0000-0002-0199-4866

According to our database1, Zhenkun Cai authored at least 20 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
DiskGNN: Bridging I/O Efficiency and Model Accuracy for Out-of-Core GNN Training.
Proc. ACM Manag. Data, February, 2025

Adaptive Parallel Training for Graph Neural Networks.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

MuseGNN: Forming Scalable, Convergent GNN Layers that Minimize a Sampling-Based Energy.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
PPS: Fair and efficient black-box scheduling for multi-tenant GPU clusters.
Parallel Comput., 2024

4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs.
CoRR, 2024

4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DF-GNN: Dynamic Fusion Framework for Attention Graph Neural Networks on GPUs.
Proceedings of the Learning on Graphs Conference, 26-29 November 2024, Virtual., 2024

2023
FEC: Efficient Deep Recommendation Model Training with Flexible Embedding Communication.
Proc. ACM Manag. Data, 2023

MuseGNN: Interpretable and Convergent Graph Neural Network Layers at Scale.
CoRR, 2023

gSampler: General and Efficient GPU-based Graph Sampling for Graph Learning.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

DSP: Efficient GNN Training with Multiple GPUs.
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

DGI: An Easy and Efficient Framework for GNN Model Evaluation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

2022
Elastic Deep Learning in Multi-Tenant GPU Clusters.
IEEE Trans. Parallel Distributed Syst., 2022

TensorOpt: Exploring the Tradeoffs in Distributed DNN Training With Auto-Parallelism.
IEEE Trans. Parallel Distributed Syst., 2022

DGI: Easy and Efficient Inference for GNNs.
CoRR, 2022

2021
Seastar: vertex-centric programming for graph neural networks.
Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

DGCL: an efficient communication library for distributed GNN training.
Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

2020
Improving resource utilization by timely fine-grained scheduling.
Proceedings of the EuroSys '20: Fifteenth EuroSys Conference 2020, 2020

2018
FlexPS: Flexible Parallelism Control in Parameter Server Architecture.
Proc. VLDB Endow., 2018

Scalable De Novo Genome Assembly Using Pregel.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018


  Loading...