Zhiyuan Shao

Orcid: 0000-0003-2139-6465

According to our database¹, Zhiyuan Shao authored at least 57 papers between 2003 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2025

DynPipe: Toward Dynamic End-to-End Pipeline Parallelism for Interference-Aware DNN Training.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., November, 2025

RT-GNN: Accelerating Sparse Graph Neural Networks by Tensor-CUDA Kernel Fusion.

[BibT_eX]

[DOI]

ACM Trans. Archit. Code Optim., March, 2025

BRP-SpMM: Block-Row Partition Based Sparse Matrix Multiplication with Tensor and CUDA Cores.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2025

CeDMA: Enhancing Memory Efficiency of Heterogeneous Accelerator Systems Through Central DMA Controlling.

[BibT_eX]

[DOI]

Proceedings of the Advanced Parallel Processing Technologies, 2025

2024

A survey on dynamic graph processing on GPUs: concepts, terminologies and systems.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., August, 2024

ScalaBFS2: A High-performance BFS Accelerator on an HBM-enhanced FPGA Chip.

[BibT_eX]

[DOI]

ACM Trans. Reconfigurable Technol. Syst., June, 2024

Towards High-Performance Graph Processing: From a Hardware/Software Co-Design Perspective.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., March, 2024

MiCache: An MSHR-inclusive Non-blocking Cache Design for FPGAs.

[BibT_eX]

[DOI]

Proceedings of the 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, 2024

Parallel Truss Maintenance Algorithms for Dynamic Hypergraphs.

[BibT_eX]

[DOI]

Proceedings of the Computing and Combinatorics - 30th International Conference, 2024

2023

Evaluating RISC-V Vector Instruction Set Architecture Extension with Computer Vision Workloads.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., July, 2023

2022

Accelerating Backward Aggregation in GCN Training With Execution Path Preparing on GPUs.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2022

Cross-Language Binary-Source Code Matching with Intermediate Representations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Software Analysis, 2022

Towards Fast GPU-based Sparse DNN Inference: A Hybrid Compute Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE High Performance Extreme Computing Conference, 2022

2021

Efficient Graph Processing with Invalid Update Filtration.

[BibT_eX]

[DOI]

IEEE Trans. Big Data, 2021

ScalaBFS: A Scalable BFS Accelerator on HBM-Enhanced FPGAs.

[BibT_eX]

[DOI]

CoRR, 2021

ScalaBFS: A Scalable BFS Accelerator on FPGA-HBM Platform.

[BibT_eX]

[DOI]

Proceedings of the FPGA '21: The 2021 ACM/SIGDA International Symposium on Field Programmable Gate Arrays, Virtual Event, USA, February 28, 2021

Predicting Hepatoma-Related Genes Based on Representation Learning of PPI network and Gene Ontology Annotations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020

Processing Grid-format Real-world Graphs on DRAM-based FPGA Accelerators with Application-specific Caching Mechanisms.

[BibT_eX]

[DOI]

ACM Trans. Reconfigurable Technol. Syst., 2020

Optimizing Memory Performance of Xilinx FPGAs under Vitis.

[BibT_eX]

[DOI]

CoRR, 2020

Scaph: Scalable GPU-Accelerated Graph Processing with Value-Driven Differential Scheduling.

[BibT_eX]

[DOI]

Proceedings of the 2020 USENIX Annual Technical Conference, 2020

2019

Efficient Recommendation of De-Identification Policies Using MapReduce.

[BibT_eX]

[DOI]

IEEE Trans. Big Data, 2019

BlockGraphChi: Enabling Block Update in Out-of-Core Graph Processing.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2019

Improving Performance of Graph Processing on FPGA-DRAM Platform by Two-level Vertex Caching.

[BibT_eX]

[DOI]

Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

Fast Maximal Clique Enumeration for Real-World Graphs.

[BibT_eX]

[DOI]

Proceedings of the Database Systems for Advanced Applications, 2019

2018

Scalable Data Race Detection for Lock-Intensive Programs with Pending Period Representation.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2018

MomentSA: A Fast and Accurate Method for Stochastic Kronecker Graph Parameter Computing.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Conference on Computer Supported Cooperative Work in Design, 2018

2017

FOG: A Fast Out-of-Core Graph Processing Framework.

[BibT_eX]

[DOI]

Int. J. Parallel Program., 2017

A task-based approach for finding SCCs in real-world graphs on external memory.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2017

Data Race Detection by Understanding Synchronization Relationships of Thread Segments.

[BibT_eX]

[DOI]

Zhiyuan Shao

Jian Peng

Hai Jin

Proceedings of the 25th Euromicro International Conference on Parallel, 2017

2016

Finding SCCs in Real-World Graphs on External Memory: A Task-Based Approach.

[BibT_eX]

[DOI]

Proceedings of the 15th International Symposium on Parallel and Distributed Computing, 2016

Improving fairness of network bandwidth allocation for virtual machines in cloud environment.

[BibT_eX]

[DOI]

Zhiyuan Shao

Kai Zhang

Hai Jin

Proceedings of the 2016 IEEE International Black Sea Conference on Communications and Networking, 2016

2015

Is Your Graph Algorithm Eligible for Nondeterministic Execution?

[BibT_eX]

[DOI]

Proceedings of the 44th International Conference on Parallel Processing, 2015

2014

A segment-based sparse matrix-vector multiplication on CUDA.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2014

A GPU-based parallel method for evolutionary tree construction.

[BibT_eX]

[DOI]

Comput. Electr. Eng., 2014

2013

VSA: An offline scheduling analyzer for Xen virtual machine monitor.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2013

FRESA: A Frequency-Sensitive Sampling-Based Approach for Data Race Detection.

[BibT_eX]

[DOI]

Neng Huang

Zhiyuan Shao

Hai Jin

Proceedings of the Network and Parallel Computing - 10th IFIP International Conference, 2013

RTRM: A Response Time-Based Replica Management Strategy for Cloud Storage System.

[BibT_eX]

[DOI]

Proceedings of the Grid and Pervasive Computing - 8th International Conference, 2013

2012

Parallelization Mechanisms of Neighbor-Joining for CUDA Enabled Devices.

[BibT_eX]

[DOI]

Proceedings of the Seventh ChinaGrid Annual Conference, ChinaGrid 2012, Beijing, 2012

Implementing Smith-Waterman Algorithm with Two-Dimensional Cache on GPUs.

[BibT_eX]

[DOI]

Proceedings of the 2012 Second International Conference on Cloud and Green Computing, 2012

2011

Analyzing and Improving MPI Communication Performance in Overcommitted Virtualized Systems.

[BibT_eX]

[DOI]

Proceedings of the MASCOTS 2011, 2011

Optimization of Sparse Matrix-Vector Multiplication with Variant CSR on GPUs.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

2010

FTDS: Adjusting Virtual Computing Resources in Threshing Cases.

[BibT_eX]

[DOI]

Proceedings of the 18th Euromicro Conference on Parallel, 2010

2009

ClientVisor: leverage COTS OS functionalities for power management in virtualized desktop environment.

[BibT_eX]

[DOI]

ACM SIGOPS Oper. Syst. Rev., 2009

ClientVisor: leverage COTS OS functionalities for power management in virtualized desktop environment.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Virtual Execution Environments, 2009

Virtual Machine Resource Management for High Performance Computing Applications.

[BibT_eX]

[DOI]

Zhiyuan Shao

Hai Jin

Yong Li

Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2009

A performance study of web server based on Hardware-assisted Virtual Machine.

[BibT_eX]

[DOI]

Zhiyuan Shao

Hai Jin

De Zhang

Proceedings of the 7th IEEE/ACS International Conference on Computer Systems and Applications, 2009

2008

ER-TCP: an efficient TCP fault-tolerance scheme for cluster computing.

[BibT_eX]

[DOI]

J. Supercomput., 2008

Optimized Implementation of Ray Tracing on Cell Broadband Engine.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Multimedia and Ubiquitous Engineering (MUE 2008), 2008

Two-Level Parallel Implementation of FDTD Algorithm on CBE.

[BibT_eX]

[DOI]

Bo Li

Hai Jin

Zhiyuan Shao

Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2008

ChinaV: Building Virtualized Computing System.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008

2006

FreeSpeech: A Novel Wireless Approach for Conference Projecting and Cooperating.

[BibT_eX]

[DOI]

Proceedings of the Ubiquitous Intelligence and Computing, Third International Conference, 2006

Middleware Based High Performance and High Available Database Cluster.

[BibT_eX]

[DOI]

Zhiyuan Shao

Hai Jin

Proceedings of the Grid and Cooperative Computing, 2006

AR-TCP: Actively Replicated TCP Connections for Cluster of Workstations.

[BibT_eX]

[DOI]

Zhiyuan Shao

Hai Jin

Jie Wu

Proceedings of the Japan-China Joint Workshop on Frontier of Computer Science and Technology, 2006

2005

TCP-ABC: From Multiple TCP Connections to Atomic Broadcasting.

[BibT_eX]

[DOI]

Proceedings of the Network and Parallel Computing, IFIP International Conference, 2005

ER-TCP: An Efficient Fault-Tolerance Scheme for TCP Connections.

[BibT_eX]

[DOI]

Proceedings of the Parallel and Distributed Processing and Applications, 2005

2003

HARTs: high availability cluster architecture with redundant TCP stacks.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Performance Computing and Communications Conference, 2003

Cluster Architecture with Lightweighted Redundant TCP Stacks.

[BibT_eX]

[DOI]

Hai Jin

Zhiyuan Shao

Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

Zhiyuan Shao

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...