Runhua Zhang

Orcid: 0000-0003-3487-5289

Affiliations:

Beihang University, Beijing Key Lab Digital Media, State Key Lab Virtual Real Technology and Systems, China
Guizhou University, Department of Computer Science and Technology, Beijing, China (former)

According to our database¹, Runhua Zhang authored at least 13 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

LSAF: A load-balancing SpGEMM acceleration framework with dynamic package and static partition for multi-core systolic arrays.

[BibT_eX]

[DOI]

Parallel Comput., 2026

2025

FlexPie: Accelerate Distributed Inference on Edge Devices with Flexible Combinatorial Optimization[Technical Report].

[BibT_eX]

[DOI]

CoRR, February, 2025

JOVS: Joint Optimization of Vectorization and Scheduling for DNN on AI DSPs.

[BibT_eX]

[DOI]

Proceedings of the 37th ACM Symposium on Parallelism in Algorithms and Architectures, 2025

SAM-Lightning: Segment Anything Model for Efficient Inference and Reduced Memory Footprint.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2025: Trends in Artificial Intelligence, 2025

SparDR: Accelerating Unstructured Sparse DNN Inference via Dataflow Optimization.

[BibT_eX]

[DOI]

Proceedings of the Design, Automation & Test in Europe Conference, 2025

2024

A high-performance dataflow-centric optimization framework for deep learning inference on the edge.

[BibT_eX]

[DOI]

J. Syst. Archit., 2024

2023

LOCP: Latency-optimized channel pruning for CNN inference acceleration on GPUs.

[BibT_eX]

[DOI]

J. Supercomput., September, 2023

Xenos : Dataflow-Centric Optimization to Accelerate Model Inference on Edge Devices.

[BibT_eX]

[DOI]

Proceedings of the Database Systems for Advanced Applications, 2023

2022

CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU.

[BibT_eX]

[DOI]

Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2022

2021

Robustness-aware 2-bit quantization with real-time performance for neural network.

[BibT_eX]

[DOI]

Neurocomputing, 2021

SRQ: Self-Reference quantization scheme for lightweight neural network.

[BibT_eX]

[DOI]

Proceedings of the 31st Data Compression Conference, 2021

2020

DSANA: A distributed machine learning acceleration solution based on dynamic scheduling and network acceleration.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

2019

HiPower: A High-Performance RDMA Acceleration Solution for Distributed Transaction Processing.

[BibT_eX]

[DOI]

Proceedings of the Network and Parallel Computing, 2019

Runhua Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...