Runhua Zhang

Orcid: 0000-0003-3487-5289

Affiliations:
  • Beihang University, Beijing Key Lab Digital Media, State Key Lab Virtual Real Technology and Systems, China
  • Guizhou University, Department of Computer Science and Technology, Beijing, China (former)


According to our database1, Runhua Zhang authored at least 13 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
LSAF: A load-balancing SpGEMM acceleration framework with dynamic package and static partition for multi-core systolic arrays.
Parallel Comput., 2026

2025
FlexPie: Accelerate Distributed Inference on Edge Devices with Flexible Combinatorial Optimization[Technical Report].
CoRR, February, 2025

JOVS: Joint Optimization of Vectorization and Scheduling for DNN on AI DSPs.
Proceedings of the 37th ACM Symposium on Parallelism in Algorithms and Architectures, 2025

SAM-Lightning: Segment Anything Model for Efficient Inference and Reduced Memory Footprint.
Proceedings of the PRICAI 2025: Trends in Artificial Intelligence, 2025

SparDR: Accelerating Unstructured Sparse DNN Inference via Dataflow Optimization.
Proceedings of the Design, Automation & Test in Europe Conference, 2025

2024
A high-performance dataflow-centric optimization framework for deep learning inference on the edge.
J. Syst. Archit., 2024

2023
LOCP: Latency-optimized channel pruning for CNN inference acceleration on GPUs.
J. Supercomput., September, 2023

Xenos : Dataflow-Centric Optimization to Accelerate Model Inference on Edge Devices.
Proceedings of the Database Systems for Advanced Applications, 2023

2022
CNNBooster: Accelerating CNN Inference with Latency-aware Channel Pruning for GPU.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2022

2021
Robustness-aware 2-bit quantization with real-time performance for neural network.
Neurocomputing, 2021

SRQ: Self-Reference quantization scheme for lightweight neural network.
Proceedings of the 31st Data Compression Conference, 2021

2020
DSANA: A distributed machine learning acceleration solution based on dynamic scheduling and network acceleration.
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020

2019
HiPower: A High-Performance RDMA Acceleration Solution for Distributed Transaction Processing.
Proceedings of the Network and Parallel Computing, 2019


  Loading...