Zhaorui Zhang
Orcid: 0000-0003-0284-1113
According to our database1,
Zhaorui Zhang
authored at least 27 papers
between 2015 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
FedEFsz: Fair Cross-Silo Federated Learning System With Error-Bounded Lossy Compression.
IEEE Trans. Parallel Distributed Syst., December, 2025
Ocelot: An Interactive, Efficient Distributed Compression-As-a-Service Platform With Optimized Data Compression Techniques.
IEEE Trans. Parallel Distributed Syst., December, 2025
ACM Comput. Surv., November, 2025
MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model?
CoRR, September, 2025
FedCSpc: A Cross-Silo Federated Learning System With Error-Bounded Lossy Parameter Compression.
IEEE Trans. Parallel Distributed Syst., July, 2025
CoRR, March, 2025
CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning.
CoRR, February, 2025
ZCCL: Significantly Improving Collective Communication With Error-Bounded Lossy Compression.
CoRR, February, 2025
Recursive Confidence Training for Pseudo-Labeling Calibration in Semi-Supervised Few-Shot Learning.
IEEE Trans. Image Process., 2025
StoreLLM: Energy Efficient Large Language Model Inference with Permanently Pre-stored Attention Matrices.
Proceedings of the 16th ACM International Conference on Future and Sustainable Energy Systems, 2025
2024
Optimization of operating angles of disc coulters for maize residue management using discrete element method.
Comput. Electron. Agric., 2024
Proceedings of the International Conference for High Performance Computing, 2024
POSTER: Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024
A Compiler-Like Framework for Optimizing Cryptographic Big Integer Multiplication on GPUs.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
2023
Proceedings of the 2023 3rd International Conference on Big Data, 2023
2022
IEEE Trans. Parallel Distributed Syst., 2022
SaPus: Self-Adaptive Parameter Update Strategy for DNN Training on Multi-GPU Clusters.
IEEE Trans. Parallel Distributed Syst., 2022
Momentum-driven adaptive synchronization model for distributed DNN training on HPC clusters.
J. Parallel Distributed Comput., 2022
Proceedings of the Human Aspects of IT for the Aged Population. Technology in Everyday Living, 2022
2021
ACM Trans. Graph., 2021
2020
Proceedings of the Natural Language Processing and Chinese Computing, 2020
2016
FPGA-Based High-Performance Collision Detection: An Enabling Technique for Image-Guided Robotic Surgery.
Frontiers Robotics AI, 2016
2015
An Application Specific Instruction Set Processor (ASIP) for Adaptive Filters in Neural Prosthetics.
IEEE ACM Trans. Comput. Biol. Bioinform., 2015
Proceedings of the 7th International Conference on Cybernetics and Intelligent Systems, 2015