Zhuoran Ji
Orcid: 0000-0001-9767-2767
According to our database, Zhuoran Ji authored at least 18 papers between 2018 and 2025.
  
  
Bibliography
  2025
FedCSpc: A Cross-Silo Federated Learning System With Error-Bounded Lossy Parameter Compression.
    
  
    IEEE Trans. Parallel Distributed Syst., July, 2025
    
  
Cube-fx: Mapping Taylor Expansion Onto Matrix Multiplier-Accumulators of Huawei Ascend AI Processors.
    
  
    IEEE Trans. Parallel Distributed Syst., June, 2025
    
  
VESTA: A Secure and Efficient FHE-based Three-Party Vectorized Evaluation System for Tree Aggregation Models.
    
  
    Proceedings of the Abstracts of the 2025 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, 2025
    
  
Accelerating Number Theoretic Transform with Multi-GPU Systems for Efficient Zero Knowledge Proof.
    
  
    Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
    
  
  2024
POSTER: Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs.
    
  
    Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024
    
  
A Compiler-Like Framework for Optimizing Cryptographic Big Integer Multiplication on GPUs.
    
  
    Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024
    
  
Accelerating Multi-Scalar Multiplication for Efficient Zero Knowledge Proofs with Multi-GPU Systems.
    
  
    Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024
    
  
  2023
    Proceedings of the 43rd IEEE International Conference on Distributed Computing Systems, 2023
    
  
  2022
Momentum-driven adaptive synchronization model for distributed DNN training on HPC clusters.
    
  
    J. Parallel Distributed Comput., 2022
    
  
    Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
    
  
Efficient exact K-nearest neighbor graph construction for billion-scale datasets using GPUs with tensor cores.
    
  
    Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022
    
  
Optimizing Aggregate Computation of Graph Neural Networks with on-GPU Interpreter-Style Programming.
    
  
    Proceedings of the International Conference on Parallel Architectures and Compilation Techniques, 2022
    
  
  2021
    Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021
    
  
    Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
    
  
    Proceedings of the Euro-Par 2021: Parallel Processing, 2021
    
  
  2019
HNMTP Conv: Optimize Convolution Algorithm for Single-Image Convolution Neural Network Inference on Mobile GPUs.
    
  
    CoRR, 2019
    
  
HG-Caffe: Mobile and Embedded Neural Network GPU (OpenCL) Inference Engine with FP16 Supporting.
    
  
    CoRR, 2019
    
  
  2018