Zinuo Cai

ORCID: 0000-0001-9373-8474

According to our database, Zinuo Cai authored at least 18 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
SMore: Enhancing GPU Utilization in Deep Learning Clusters by Serverless-Based Co-Location Scheduling.
IEEE Trans. Parallel Distributed Syst., May, 2025

MemoriaNova: Optimizing Memory-Aware Model Inference for Edge Computing.
ACM Trans. Archit. Code Optim., March, 2025

FasDL: An Efficient Serverless-Based Training Architecture With Communication Optimization and Resource Configuration.
IEEE Trans. Computers, February, 2025

LLMaaS: Serving Large-Language Models on Trusted Serverless Computing Platforms.
IEEE Trans. Artif. Intell., February, 2025

AARC: Automated Affinity-aware Resource Configuration for Serverless Workflows.
CoRR, February, 2025

HyperTuneFaaS: A serverless framework for hyperparameter tuning in image processing models.
Displays, 2025

2024
SLoB: Suboptimal Load Balancing Scheduling in Local Heterogeneous GPU Clusters for Large Language Model Inference.
IEEE Trans. Comput. Soc. Syst., December, 2024

SPSC: Stream Processing Framework Atop Serverless Computing for Industrial Big Data.
IEEE Trans. Cybern., November, 2024

SMSS: Stateful Model Serving in Metaverse With Serverless Computing and GPU Sharing.
IEEE J. Sel. Areas Commun., March, 2024

RIDIC: Real-Time Intelligent Transportation System With Dispersed Computing.
IEEE Trans. Intell. Transp. Syst., January, 2024

Sustainable Serverless Computing With Cold-Start Optimization and Automatic Workflow Resource Scheduling.
IEEE Trans. Sustain. Comput., 2024

faaShark: An End-to-End Network Traffic Analysis System Atop Serverless Computing.
IEEE Trans. Netw. Sci. Eng., 2024

Deep Convolutional Linear Precoder Neural Network for Rate Splitting Strategy of Aerial Computing Networks.
IEEE Trans. Netw. Sci. Eng., 2024

Hermes: Memory-Efficient Pipeline Inference for Large Models on Edge Devices.
Proceedings of the 42nd IEEE International Conference on Computer Design, 2024

2023
GUARDIAN: A Hardware-Assisted Distributed Framework to Enhance Deep Learning Security.
IEEE Trans. Comput. Soc. Syst., December, 2023

Chrion: Optimizing Recurrent Neural Network Inference by Collaboratively Utilizing CPUs and GPUs.
CoRR, 2023

Towards Variance Reduction for Reinforcement Learning of Industrial Decision-making Tasks: A Bi-Critic based Demand-Constraint Decoupling Approach.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

2021
Themis: A Fair Evaluation Platform for Computer Vision Competitions.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

