Shulai Zhang

Orcid: 0000-0002-0802-7203

According to our database1, Shulai Zhang authored at least 17 papers between 2019 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

Arena: Efficiently Training Large Models via Dynamic Scheduling and Adaptive Parallelism Co-Design.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
Harli: SLO-Aware Co-location of LLM Inference and PEFT-based Finetuning on Model-as-a-Service Platforms.
CoRR, November, 2025

Boosting Embodied AI Agents through Perception-Generation Disaggregation and Asynchronous Pipeline Execution.
CoRR, September, 2025

ARACHNE: Optimizing Distributed Parallel Applications with Reduced Inter-Process Communication.
ACM Trans. Archit. Code Optim., June, 2025

Efficient Performance-Aware GPU Sharing with Compatibility and Isolation through Kernel Space Interception.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

COMET: Fine-grained Computation-communication Overlapping for Mixture-of-Experts.
Proceedings of the Eighth Conference on Machine Learning and Systems, 2025

Improving GPU Sharing Performance through Adaptive Bubbleless Spatial-Temporal Sharing.
Proceedings of the Twentieth European Conference on Computer Systems, 2025

2024
Towards Fast Setup and High Throughput of GPU Serverless Computing.
CoRR, 2024

A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters.
CoRR, 2024

2023
Maximizing the Utilization of GPUs Used by Cloud Gaming through Adaptive Co-location with Combo.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

2022
PAME: precision-aware multi-exit DNN serving for reducing latencies of batched inferences.
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022

2021
Dubhe: Towards Data Unbiasedness with Homomorphic Encryption in Federated Learning Client Selection.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

2020
Difficulty Prediction for Proof-of-Work Based Blockchains.
Proceedings of the 21st IEEE International Workshop on Signal Processing Advances in Wireless Communications, 2020

A General Difficulty Control Algorithm for Proof-of-Work Based Blockchains.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Facial Image Deformation Based on Landmark Detection.
CoRR, 2019

Exploiting Caching and Prediction to Promote User Experience for a Real-Time Wireless VR Service.
Proceedings of the 2019 IEEE Global Communications Conference, 2019


  Loading...