Yanying Lin
Orcid: 0000-0002-4809-9543
According to our database1,
Yanying Lin authored at least 19 papers
between 2020 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Workload-Adapted Resource Allocation for LLM Distributed Serving in Serverless Clusters.
IEEE Trans. Parallel Distributed Syst., June, 2026
FASER: Fine-Grained Phase Management for Speculative Decoding in Dynamic LLM Serving.
CoRR, April, 2026
FlexPipe: Adapting Dynamic LLM Serving Through Inflight Pipeline Refactoring in Fragmented Serverless Clusters.
Proceedings of the 21st European Conference on Computer Systems, 2026
2025
IEEE Trans. Serv. Comput., 2025
IEEE Trans. Cloud Comput., 2025
IAE-LoRa: Interference-Aware and Energy-Efficient LoRa Optimization Using Reinforcement Learning.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025
Rock: Serving Multimodal Models in Cloud with Heterogeneous-Aware Resource Orchestration for Thousands of LoRA Adapters.
Proceedings of the IEEE International Conference on Cluster Computing, 2025
Understanding Diffusion Model Serving in Production: A Top-Down Analysis of Workload, Scheduling, and Resource Efficiency.
Proceedings of the 2025 ACM Symposium on Cloud Computing, 2025
2024
Planck: Optimizing LLM Inference Performance in Pipeline Parallelism with Fine-Grained SLO Constraint.
Proceedings of the IEEE International Conference on Web Services, 2024
Proceedings of the 44th IEEE International Conference on Distributed Computing Systems, 2024
EINS: Edge-Cloud Deep Model Inference with Network-Efficiency Schedule in Serverless.
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024
2023
IEEE/ACM Trans. Netw., June, 2023
IEEE Trans. Serv. Comput., 2023
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023
2022
System-level Implications of Serverless: Workload Characterizing and Performance Understanding.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2022
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022
2021
PEAN: A Packet-level End-to-end Attentive Network for Encrypted Traffic Identification.
Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021
Proceedings of the Cloud Computing - CLOUD 2021, 2021
2020
LBNN: Perceiving the State Changes of a Core Telecommunications Network via Linear Bayesian Neural Network.
Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020