Yanying Lin

Orcid: 0000-0002-4809-9543

According to our database1, Yanying Lin authored at least 19 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Workload-Adapted Resource Allocation for LLM Distributed Serving in Serverless Clusters.
IEEE Trans. Parallel Distributed Syst., June, 2026

FASER: Fine-Grained Phase Management for Speculative Decoding in Dynamic LLM Serving.
CoRR, April, 2026

FlexPipe: Adapting Dynamic LLM Serving Through Inflight Pipeline Refactoring in Fragmented Serverless Clusters.
Proceedings of the 21st European Conference on Computer Systems, 2026

2025
Serving LLM in Distributed GPU Cluster With Fine-Grain Pipeline Constraints.
IEEE Trans. Serv. Comput., 2025

Understanding Serverless Inference in Mobile-Edge Networks: A Benchmark Approach.
IEEE Trans. Cloud Comput., 2025

IAE-LoRa: Interference-Aware and Energy-Efficient LoRa Optimization Using Reinforcement Learning.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Rock: Serving Multimodal Models in Cloud with Heterogeneous-Aware Resource Orchestration for Thousands of LoRA Adapters.
Proceedings of the IEEE International Conference on Cluster Computing, 2025

Understanding Diffusion Model Serving in Production: A Top-Down Analysis of Workload, Scheduling, and Resource Efficiency.
Proceedings of the 2025 ACM Symposium on Cloud Computing, 2025

2024
Planck: Optimizing LLM Inference Performance in Pipeline Parallelism with Fine-Grained SLO Constraint.
Proceedings of the IEEE International Conference on Web Services, 2024

QUART: Latency-Aware FaaS System for Pipelining Large Model Inference.
Proceedings of the 44th IEEE International Conference on Distributed Computing Systems, 2024

EINS: Edge-Cloud Deep Model Inference with Network-Efficiency Schedule in Serverless.
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

2023
A Novel Multimodal Deep Learning Framework for Encrypted Traffic Classification.
IEEE/ACM Trans. Netw., June, 2023

Serverless Computing: State-of-the-Art, Challenges and Opportunities.
IEEE Trans. Serv. Comput., 2023

FLASH: Low-Latency Serverless Model Inference with Multi-Core Parallelism in Edge.
Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

2022
System-level Implications of Serverless: Workload Characterizing and Performance Understanding.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2022

ESBench: Understanding Deep Learning Inference Overheads for Edge Serverless.
Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

2021
PEAN: A Packet-level End-to-end Attentive Network for Encrypted Traffic Identification.
Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021

BBServerless: A Bursty Traffic Benchmark for Serverless.
Proceedings of the Cloud Computing - CLOUD 2021, 2021

2020
LBNN: Perceiving the State Changes of a Core Telecommunications Network via Linear Bayesian Neural Network.
Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020


  Loading...