We stand with Ukraine

We stand with Ukraine

Yanying Lin

Orcid: 0000-0002-4809-9543

According to our database¹, Yanying Lin authored at least 19 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Workload-Adapted Resource Allocation for LLM Distributed Serving in Serverless Clusters.

[DOI]

,

,

,

,

,

,

IEEE Trans. Parallel Distributed Syst., June, 2026

FASER: Fine-Grained Phase Management for Speculative Decoding in Dynamic LLM Serving.

[DOI]

,

,

,

Dmitrii Ustiugov

CoRR, April, 2026

FlexPipe: Adapting Dynamic LLM Serving Through Inflight Pipeline Refactoring in Fragmented Serverless Clusters.

[DOI]

,

,

,

,

Proceedings of the 21st European Conference on Computer Systems, 2026

2025

Serving LLM in Distributed GPU Cluster With Fine-Grain Pipeline Constraints.

[DOI]

,

,

,

,

,

,

IEEE Trans. Serv. Comput., 2025

Understanding Serverless Inference in Mobile-Edge Networks: A Benchmark Approach.

[DOI]

,

,

,

,

Kenneth B. Kent

,

,

,

IEEE Trans. Cloud Comput., 2025

IAE-LoRa: Interference-Aware and Energy-Efficient LoRa Optimization Using Reinforcement Learning.

[DOI]

,

,

,

,

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

Rock: Serving Multimodal Models in Cloud with Heterogeneous-Aware Resource Orchestration for Thousands of LoRA Adapters.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Cluster Computing, 2025

Understanding Diffusion Model Serving in Production: A Top-Down Analysis of Workload, Scheduling, and Resource Efficiency.

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the 2025 ACM Symposium on Cloud Computing, 2025

2024

Planck: Optimizing LLM Inference Performance in Pipeline Parallelism with Fine-Grained SLO Constraint.

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Web Services, 2024

QUART: Latency-Aware FaaS System for Pipelining Large Model Inference.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 44th IEEE International Conference on Distributed Computing Systems, 2024

EINS: Edge-Cloud Deep Model Inference with Network-Efficiency Schedule in Serverless.

[DOI]

,

,

,

,

,

Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

2023

A Novel Multimodal Deep Learning Framework for Encrypted Traffic Classification.

[DOI]

,

,

,

,

IEEE/ACM Trans. Netw., June, 2023

Serverless Computing: State-of-the-Art, Challenges and Opportunities.

[DOI]

,

,

,

,

IEEE Trans. Serv. Comput., 2023

FLASH: Low-Latency Serverless Model Inference with Multi-Core Parallelism in Edge.

[DOI]

,

,

,

,

,

,

Proceedings of the 29th IEEE International Conference on Parallel and Distributed Systems, 2023

2022

System-level Implications of Serverless: Workload Characterizing and Performance Understanding.

[DOI]

,

,

Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2022

ESBench: Understanding Deep Learning Inference Overheads for Edge Serverless.

[DOI]

,

,

,

Proceedings of the 24th IEEE Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, 2022

2021

PEAN: A Packet-level End-to-end Attentive Network for Encrypted Traffic Identification.

[DOI]

,

,

,

,

Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021

BBServerless: A Bursty Traffic Benchmark for Serverless.

[DOI]

,

,

,

,

,

Proceedings of the Cloud Computing - CLOUD 2021, 2021

2020

LBNN: Perceiving the State Changes of a Core Telecommunications Network via Linear Bayesian Neural Network.

[DOI]

,

,

,

,

,

Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020

Loading...