Tingfeng Lan

According to our database1, Tingfeng Lan authored at least 8 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Efficient and Workload-Aware LLM Serving via Runtime Layer Swapping and KV Cache Resizing.
CoRR, June, 2025

SCORPIO: Serving the Right Requests at the Right Time for Heterogeneous SLOs in LLM Inference.
CoRR, May, 2025

ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates.
CoRR, May, 2025

Towards Efficient LLM Storage Reduction via Tensor Deduplication and Delta Compression.
CoRR, May, 2025

mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs.
Proc. VLDB Endow., February, 2025

λScale: Enabling Fast Scaling for Serverless Large Language Model Inference.
CoRR, February, 2025

2024
DLRover-RM: Resource Optimization for Deep Recommendation Models Training in the cloud.
Proc. VLDB Endow., August, 2024

2023
ASPEN: High-Throughput LoRA Fine-Tuning of Large Language Models with a Single GPU.
CoRR, 2023


  Loading...