Minchen Yu
Orcid: 0000-0002-6797-9028
  According to our database1,
  Minchen Yu
  authored at least 18 papers
  between 2018 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
  2025
MemShare: Memory Efficient Inference for Large Reasoning Models through KV Cache Reuse.
    
  
    CoRR, July, 2025
    
  
    CoRR, July, 2025
    
  
SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding.
    
  
    CoRR, March, 2025
    
  
    CoRR, February, 2025
    
  
Pheromone: Restructuring Serverless Computing With Data-Centric Function Orchestration.
    
  
    IEEE Trans. Netw., 2025
    
  
Torpor: GPU-Enabled Serverless Computing for Low-Latency, Resource-Efficient Inference.
    
  
    Proceedings of the 2025 USENIX Annual Technical Conference, 2025
    
  
    Proceedings of the 2025 USENIX Annual Technical Conference, 2025
    
  
  2024
    CoRR, 2024
    
  
  2023
    CoRR, 2023
    
  
Following the Data, Not the Function: Rethinking Function Orchestration in Serverless Computing.
    
  
    Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023
    
  
  2022
Enabling Cost-Effective, SLO-Aware Machine Learning Inference Serving on Public Cloud.
    
  
    IEEE Trans. Cloud Comput., 2022
    
  
  2021
    CoRR, 2021
    
  
CrystalPerf: Learning to Characterize the Performance of Dataflow Computation through Code Analysis.
    
  
    Proceedings of the 2021 USENIX Annual Technical Conference, 2021
    
  
Gillis: Serving Large Neural Networks in Serverless Functions with Automatic Model Partitioning.
    
  
    Proceedings of the 41st IEEE International Conference on Distributed Computing Systems, 2021
    
  
  2020
    Proceedings of the 39th IEEE Conference on Computer Communications, 2020
    
  
  2019
MArk: Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving.
    
  
    Proceedings of the 2019 USENIX Annual Technical Conference, 2019
    
  
  2018
    Proceedings of the ACM Symposium on Cloud Computing, 2018