Ruwen Fan

Orcid: 0009-0002-3590-7473

According to our database, Ruwen Fan authored at least 6 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Swarm: Co-Activation Aware KVCache Offloading Across Multiple SSDs.
CoRR, March, 2026

Discard-Based Garbage Collection for Distributed Log-Structured Storage Systems in ByteDance.
Proceedings of the 24th USENIX Conference on File and Storage Technologies, 2026

2025
GPREEMPT: GPU Preemptive Scheduling Made General and Efficient.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

Neuralink: Fast on-Device LLM Inference with Neuron Co-Activation Linking.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024
Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management.
CoRR, 2024

MaxEmbed: Maximizing SSD bandwidth utilization for huge embedding models serving.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

