Chaoyi Ruan

According to our database1, Chaoyi Ruan authored at least 23 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Accelerating Long-Tail Generation in Synchronous RLHF Training via Adaptive Tensor Parallelism.
CoRR, May, 2026

GLPilot: Efficient Distributed GNN Training With Learnable Embeddings.
IEEE Trans. Parallel Distributed Syst., February, 2026

Lagom: Unleashing the Power of Communication and Computation Overlapping for Distributed LLM Training.
CoRR, February, 2026

RL over Commodity Networks: Overcoming the Bandwidth Barrier with Lossless Sparse Deltas.
CoRR, February, 2026

Revisiting Parameter Server in LLM Post-Training.
CoRR, January, 2026

Libra: Flexible Request Partitioning and Scheduling for Serving Unbalanced and Dynamic LLM Workloads.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

Cortex: Achieving Low-Latency, Cost-Efficient Remote Data Access For LLM via Semantic-Aware Knowledge Caching.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

2025
Reaching Agreement Among Reasoning LLM Agents.
CoRR, December, 2025

Asteria: Semantic-Aware Cross-Region Caching for Agentic LLM Tool Access.
CoRR, September, 2025

PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models.
CoRR, May, 2025

HCT-QA: A Benchmark for Question Answering on Human-Centric Tables.
CoRR, April, 2025

DynaServe: Unified and Elastic Tandem-Style Execution for Dynamic Disaggregated LLM Serving.
CoRR, April, 2025

Fanar: An Arabic-Centric Multimodal Generative AI Platform.
CoRR, January, 2025

DHeLlam: General-Purpose, Automatic Micro-Batch Co-Execution for Distributed LLM Training.
Proceedings of the 43rd IEEE International Conference on Computer Design, 2025

PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

2024
PolyBase: Adapting to Data Affinity Changes in Geo-Replicated Database via Row-Level Paxos-Group Affiliation Re-Assignment.
Proc. VLDB Endow., November, 2024

Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution.
CoRR, 2024

2023
Persistent Memory Disaggregation for Cloud-Native Relational Databases.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2021
Leveraging NVMe SSDs for Building a Fast, Cost-effective, LSM-tree-based KV Store.
ACM Trans. Storage, 2021

Towards Cost-Effective and Elastic Cloud Database Deployment via Memory Disaggregation.
Proc. VLDB Endow., 2021

SpanDB: A Fast, Cost-Effective LSM-tree Based KV Store on Hybrid Storage.
Proceedings of the 19th USENIX Conference on File and Storage Technologies, 2021

Achieving low tail-latency and high scalability for serializable transactions in edge computing.
Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

2018
AppDNA: App Behavior Profiling via Graph-based Deep Learning.
Proceedings of the 2018 IEEE Conference on Computer Communications, 2018


  Loading...