Baotong Lu

Orcid: 0000-0002-0230-1048

According to our database1, Baotong Lu authored at least 20 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation.
CoRR, March, 2026

KEEP: A KV-Cache-Centric Memory Management System for Efficient Embodied Planning.
CoRR, February, 2026

RetroInfer: A Vector Storage Engine for Scalable Long-Context LLM Inference.
Proc. VLDB Endow., January, 2026

2025
Shard: A Scalable and Resize-optimized Hash Index on Disaggregated Memory.
Proc. VLDB Endow., December, 2025

Scalable Distributed Vector Search via Accuracy Preserving Index Construction.
CoRR, December, 2025

Towards Efficient and Scalable Distributed Vector Search with RDMA.
CoRR, July, 2025

Towards Robustness: A Critique of Current Vector Database Assessments.
CoRR, July, 2025

AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models.
CoRR, June, 2025

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference.
CoRR, May, 2025

Attribute Filtering in Approximate Nearest Neighbor Search: An In-depth Experimental Study.
Proc. ACM Manag. Data, 2025

SGDRC: Software-Defined Dynamic Resource Control for Concurrent DNN Inference on NVIDIA GPUs.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

2024
DEX: Scalable Range Indexing on Disaggregated Memory.
Proc. VLDB Endow., June, 2024

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval.
CoRR, 2024

Missile: Fine-Grained, Hardware-Level GPU Resource Isolation for Multi-Tenant DNN Inference.
CoRR, 2024

DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version].
CoRR, 2024

2022
Are Updatable Learned Indexes Ready?
Proc. VLDB Endow., 2022

2021
Scaling Dynamic Hash Tables on Real Persistent Memory.
SIGMOD Rec., 2021

APEX: A High-Performance Learned Index on Persistent Memory.
Proc. VLDB Endow., 2021

2020
Dash: Scalable Hashing on Persistent Memory.
Proc. VLDB Endow., 2020

High Performance Depthwise and Pointwise Convolutions on Mobile Devices.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020


  Loading...