Baotong Lu

Orcid: 0000-0002-0230-1048

According to our database¹, Baotong Lu authored at least 22 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Unifying Sparse Attention with Hierarchical Memory for Scalable Long-Context LLM Serving.

[BibT_eX]

[DOI]

CoRR, April, 2026

DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation.

[BibT_eX]

[DOI]

CoRR, March, 2026

Corrigendum: Attribute Filtering in Approximate Nearest Neighbor Search: An In-depth Experimental Study: [Experiments & Analysis].

[BibT_eX]

[DOI]

Proc. ACM Manag. Data, February, 2026

KEEP: A KV-Cache-Centric Memory Management System for Efficient Embodied Planning.

[BibT_eX]

[DOI]

CoRR, February, 2026

RetroInfer: A Vector Storage Engine for Scalable Long-Context LLM Inference.

[BibT_eX]

[DOI]

Proc. VLDB Endow., January, 2026

2025

Shard: A Scalable and Resize-optimized Hash Index on Disaggregated Memory.

[BibT_eX]

[DOI]

Proc. VLDB Endow., December, 2025

Scalable Distributed Vector Search via Accuracy Preserving Index Construction.

[BibT_eX]

[DOI]

CoRR, December, 2025

Towards Efficient and Scalable Distributed Vector Search with RDMA.

[BibT_eX]

[DOI]

CoRR, July, 2025

Towards Robustness: A Critique of Current Vector Database Assessments.

[BibT_eX]

[DOI]

CoRR, July, 2025

AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference.

[BibT_eX]

[DOI]

CoRR, May, 2025

Attribute Filtering in Approximate Nearest Neighbor Search: An In-depth Experimental Study.

[BibT_eX]

[DOI]

Proc. ACM Manag. Data, 2025

SGDRC: Software-Defined Dynamic Resource Control for Concurrent DNN Inference on NVIDIA GPUs.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

2024

DEX: Scalable Range Indexing on Disaggregated Memory.

[BibT_eX]

[DOI]

Proc. VLDB Endow., June, 2024

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval.

[BibT_eX]

[DOI]

CoRR, 2024

Missile: Fine-Grained, Hardware-Level GPU Resource Isolation for Multi-Tenant DNN Inference.

[BibT_eX]

[DOI]

CoRR, 2024

DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version].

[BibT_eX]

[DOI]

CoRR, 2024

2022

Are Updatable Learned Indexes Ready?

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2022

2021

Scaling Dynamic Hash Tables on Real Persistent Memory.

[BibT_eX]

[DOI]

SIGMOD Rec., 2021

APEX: A High-Performance Learned Index on Persistent Memory.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2021

2020

Dash: Scalable Hashing on Persistent Memory.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2020

High Performance Depthwise and Pointwise Convolutions on Mobile Devices.

[BibT_eX]

[DOI]

Pengfei Zhang

Eric Lo

Baotong Lu

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Baotong Lu

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...