Feifei Li

Orcid: 0009-0003-0770-5775

Affiliations:
  • Alibaba Group, Sunnyvale, CA, USA
  • University of Utah, School of Computing, Salt Lake City, UT, USA
  • Boston University, Computer Science Department, MA, USA (PhD 2007)


According to our database1, Feifei Li authored at least 185 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Sub-trajectory clustering with deep reinforcement learning.
VLDB J., May, 2024

Towards Practical Oblivious Join Processing.
IEEE Trans. Knowl. Data Eng., April, 2024

SeRF: Segment Graph for Range-Filtering Approximate Nearest Neighbor Search.
Proc. ACM Manag. Data, February, 2024

2023
Rethinking Learned Cost Models: Why Start from Scratch?
Proc. ACM Manag. Data, December, 2023

Rethink Query Optimization in HTAP Databases.
Proc. ACM Manag. Data, December, 2023

TEE-based General-purpose Computational Backend for Secure Delegated Data Processing.
Proc. ACM Manag. Data, December, 2023

An Efficient Transfer Learning Based Configuration Adviser for Database Tuning.
Proc. VLDB Endow., November, 2023

SmartLite: A DBMS-based Serving System for DNN Inference in Resource-constrained Environments.
Proc. VLDB Endow., November, 2023

Secure Sampling for Approximate Multi-party Query Processing.
Proc. ACM Manag. Data, September, 2023

Learning-based query optimization for multi-probe approximate nearest neighbor search.
VLDB J., May, 2023

ROVEC: Runtime Optimization of Vectorized Expression Evaluation for Column Store.
IEEE Trans. Knowl. Data Eng., March, 2023

SimpleTS: An Efficient and Universal Model Selection Framework for Time Series Forecasting.
Proc. VLDB Endow., 2023

PolarDB-SCC: A Cloud-Native Database Ensuring Low Latency for Strongly Consistent Reads.
Proc. VLDB Endow., 2023

Ganos Aero: A Cloud-Native System for Big Raster Data Management and Processing.
Proc. VLDB Endow., 2023

Real-time Workload Pattern Analysis for Large-scale Cloud Databases.
Proc. VLDB Endow., 2023

Lindorm TSDB: A Cloud-native Time-series Database for Large-scale Monitoring Systems.
Proc. VLDB Endow., 2023

Anser: Adaptive Information Sharing Framework of AnalyticDB.
Proc. VLDB Endow., 2023

Eigen: End-to-end Resource Optimization for Large-Scale Databases on the Cloud.
Proc. VLDB Endow., 2023

OneShotSTL: One-Shot Seasonal-Trend Decomposition For Online Time Series Anomaly Detection And Forecasting.
Proc. VLDB Endow., 2023

CatSQL: Towards Real World Natural Language to SQL Applications.
Proc. VLDB Endow., 2023

Modernization of Databases in the Cloud Era: Building Databases that Run Like Legos.
Proc. VLDB Endow., 2023

A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning.
Proc. ACM Manag. Data, 2023

PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba.
Proc. ACM Manag. Data, 2023

Detecting Logic Bugs of Join Optimizations in DBMS.
Proc. ACM Manag. Data, 2023

Knock Out 2PC with Practicality Intact: a High-performance and General Distributed Transaction Protocol (Technical Report).
CoRR, 2023

A Bayesian approach for bandit online optimization with switching cost.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Encrypted Databases Made Secure Yet Maintainable.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

EulerFD: An Efficient Double-Cycle Approximation of Functional Dependencies.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Knock Out 2PC with Practicality Intact: a High-performance and General Distributed Transaction Protocol.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Active Sampling for Sparse Table by Bayesian Optimization with Adaptive Resolution.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Persistent Memory Disaggregation for Cloud-Native Relational Databases.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

ShapleyIQ: Influence Quantification by Shapley Values for Performance Debugging of Microservices.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Efficient Oblivious Query Processing for Range and kNN Queries.
IEEE Trans. Knowl. Data Eng., 2022

AB-tree: Index for Concurrent Random Sampling and Updates.
Proc. VLDB Endow., 2022

SA-LSM : Optimize Data Layout for LSM-tree Based Storage using Survival Analysis.
Proc. VLDB Endow., 2022

Facilitating Database Tuning with Hyper-Parameter Optimization: A Comprehensive Experimental Evaluation.
Proc. VLDB Endow., 2022

Ganos: A Multidimensional, Dynamic, and Scene-Oriented Cloud-Native Spatial Database Engine.
Proc. VLDB Endow., 2022

Operon: An Encrypted Database for Ownership-Preserving Data Management.
Proc. VLDB Endow., 2022

HEDA: Multi-Attribute Unbounded Aggregation over Homomorphically Encrypted Database.
Proc. VLDB Endow., 2022

VRE: A Versatile, Robust, and Economical Trajectory Data System.
Proc. VLDB Endow., 2022

Tair-PMem: a Fully Durable Non-Volatile Memory Database.
Proc. VLDB Endow., 2022

CloudJump: Optimizing Cloud Databases for Cloud Storages.
Proc. VLDB Endow., 2022

LPC-AD: Fast and Accurate Multivariate Time Series Anomaly Detection via Latent Predictive Coding.
CoRR, 2022

A Sampling-based Learning Framework for Big Databases.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Towards Dynamic and Safe Configuration Tuning for Cloud Databases.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

ESDB: Processing Extremely Skewed Workloads in Real-time.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

PreQR: Pre-training Representation for SQL Understanding.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Remus: Efficient Live Migration for Distributed Databases with Snapshot Isolation.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Towards Practical Oblivious Join.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Releasing Private Data for Numerical Queries.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Ubiquitous Verification in Centralized Ledger Database.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

PinSQL: Pinpoint Root Cause SQLs to Resolve Performance Issues in Cloud Databases.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

A Comparative Study of in-Database Inference Approaches.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Efficient and Oblivious Query Processing for Range and kNN Queries (Extended Abstract).
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

PolarDB-X: An Elastic Distributed Relational Database for Cloud-Native Applications.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

2021
Semantic embedding for regions of interest.
VLDB J., 2021

Towards Cost-Effective and Elastic Cloud Database Deployment via Memory Disaggregation.
Proc. VLDB Endow., 2021

Revisiting the Design of LSM-tree Based OLTP Storage Engine with Persistent Memory.
Proc. VLDB Endow., 2021

Cquirrel: Continuous Query Processing over Acyclic Relational Schemas.
Proc. VLDB Endow., 2021

Building Enclave-Native Storage Engines for Practical Encrypted Databases.
Proc. VLDB Endow., 2021

Database Workload Characterization with Query Plan Encoders.
Proc. VLDB Endow., 2021

VeriDB: An SGX-based Verifiable Database.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

ResTune: Resource Oriented Tuning Boosted by Meta-Learning for Cloud Databases.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

At-the-time and Back-in-time Persistent Sketches.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Weighted Distinct Sampling: Cardinality Estimation for SPJ Queries.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

PolarDB Serverless: A Cloud Native Database for Disaggregated Data Centers.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

LogStore: A Cloud-Native and Multi-Tenant Log Database.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Spatial Independent Range Sampling.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Constrained Non-Affine Alignment of Embeddings.
Proceedings of the IEEE International Conference on Data Mining, 2021

SLIMSTORE: A Cloud-based Deduplication System for Multi-version Backups.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

2020
LedgerDB: A Centralized Ledger Database for Universal Audit and Verification.
Proc. VLDB Endow., 2020

Leaper: A Learned Prefetcher for Cache Invalidation in LSM-tree based Storage Engines.
Proc. VLDB Endow., 2020

AnalyticDB-V: A Hybrid Analytical Engine Towards Query Fusion for Structured and Unstructured Data.
Proc. VLDB Endow., 2020

Diagnosing Root Causes of Intermittent Slow Queries in Large-Scale Cloud Databases.
Proc. VLDB Endow., 2020

Efficient Join Synopsis Maintenance for Data Warehouse.
Proceedings of the 2020 International Conference on Management of Data, 2020

FalconDB: Blockchain-based Collaborative Database.
Proceedings of the 2020 International Conference on Management of Data, 2020

Timon: A Timestamped Event Database for Efficient Telemetry Data Processing and Analytics.
Proceedings of the 2020 International Conference on Management of Data, 2020

Two-Level Data Compression using Machine Learning in Time Series Database.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

HybrIDX: New Hybrid Index for Volume-hiding Range Queries in Data Outsourcing Services.
Proceedings of the 40th IEEE International Conference on Distributed Computing Systems, 2020

FPGA-Accelerated Compactions for LSM-based Key-Value Store.
Proceedings of the 18th USENIX Conference on File and Storage Technologies, 2020

HotRing: A Hotspot-Aware In-Memory Key-Value Store.
Proceedings of the 18th USENIX Conference on File and Storage Technologies, 2020

2019
SolarDB: Toward a Shared-Everything Database on Distributed Log-Structured Storage.
ACM Trans. Storage, 2019

Spell: Online Streaming Parsing of Large Unstructured System Logs.
IEEE Trans. Knowl. Data Eng., 2019

AnalyticDB: Real-time OLAP Database System at Alibaba Cloud.
Proc. VLDB Endow., 2019

iBTune: Individualized Buffer Tuning for Large-scale Cloud Databases.
Proc. VLDB Endow., 2019

Cloud native database systems at Alibaba: Opportunities and Challenges.
Proc. VLDB Endow., 2019

CATIRI: An Efficient Method for Content-and-Text Based Image Retrieval.
J. Comput. Sci. Technol., 2019

Privacy-Preserving Outsourced Speech Recognition for Smart IoT Devices.
IEEE Internet Things J., 2019

Feature Detection and Attenuation in Embeddings.
CoRR, 2019

Pcard: Personalized Restaurants Recommendation from Card Payment Transaction Records.
Proceedings of the World Wide Web Conference, 2019

X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing.
Proceedings of the 2019 International Conference on Management of Data, 2019

Bursty Event Detection Throughout Histories.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

AI Pro: Data Processing Framework for AI Models.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Narrowing the Gap Between Serverless and its State with Storage Functions.
Proceedings of the ACM Symposium on Cloud Computing, SoCC 2019, 2019

2018
Preface to the special issue on advances in Spatio-temporal data analysis and management.
GeoInformatica, 2018

Solar: Towards a Shared-Everything Database on Distributed Log-Structured Storage.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

Random Sampling over Joins Revisited.
Proceedings of the 2018 International Conference on Management of Data, 2018

Persistent Bloom Filter: Membership Testing for the Entire History.
Proceedings of the 2018 International Conference on Management of Data, 2018

OpenTag: Open Attribute Value Extraction from Product Profiles.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Secure DIMM: Moving ORAM Primitives Closer to Memory.
Proceedings of the IEEE International Symposium on High Performance Computer Architecture, 2018

2017
Trip Planning Queries in Road Network Databases.
Proceedings of the Encyclopedia of GIS., 2017

Preface to the special issue on big data search and mining.
World Wide Web, 2017

ATOM: Efficient Tracking, Monitoring, and Orchestration of Cloud Resources.
IEEE Trans. Parallel Distributed Syst., 2017

Wander Join and XDB: Online Aggregation via Random Walks.
SIGMOD Rec., 2017

Distributed Trajectory Similarity Search.
Proc. VLDB Endow., 2017

Compass: Spatio Temporal Sentiment Analysis of US Election What Twitter Says!
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

DeepLog: Anomaly Detection and Diagnosis from System Logs through Deep Learning.
Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, 2017

2016
Exact and approximate flexible aggregate similarity search.
VLDB J., 2016

Oblivious RAM: A Dissection and Experimental Evaluation.
Proc. VLDB Endow., 2016

Simba: Efficient In-Memory Spatial Analytics.
Proceedings of the 2016 International Conference on Management of Data, 2016

Matrix Sketching Over Sliding Windows.
Proceedings of the 2016 International Conference on Management of Data, 2016

Graph Analytics Through Fine-Grained Parallelism.
Proceedings of the 2016 International Conference on Management of Data, 2016

Wander Join: Online Aggregation for Joins.
Proceedings of the 2016 International Conference on Management of Data, 2016

Privacy Preserving Subgraph Matching on Large Graphs in Cloud.
Proceedings of the 2016 International Conference on Management of Data, 2016

Wander Join: Online Aggregation via Random Walks.
Proceedings of the 2016 International Conference on Management of Data, 2016

Fast and Concurrent RDF Queries with RDMA-Based Distributed Graph Exploration.
Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation, 2016

Spell: Streaming Parsing of System Event Logs.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Simba: spatial in-memory big data analysis.
Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS 2016, Burlingame, California, USA, October 31, 2016

2015
Spatial Online Sampling and Aggregation.
Proc. VLDB Endow., 2015

Distributed Online Tracking.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

STORM: Spatio-Temporal Online Reasoning and Management of Large Spatio-Temporal Data.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Fixed-function hardware sorting accelerators for near data MapReduce execution.
Proceedings of the 33rd IEEE International Conference on Computer Design, 2015

ATOM: Automated tracking, orchestration and monitoring of resource usage in infrastructure as a service systems.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014
Dynamic monitoring of optimal locations in road network databases.
VLDB J., 2014

Scalable Keyword Search on Large RDF Data.
IEEE Trans. Knowl. Data Eng., 2014

Continuous Matrix Approximation on Distributed Data.
Proc. VLDB Endow., 2014

Comparing Implementations of Near-Data Computing with In-Memory MapReduce Workloads.
IEEE Micro, 2014

Scalable data summarization on big data.
Distributed Parallel Databases, 2014

Scalable histograms on large probabilistic data.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

NDC: Analyzing the impact of 3D-stacked memory+logic devices on MapReduce workloads.
Proceedings of the 2014 IEEE International Symposium on Performance Analysis of Systems and Software, 2014

2013
Spatial Approximate String Search.
IEEE Trans. Knowl. Data Eng., 2013

Rethinking Abstractions for Big Data: Why, Where, How, and What.
CoRR, 2013

Quality and efficiency for kernel density estimates in large data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Optimal splitters for temporal and multi-version databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Adaptive log compression for massive log data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Secure nearest neighbor revisited.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

CloudDB 2013: fifth international workshop on cloud data management.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

LogKV: Exploiting Key-Value Stores for Log Processing.
Proceedings of the Sixth Biennial Conference on Innovative Data Systems Research, 2013

2012
Query Access Assurance in Outsourced Databases.
IEEE Trans. Serv. Comput., 2012

Ranking Large Temporal Data.
Proc. VLDB Endow., 2012

ColumbuScout: towards building local search engines over large databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Efficient Threshold Monitoring for Distributed Probabilistic Data.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Scalable Multi-query Optimization for SPARQL.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Towards Fair Sharing of Block Storage in a Multi-tenant Cloud.
Proceedings of the 4th USENIX Workshop on Hot Topics in Cloud Computing, 2012

Efficient parallel kNN joins for large data in MapReduce.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

2011
The World in a Nutshell: Concise Range Queries.
IEEE Trans. Knowl. Data Eng., 2011

Group Enclosing Queries.
IEEE Trans. Knowl. Data Eng., 2011

Semantics of Ranking Queries for Probabilistic Data.
IEEE Trans. Knowl. Data Eng., 2011

Building Wavelet Histograms on Large Data in MapReduce.
Proc. VLDB Endow., 2011

Rewriting queries on SPARQL views.
Proceedings of the 20th International Conference on World Wide Web, 2011

Flexible aggregate similarity search.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Optimal location queries in road network databases.
Proceedings of the 27th International Conference on Data Engineering, 2011

Multi-approximate-keyword routing in GIS data.
Proceedings of the 19th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, 2011

2010
Top-<i>k</i> queries on temporal data.
VLDB J., 2010

Authenticated Index Structures for Aggregation Queries.
ACM Trans. Inf. Syst. Secur., 2010

Logging every footstep: quantile summaries for the entire history.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Probabilistic string similarity joins.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

K nearest neighbor queries and kNN-Joins in large relational databases (almost) for free.
Proceedings of the 26th International Conference on Data Engineering, 2010

Approximate string search in spatial databases.
Proceedings of the 26th International Conference on Data Engineering, 2010

2009
Small synopses for group-by query verification on outsourced data streams.
ACM Trans. Database Syst., 2009

Robust approximate aggregation in sensor data management systems.
ACM Trans. Database Syst., 2009

Ranking distributed probabilistic data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

A Concise Representation of Range Queries.
Proceedings of the 25th International Conference on Data Engineering, 2009

Reverse Furthest Neighbors in Spatial Databases.
Proceedings of the 25th International Conference on Data Engineering, 2009

Improving Transaction-Time DBMS Performance and Functionality.
Proceedings of the 25th International Conference on Data Engineering, 2009

Semantics of Ranking Queries for Probabilistic Data and Expected Ranks.
Proceedings of the 25th International Conference on Data Engineering, 2009

2008
Trip Planning Queries in Road Network Databases.
Proceedings of the Encyclopedia of GIS., 2008

Authenticated Index Structures for Outsourced Databases.
Proceedings of the Handbook of Database Security - Applications and Trends, 2008

Efficient Processing of Top-k Queries in Uncertain Databases with x-Relations.
IEEE Trans. Knowl. Data Eng., 2008

Finding frequent items in probabilistic data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Efficient Processing of Top-k Queries in Uncertain Databases.
Proceedings of the 24th International Conference on Data Engineering, 2008

Randomized Synopses for Query Assurance on Data Streams.
Proceedings of the 24th International Conference on Data Engineering, 2008

2007
Time Series Compressibility and Privacy.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Proof-Infused Streams: Enabling Authentication of Sliding Window Queries On Streams.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Hiding in the Crowd: Privacy Preservation on Evolving Streams through Correlation Tracking.
Proceedings of the 23rd International Conference on Data Engineering, 2007

2006
Dynamic authenticated index structures for outsourced databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

Characterizing and Exploiting Reference Locality in Data Stream Applications.
Proceedings of the 22nd International Conference on Data Engineering, 2006

2005
Robust Aggregation in Sensor Networks.
IEEE Data Eng. Bull., 2005

On Trip Planning Queries in Spatial Databases.
Proceedings of the Advances in Spatial and Temporal Databases, 9th International Symposium, 2005

2004
Towards building logical views of websites.
Data Knowl. Eng., 2004

Spatio-Temporal Aggregation Using Sketches.
Proceedings of the 20th International Conference on Data Engineering, 2004

Approximate Aggregation Techniques for Sensor Databases.
Proceedings of the 20th International Conference on Data Engineering, 2004

2002
A visual tool for building logical data models of websites.
Proceedings of the Fourth ACM CIKM International Workshop on Web Information and Data Management (WIDM 2002), 2002

Wiccap Data Model: Mapping Physical Websites to Logical Views.
Proceedings of the Conceptual Modeling, 2002

2001
An Information Concierge for the Web.
Proceedings of the 12th International Workshop on Database and Expert Systems Applications (DEXA 2001), 2001


  Loading...