Xuanhe Zhou

Orcid: 0000-0002-2285-7836

According to our database1, Xuanhe Zhou authored at least 79 papers between 2019 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
SetupX: Can LLM Agents Learn from Past Failures in Functionality-Correct Code Repository Setup?
CoRR, May, 2026

MinerU-Popo: Universal Post-Processing Model for Structured Document Parsing.
CoRR, May, 2026

MemReranker: Reasoning-Aware Reranking for Agent Memory Retrieval.
CoRR, May, 2026

Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies.
CoRR, May, 2026

Intern-Atlas: A Methodological Evolution Graph as Research Infrastructure for AI Scientists.
CoRR, April, 2026

Automating Database-Native Function Code Synthesis with LLMs.
CoRR, April, 2026

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale.
CoRR, April, 2026

AQD: Online Adaptive Query Dispatcher for HTAP Databases.
Proc. VLDB Endow., March, 2026

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing.
CoRR, March, 2026

Dial: A Knowledge-Grounded Dialect-Specific NL2SQL System.
CoRR, March, 2026

FireRed-OCR Technical Report.
CoRR, March, 2026

Disk-Resident Graph ANN Search: An Experimental Evaluation.
CoRR, March, 2026

DBAIOps: A Reasoning LLM-Enhanced Database Operation and Maintenance System using Knowledge Graphs.
Proc. VLDB Endow., February, 2026

The Trinity of Consistency as a Defining Principle for General World Models.
CoRR, February, 2026

MoDora: Tree-Based Semi-Structured Document Analysis System.
CoRR, February, 2026

Qute: Towards Quantum-Native Database.
CoRR, February, 2026

ByteHouse: A Cloud-Native OLAP Engine with Incremental Computation and Multi-Modal Retrieval.
CoRR, February, 2026

ST-Raptor: An Agentic System for Semi-Structured Table QA.
CoRR, February, 2026

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs.
CoRR, January, 2026

CrackSQL: A Hybrid Dialect Translation System Powered by LLM.
Proceedings of the Companion of the International Conference on Management of Data, 2026

FeatInsight: An Online ML Feature Management System on 4Paradigm Sage-Studio Platform.
Proceedings of the Companion of the International Conference on Management of Data, 2026

Vector Search for the Future: From Memory-Resident, Static Heterogeneous Storage, to Cloud-Native Architectures.
Proceedings of the Companion of the International Conference on Management of Data, 2026

ByteHouse: ByteDance's Cloud-Native Data Warehouse for Real-Time Multimodal Data Analytics.
Proceedings of the Companion of the International Conference on Management of Data, 2026

2025
Spatial Retrieval Augmented Autonomous Driving.
CoRR, December, 2025

OSM+: Billion-Level Open Street Map Data Processing System for City-wide Experiments.
CoRR, December, 2025

A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
CoRR, October, 2025

Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs.
CoRR, October, 2025

SurveyBench: Can LLM(-Agents) Write Academic Surveys that Align with Reader Needs?
CoRR, October, 2025

LLM/Agent-as-Data-Analyst: A Survey.
CoRR, September, 2025

PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation.
CoRR, September, 2025

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing.
CoRR, September, 2025

R-Bot: An LLM-based Query Rewrite System.
Proc. VLDB Endow., August, 2025

Cracking SQL Barriers: An LLM-based Dialect Translation System.
Proc. ACM Manag. Data, June, 2025

GaussMaster: An LLM-based Database Copilot System.
CoRR, June, 2025

A Survey of LLM ⨉ DATA.
CoRR, May, 2025

Tool Learning with Foundation Models.
ACM Comput. Surv., April, 2025

CrackSQL: A Hybrid SQL Dialect Translation System Powered by Large Language Models.
CoRR, April, 2025

FeatInsight: An Online ML Feature Management System on 4Paradigm Sage-Studio Platform.
CoRR, April, 2025

ST-Raptor: LLM-Powered Semi-Structured Table Question Answering.
Proc. ACM Manag. Data, 2025

OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML.
Proceedings of the Companion of the 2025 International Conference on Management of Data, 2025

D-Bot: An LLM-Powered DBA Copilot.
Proceedings of the Companion of the 2025 International Conference on Management of Data, 2025

Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

MemQ: A Graph-Based Query Memory Prediction Framework for Effective Workload Scheduling.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

2024
Automatic Index Tuning: A Survey.
IEEE Trans. Knowl. Data Eng., December, 2024

Chat2Data: An Interactive Data Analysis System with RAG, Vector Databases and LLMs.
Proc. VLDB Endow., August, 2024

LLM for Data Management.
Proc. VLDB Endow., August, 2024

Breaking It Down: An In-depth Study of Index Advisors.
Proc. VLDB Endow., June, 2024

D-Bot: Database Diagnosis System using Large Language Models.
Proc. VLDB Endow., June, 2024

DB-GPT: Large Language Model Meets Database.
Data Sci. Eng., March, 2024

Robustness of Updatable Learning-based Index Advisors against Poisoning Attack.
Proc. ACM Manag. Data, February, 2024

R-Bot: An LLM-based Query Rewrite System.
CoRR, 2024

Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation.
CoRR, 2024

LLM-Enhanced Data Management.
CoRR, 2024

The Efficacy of the Connect America Fund in Addressing US Internet Access Inequities.
Proceedings of the ACM SIGCOMM 2024 Conference, 2024

TRAP: Tailored Robustness Assessment for Index Advisors via Adversarial Perturbation.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023
Automatic Database Knob Tuning: A Survey.
IEEE Trans. Knowl. Data Eng., December, 2023

A Learned Query Rewrite System.
Proc. VLDB Endow., 2023

FEBench: A Benchmark for Real-Time Relational Data Feature Extraction.
Proc. VLDB Endow., 2023

Learned Index: A Comprehensive Experimental Evaluation.
Proc. VLDB Endow., 2023

Grep: A Graph Learning Based Database Partitioning System.
Proc. ACM Manag. Data, 2023

LLM As DBA.
CoRR, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
CoRR, 2023

Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Database Meets Artificial Intelligence: A Survey (Extended Abstract).
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

DBAugur: An Adversarial-based Trend Forecasting System for Diversified Workloads.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Demonstration of ViTA: Visualizing, Testing and Analyzing Index Advisors.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
Database Meets Artificial Intelligence: A Survey.
IEEE Trans. Knowl. Data Eng., 2022

LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

AutoIndex: An Incremental Index Management System for Dynamic Workloads.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Adaptive Code Learning for Spark Configuration Tuning.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Machine Learning for Data Management: A System View.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

2021
A Learned Query Rewrite System using Monte Carlo Tree Search.
Proc. VLDB Endow., 2021

DBMind: A Self-Driving Platform in openGauss.
Proc. VLDB Endow., 2021

openGauss: An Autonomous Database System.
Proc. VLDB Endow., 2021

AI Meets Database: AI4DB and DB4AI.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Machine Learning for Databases.
Proceedings of the AIMLSystems 2021: The First International Conference on AI-ML-Systems, Bangalore India, October 21, 2021

2020
Query Performance Prediction for Concurrent Queries using Graph Embedding.
Proc. VLDB Endow., 2020

2019
QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning.
Proc. VLDB Endow., 2019

XuanYuan: An AI-Native Database.
IEEE Data Eng. Bull., 2019


  Loading...