Neoklis Polyzotis

Orcid: 0000-0002-2694-8591

Affiliations:
  • University of California, Santa Cruz, USA


According to our database1, Neoklis Polyzotis authored at least 107 papers between 2002 and 2021.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2021
Validating Data and Models in Continuous ML Pipelines.
IEEE Data Eng. Bull., 2021

What can Data-Centric AI Learn from Data and ML Engineering?
CoRR, 2021

Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

2020
Automated Data Slicing for Model Validation: A Big Data - AI Integration Approach.
IEEE Trans. Knowl. Data Eng., 2020

Towards ML Engineering: A Brief History Of TensorFlow Extended (TFX).
CoRR, 2020

From Data to Models and Back.
Proceedings of the Fourth Workshop on Data Management for End-To-End Machine Learning, 2020

TensorFlow Data Validation: Data Analysis and Validation in Continuous ML Pipelines.
Proceedings of the 2020 International Conference on Management of Data, 2020

2019
Opportunities for Data Management Research in the Era of Horizontal AI/ML.
Proc. VLDB Endow., 2019

Improving Differentially Private Models with Active Learning.
CoRR, 2019

DEEM 2019: Workshop on Data Management for End-to-End Machine Learning.
Proceedings of the 2019 International Conference on Management of Data, 2019

Continuous Training for Production ML in the TensorFlow Extended (TFX) Platform.
Proceedings of the 2019 USENIX Conference on Operational Machine Learning, 2019

Data Validation for Machine Learning.
Proceedings of Machine Learning and Systems 2019, 2019

Slice Finder: Automated Data Slicing for Model Validation.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

2018
XML Selectivity Estimation.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Data Lifecycle Challenges in Production Machine Learning: A Survey.
SIGMOD Rec., 2018

Slice Finder: Automated Data Slicing for Model Validation.
CoRR, 2018

The Case for Learned Index Structures.
Proceedings of the 2018 International Conference on Management of Data, 2018

2017
Data Management Challenges in Production Machine Learning.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

TFX: A TensorFlow-Based Production-Scale Machine Learning Platform.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

2016
Managing Google's data lake: an overview of the Goods system.
IEEE Data Eng. Bull., 2016

Goods: Organizing Google's Datasets.
Proceedings of the 2016 International Conference on Management of Data, 2016

Efficient Techniques for Crowdsourced Top-k Lists.
Proceedings of the Fourth AAAI Conference on Human Computation and Crowdsourcing, 2016

2015
SEEDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics.
Proc. VLDB Endow., 2015

RITA: an index-tuning advisor for replicated databases.
Proceedings of the 27th International Conference on Scientific and Statistical Database Management, 2015

Oracle Workload Intelligence.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

2014
QueRIE: Collaborative Database Exploration.
IEEE Trans. Knowl. Data Eng., 2014

SEEDB: Automatically Generating Query Visualizations.
Proc. VLDB Endow., 2014

Optimal Crowd-Powered Rating and Filtering Algorithms.
Proc. VLDB Endow., 2014

MISO: souping up big data query processing with a multistore system.
Proceedings of the International Conference on Management of Data, 2014

Opportunistic physical design for big data analytics.
Proceedings of the International Conference on Management of Data, 2014

2013
Top-k queries over web applications.
VLDB J., 2013

Odyssey: A Multi-Store System for Evolutionary Analytics.
Proc. VLDB Endow., 2013

Exploiting Opportunistic Physical Design in Large-scale Data Analytics
CoRR, 2013

Iterative MapReduce for Large Scale Machine Learning
CoRR, 2013

Human-Powered Top-k Lists.
Proceedings of the 16th International Workshop on the Web and Databases 2013, 2013

Towards a workload for evolutionary analytics.
Proceedings of the Second Workshop on Data Analytics in the Cloud, 2013

SIDR: structure-aware intelligent data routing in Hadoop.
Proceedings of the International Conference for High Performance Computing, 2013

INUM+: A leaner, more accurate and more efficient fast what-if optimizer.
Proceedings of the Workshops Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Machine learning on Big Data.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

2012
An overview of the deco system: data model and query language; query processing and optimization.
SIGMOD Rec., 2012

Semi-Automatic Index Tuning: Keeping DBAs in the Loop.
Proc. VLDB Endow., 2012

Deco: A System for Declarative Crowdsourcing.
Proc. VLDB Endow., 2012

Declarative Systems for Large-Scale Machine Learning.
IEEE Data Eng. Bull., 2012

Scaling Datalog for Machine Learning on Big Data
CoRR, 2012

Max algorithms in crowdsourcing environments.
Proceedings of the 21st World Wide Web Conference 2012, 2012

CrowdScreen: algorithms for filtering data with humans.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Kaizen: a semi-automatic index advisor.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Divergent physical design tuning for replicated databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Processing of Rank Joins in Highly Distributed Systems.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Asking the Right Questions in Crowd Data Sourcing.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Deco: declarative crowdsourcing.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Predictable performance and high query concurrency for data analytics.
VLDB J., 2011

Human-assisted graph search: it's okay to ask questions.
Proc. VLDB Endow., 2011

CoPhy: A Scalable, Portable, and Interactive Index Advisor for Large Workloads.
Proc. VLDB Endow., 2011

Benchmarking Online Index-Tuning Algorithms.
IEEE Data Eng. Bull., 2011

The QueRIE system for Personalized Query Recommendations.
IEEE Data Eng. Bull., 2011

Web information management with access control.
Proceedings of the 14th International Workshop on the Web and Databases 2011, 2011

Skyline query processing over joins.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

SciHadoop: array-based query processing in Hadoop.
Proceedings of the Conference on High Performance Computing Networking, 2011

Social networking on top of the WebdamExchange system.
Proceedings of the 27th International Conference on Data Engineering, 2011

Map-reduce extensions and recursive queries.
Proceedings of the EDBT 2011, 2011

Answering Queries using Humans, Algorithms and Databases.
Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, 2011

Private Database Synthesis for Outsourced System Evaluation.
Proceedings of the 5th Alberto Mendelzon International Workshop on Foundations of Data Management, 2011

2010
Optimal algorithms for evaluating rank joins in database systems.
ACM Trans. Database Syst., 2010

Optimal Top-K Query Evaluation for Weighted Business Processes.
Proc. VLDB Endow., 2010

SQL QueRIE Recommendations.
Proc. VLDB Endow., 2010

An automated, yet interactive and portable DB designer.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

The Rank Join Problem.
Proceedings of the Search Computing, 2010

Trends in Rank Join.
Proceedings of the Search Computing, 2010

QueRIE: A Query Recommender System Supporting Interactive Database Exploration.
Proceedings of the ICDMW 2010, 2010

Cluster Computing, Recursion and Datalog.
Proceedings of the Datalog Reloaded - First International Workshop, 2010

2009
XML Selectivity Estimation.
Proceedings of the Encyclopedia of Database Systems, 2009

Depth estimation for ranking query optimization.
VLDB J., 2009

TuG synopses for approximate query answering.
ACM Trans. Database Syst., 2009

Report on the 10th international workshop on web information and data management (WIDM).
SIGMOD Rec., 2009

Index Interactions in Physical Design Tuning: Modeling, Analysis, and Applications.
Proc. VLDB Endow., 2009

Autocompletion for Mashups.
Proc. VLDB Endow., 2009

A Scalable, Predictable Join Operator for Highly Concurrent Data Warehouses.
Proc. VLDB Endow., 2009

Searching Shared Content in Communities with the Data Ring.
IEEE Data Eng. Bull., 2009

Query Recommendations for Interactive Database Exploration.
Proceedings of the Scientific and Statistical Database Management, 2009

Robust and efficient algorithms for rank join evaluation.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

A Benchmark for Online Index Selection.
Proceedings of the 25th International Conference on Data Engineering, 2009

Sketch-Based Summarization of Ordered XML Streams.
Proceedings of the 25th International Conference on Data Engineering, 2009

MatchUp: Autocompletion for Mashups.
Proceedings of the 25th International Conference on Data Engineering, 2009

2008
Very Large Databases.
Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

Meshing Streaming Updates with Persistent Data in an Active Data Warehouse.
IEEE Trans. Knowl. Data Eng., 2008

The repeatability experiment of SIGMOD 2008.
SIGMOD Rec., 2008

Report on the 9th international workshop on web information and data management (WIDM 2007).
SIGMOD Rec., 2008

Evaluating rank joins with optimal cost.
Proceedings of the Twenty-Seventh ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, 2008

XML processing in DHT networks.
Proceedings of the 24th International Conference on Data Engineering, 2008

2007
On-Line Index Selection for Shifting Workloads.
Proceedings of the 23rd International Conference on Data Engineering Workshops, 2007

Supporting Streaming Updates in an Active Data Warehouse.
Proceedings of the 23rd International Conference on Data Engineering, 2007

The Data Ring: Community Content Sharing.
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

2006
XSKETCH synopses for XML data graphs.
ACM Trans. Database Syst., 2006

AQAX: A System for Approximate XML Query Answers.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

Graph-based synopses for relational selectivity estimation.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

COLT: continuous on-line tuning.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

XCluster Synopses for Structured XML Content.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Data Ring: Let Us Turn the Net into a Database!
Proceedings of the Advances in Databases and Information Systems, 2006

2005
Searching a file system using inferred semantic links.
Proceedings of the HYPERTEXT 2005, 2005

Selectivity-based partitioning: a divide-and-union paradigm for effective query optimization.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

2004
Fractional XSketch Synopses for XML Databases.
Proceedings of the Database and XML Technologies, 2004

Approximate XML Query Answers.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Selectivity Estimation for XML Twigs.
Proceedings of the 20th International Conference on Data Engineering, 2004

2003
Speculative Query Processing.
Proceedings of the First Biennial Conference on Innovative Data Systems Research, 2003

2002
Structure and Value Synopses for XML Data Graphs.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

Statistical synopses for graph-structured XML databases.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002


  Loading...