Christopher Olston

  • Carnegie Mellon University, Pittsburgh, USA

According to our database, Christopher Olston authored at least 55 papers between 1998 and 2020.

Learned Indexes for a Google-scale Disk-based Database.
CoRR, 2020

TensorFlow-Serving: Flexible, High-Performance ML Serving.
CoRR, 2017

Managing Google's data lake: an overview of the Goods system.
IEEE Data Eng. Bull., 2016

Goods: Organizing Google's Datasets.
Proceedings of the 2016 International Conference on Management of Data, 2016

Yedalog: Exploring Knowledge at Scale.
Proceedings of the 1st Summit on Advances in Programming Languages, 2015

The Beckman Report on Database Research.
SIGMOD Rec., 2014

SpongeFiles: mitigating data skew in mapreduce using distributed memory.
Proceedings of the International Conference on Management of Data, 2014

Inspector Gadget: A Framework for Custom Monitoring and Debugging of Distributed Dataflows.
Proc. VLDB Endow., 2011

Search result diversity for informational queries.
Proceedings of the 20th International Conference on World Wide Web, 2011

Nova: continuous Pig/Hadoop workflows.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

CoScan: cooperative scan sharing in the cloud.
Proceedings of the ACM Symposium on Cloud Computing in conjunction with SOSP 2011, 2011

Ibis: A Provenance Manager for Multi-Layer Systems.
Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, 2011

Web Crawling.
Found. Trends Inf. Retr., 2010

Stateful bulk processing for incremental analytics.
Proceedings of the 1st ACM Symposium on Cloud Computing, 2010

Building a High-Level Dataflow System on top of MapReduce: The Pig Experience.
Proc. VLDB Endow., 2009

Generating example data for dataflow programs.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

Interactive Analysis of Web-Scale Data.
Proceedings of the Fourth Biennial Conference on Innovative Data Systems Research, 2009

Scalable query result caching for web applications.
Proc. VLDB Endow., 2008

Relaxation in text search using taxonomies.
Proc. VLDB Endow., 2008

Scheduling shared scans of large data files.
Proc. VLDB Endow., 2008

Recrawl scheduling based on information longevity.
Proceedings of the 17th International Conference on World Wide Web, 2008

Crawl ordering by search impact.
Proceedings of the International Conference on Web Search and Web Data Mining, 2008

Automatic Optimization of Parallel Dataflow Programs.
Proceedings of the 2008 USENIX Annual Technical Conference, 2008

Pig Latin: a not-so-foreign language for data processing.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Parallel Evaluation of Composite Aggregate Queries.
Proceedings of the 24th International Conference on Data Engineering, 2008

Computing shortest paths with uncertainty.
J. Algorithms, 2007

Navigation-aided retrieval.
Proceedings of the 16th International Conference on World Wide Web, 2007

The discoverability of the web.
Proceedings of the 16th International Conference on World Wide Web, 2007

Configurations: a model for distributed data storage.
Proceedings of the Twenty-Sixth Annual ACM Symposium on Principles of Distributed Computing, 2007

Invalidation Clues for Database Scalability Services.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Simultaneous scalability and security for data-intensive web applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

Handling Advertisements of Unknown Quality in Search Advertising.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Efficient Monitoring and Querying of Distributed, Dynamic Data via Approximate Replication.
IEEE Data Eng. Bull., 2005

User-centric Web crawling.
Proceedings of the 14th international conference on World Wide Web, 2005

Shuffling a Stacked Deck: The Case for Partially Randomized Ranking of Search Engine Results.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Finding (Recently) Frequent Items in Distributed Data Streams.
Proceedings of the 21st International Conference on Data Engineering, 2005

A Scalability Service for Dynamic Web Applications.
Proceedings of the Second Biennial Conference on Innovative Data Systems Research, 2005

What's new on the web?: the evolution of the web from a search engine perspective.
Proceedings of the 13th international conference on World Wide Web, 2004

WIC: A General-Purpose Algorithm for Monitoring Web Information Sources.
Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

Approximate replication.
PhD thesis, 2003

Computing the Median with Uncertainty.
SIAM J. Comput., 2003

ScentTrails: integrating browsing and searching on the Web.
Interactions, 2003

Adaptive Filters for Continuous Queries over Distributed Data Streams.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

Distributed Top-K Monitoring.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

Query Processing, Approximation, and Resource Management in a Data Stream Management System.
Proceedings of the First Biennial Conference on Innovative Data Systems Research, 2003

Best-effort cache synchronization with source cooperation.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Visualizing Data with Bounded Uncertainty.
Proceedings of the 2002 IEEE Symposium on Information Visualization (InfoVis 2002), 27 October, 2002

DataSplash: A Direct Manipulation Environment for Programming Semantic Zoom Visualizations of Tabular Data.
J. Vis. Lang. Comput., 2001

Adaptive Precision Setting for Cached Approximate Values.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Offering a Precision-Performance Tradeoff for Aggregation Queries over Replicated Data.
Proceedings of VLDB 2000, 2000

Getting Portals to Behave.
Proceedings of the IEEE Symposium on Information Visualization 2000 (INFOVIS'00), 2000

Interactive Data Analysis: The Control Project.
Computer, 1999

VIQING: Visual Interactive Querying.
Proceedings of the 1998 IEEE Symposium on Visual Languages, 1998

CONTROL: Continuous Output and Navigation Technology with Refinement On-Line.
Proceedings of SIGMOD 1998, 1998
