Tim Kraska

According to our database1, Tim Kraska authored at least 133 papers between 2006 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2019
Rethinking Database High Availability with RDMA Networks.
PVLDB, 2019

Choosing A Cloud DBMS: Architectures and Tradeoffs.
PVLDB, 2019

Tuplex: Robust, Efficient Analytics When Python Rules.
PVLDB, 2019

Neo: A Learned Query Optimizer.
PVLDB, 2019

LISA: Towards Learned DNA Sequence Search.
CoRR, 2019

Sherlock: A Deep Learning Approach to Semantic Data Type Detection.
CoRR, 2019

VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository.
CoRR, 2019

Neo: A Learned Query Optimizer.
CoRR, 2019

SysML: The New Frontier of Machine Learning Systems.
CoRR, 2019

How I Learned to Stop Worrying and Love Re-optimization.
CoRR, 2019

Custodes: Auditable Hypothesis Testing.
CoRR, 2019

SchengenDB: A Data Protection Database Proposal.
Proceedings of the Heterogeneous Data Management, Polystores, and Analytics for Healthcare, 2019

Democratizing Data Science through Interactive Curation of ML Pipelines.
Proceedings of the 2019 International Conference on Management of Data, 2019

From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems.
Proceedings of the 2019 International Conference on Management of Data, 2019

FITing-Tree: A Data-aware Index Structure.
Proceedings of the 2019 International Conference on Management of Data, 2019

Designing Distributed Tree-based Index Structures for Fast RDMA-capable Networks.
Proceedings of the 2019 International Conference on Management of Data, 2019

Sherlock: A Deep Learning Approach to Semantic Data Type Detection.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

How I Learned to Stop Worrying and Love Re-optimization.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Slice Finder: Automated Data Slicing for Model Validation.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

SageDB: A Learned Database System.
Proceedings of the CIDR 2019, 2019

VizNet: Towards A Large-Scale Visualization Learning and Benchmarking Repository.
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

VizML: A Machine Learning Approach to Visualization Recommendation.
Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems, 2019

2018
Distributed Machine Learning.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Estimating the Impact of Unknown Unknowns on Aggregate Query Results.
ACM Trans. Database Syst., 2018

Northstar: An Interactive Data Science System.
PVLDB, 2018

Towards Quantifying Uncertainty in Data Analysis & Exploration.
IEEE Data Eng. Bull., 2018

Chiller: Contention-centric Transaction Execution and Data Partitioning for Fast Networks.
CoRR, 2018

VizRec: A framework for secure data exploration via visual representation.
CoRR, 2018

Unknown Examples & Machine Learning Model Generalization.
CoRR, 2018

VizML: A Machine Learning Approach to Visualization Recommendation.
CoRR, 2018

Slice Finder: Automated Data Slicing for Model Validation.
CoRR, 2018

Smallify: Learning Network Size while Training.
CoRR, 2018

IDEBench: A Benchmark for Interactive Data Exploration.
CoRR, 2018

A-Tree: A Bounded Approximate Index Structure.
CoRR, 2018

SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks.
CoRR, 2018

FastDAWG: Improving Data Migration in the BigDAWG Polystore System.
Proceedings of the Heterogeneous Data Management, Polystores, and Analytics for Healthcare, 2018

The Case for Learned Index Structures.
Proceedings of the 2018 International Conference on Management of Data, 2018

Towards Interactive Curation & Automatic Tuning of ML Pipelines.
Proceedings of the Second Workshop on Data Management for End-To-End Machine Learning, 2018

Superneurons: dynamic GPU memory management for training deep neural networks.
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018

Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis.
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018

2017
How Progressive Visualizations Affect Exploratory Analysis.
IEEE Trans. Vis. Comput. Graph., 2017

The End of a Myth: Distributed Transaction Can Scale.
PVLDB, 2017

Revisiting Reuse for Approximate Query Processing.
PVLDB, 2017

A Data Quality Metric (DQM): How to Estimate the Number of Undetected Errors in Data Sets.
PVLDB, 2017

Rethinking Distributed Query Execution on High-Speed Networks.
IEEE Data Eng. Bull., 2017

Letter from the Special Issue Editor.
IEEE Data Eng. Bull., 2017

The Case for Learned Index Structures.
CoRR, 2017

Safe Visual Data Exploration.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Controlling False Discoveries During Interactive Data Exploration.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Approximate Query Processing for Interactive Data Science.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

What you see is not what you get!: Detecting Simpson's Paradoxes during Data Exploration.
Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics, 2017

Revisiting Reuse in Main Memory Database Systems.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Data Science Education: We're Missing the Boat, Again.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Toward Sustainable Insights, or Why Polygamy is Bad for You.
Proceedings of the CIDR 2017, 2017

IncMap: A Journey towards Ontology-based Data Integration.
Proceedings of the Datenbanksysteme für Business, 2017

Spotlytics: How to Use Cloud Market Places for Analytics?
Proceedings of the Datenbanksysteme für Business, 2017


2016
The End of Slow Networks: It's Time for a Redesign.
PVLDB, 2016

Towards a Benchmark for Interactive Data Exploration.
IEEE Data Eng. Bull., 2016

Controlling False Discoveries During Interactive Data Exploration.
CoRR, 2016

The End of a Myth: Distributed Transactions Can Scale.
CoRR, 2016

Revisiting Reuse in Main Memory Database Systems.
CoRR, 2016

A Data Quality Metric (DQM): How to Estimate The Number of Undetected Errors in Data Sets.
CoRR, 2016

Answering enumeration queries with the crowd.
Commun. ACM, 2016

Making the Case for Query-by-Voice with EchoQuery.
Proceedings of the 2016 International Conference on Management of Data, 2016

PrivateClean: Data Cleaning and Differential Privacy.
Proceedings of the 2016 International Conference on Management of Data, 2016

VisTrees: fast indexes for interactive data exploration.
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2016

The case for interactive data exploration accelerators (IDEAs).
Proceedings of the Workshop on Human-In-the-Loop Data Analytics, 2016

Estimating the Impact of Unknown Unknowns on Aggregate Query Results.
Proceedings of the 2016 International Conference on Management of Data, 2016

Dark Data: Are we solving the right problems?
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

2015
Crowdsourcing Enumeration Queries: Estimators and Interfaces.
IEEE Trans. Knowl. Data Eng., 2015

S-Store: Streaming Meets Transaction Processing.
PVLDB, 2015

Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views.
PVLDB, 2015

A Demonstration of the BigDAWG Polystore System.
PVLDB, 2015

Vizdom: Interactive Analytics through Pen and Touch.
PVLDB, 2015

An Architecture for Compiling UDF-centric Workflows.
PVLDB, 2015

SampleClean: Fast and Reliable Analytics on Dirty Data.
IEEE Data Eng. Bull., 2015

TuPAQ: An Efficient Planner for Large-scale Predictive Analytic Queries.
CoRR, 2015

S-Store: Streaming Meets Transaction Processing.
CoRR, 2015

Stale View Cleaning: Getting Fresh Answers from Stale Materialized Views.
CoRR, 2015

Fault-Tolerant Entity Resolution with the Crowd.
CoRR, 2015

Estimating the Impact of Unknown Unknowns on Aggregate Query Results.
CoRR, 2015

The End of Slow Networks: It's Time for a Redesign.
CoRR, 2015

Cost-based Fault-tolerance for Parallel Data Processing.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype?
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

SpotADAPT: Spot-Aware (re-)Deployment of Analytical Processing Tasks on Amazon EC2.
Proceedings of the ACM Eighteenth International Workshop on Data Warehousing and OLAP, 2015

Automating model search for large scale machine learning.
Proceedings of the Sixth ACM Symposium on Cloud Computing, 2015

Tupleware: "Big" Data, Big Analytics, Small Clusters.
Proceedings of the CIDR 2015, 2015

2014
S-Store: A Streaming NewSQL System for Big Velocity Applications.
PVLDB, 2014

Putting Analytics on the Spot: Or How to Lower the Cost for Analytics.
IEEE Internet Computing, 2014

Tupleware: Distributed Machine Learning on Small Clusters.
IEEE Data Eng. Bull., 2014

The Expected Optimal Labeling Order Problem for Crowdsourced Joins and Entity Resolution.
CoRR, 2014

Leveraging Transitive Relations for Crowdsourced Joins.
CoRR, 2014

Tupleware: Redefining Modern Analytics.
CoRR, 2014

A sample-and-clean framework for fast and accurate query processing on dirty data.
Proceedings of the International Conference on Management of Data, 2014

PLANET: making progress with commit processing in unpredictable environments.
Proceedings of the International Conference on Management of Data, 2014

Should we all be teaching "intro to data science" instead of "intro to databases"?
Proceedings of the International Conference on Management of Data, 2014

2013
The New Database Architectures.
IEEE Internet Computing, 2013

Finding the Needle in the Big Data Systems Haystack.
IEEE Internet Computing, 2013

MLI: An API for Distributed Machine Learning.
CoRR, 2013

Leveraging transitive relations for crowdsourced joins.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

RTP: robust tenant placement for elastic in-memory database clusters.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Generalized scale independence through incremental precomputation.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

MLI: An API for Distributed Machine Learning.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Crowdsourced enumeration queries.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

A Framework for Adaptive Crowd Query Processing.
Proceedings of the Human Computation and Crowdsourcing: Works in Progress and Demonstration Abstracts, 2013

CASTLE: Crowd-Assisted System for Text Labeling and Extraction.
Proceedings of the First AAAI Conference on Human Computation and Crowdsourcing, 2013

MDCC: multi-data center consistency.
Proceedings of the Eighth Eurosys Conference 2013, 2013

MLbase: A Distributed Machine-learning System.
Proceedings of the CIDR 2013, 2013

CrowdQ: Crowdsourced Query Understanding.
Proceedings of the CIDR 2013, 2013

2012
CrowdER: Crowdsourcing Entity Resolution.
PVLDB, 2012

CrowdER: Crowdsourcing Entity Resolution
CoRR, 2012

MDCC: Multi-Data Center Consistency
CoRR, 2012

Getting It All from the Crowd
CoRR, 2012

Stormy: an elastic and highly available streaming service in the cloud.
Proceedings of the 2012 Joint EDBT/ICDT Workshops, Berlin, Germany, March 30, 2012, 2012

2011
Repeatability and workability evaluation of SIGMOD 2011.
SIGMOD Record, 2011

CrowdDB: Query Processing with the VLDB Crowd.
PVLDB, 2011

Crowdsourcing Applications and Platforms: A Data Management Perspective.
PVLDB, 2011

PIQL: Success-Tolerant Query Processing in the Cloud.
PVLDB, 2011

PIQL: Success-Tolerant Query Processing in the Cloud
CoRR, 2011

CrowdDB: answering queries with crowdsourcing.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

2010
Cloudy: A Modular Cloud Storage System.
PVLDB, 2010

Data Management in the Cloud: Promises, State-of-the-art, and Open Questions.
Datenbank-Spektrum, 2010

An evaluation of alternative architectures for transaction processing in the cloud.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

2009
Consistency Rationing in the Cloud: Pay only when it matters.
PVLDB, 2009

XQuery Reloaded.
PVLDB, 2009

XQuery in the browser.
Proceedings of the 18th International Conference on World Wide Web, 2009

How is the weather tomorrow?: towards a benchmark for the cloud.
Proceedings of the 2nd International Workshop on Testing Database Systems, 2009

2008
XQuery in the browser.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Building a database on S3.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

2007
Extending XQuery with Window Functions.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

2006
Genea: Schema-Aware Mapping of Ontologies into Relational Databases.
Proceedings of the 13th International Conference on Management of Data, 2006

PathBank: Web-Based Querying and Visualziation of an Integrated Biological Pathway Database.
Proceedings of the Third International Conference on Computer Graphics, 2006


  Loading...