Anish Das Sarma

Affiliations:
  • Google, USA


According to our database, Anish Das Sarma authored at least 56 papers between 2004 and 2014.

Bibliography

2014
Fusing data with correlations.
Proceedings of the International Conference on Management of Data, 2014

Anchor-Points Algorithms for Hamming and Edit Distances Using MapReduce.
Proceedings of the 17th International Conference on Database Theory (ICDT), 2014

Crowd-powered find algorithms.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

2013
Data Cleaning: A Practical Perspective
Synthesis Lectures on Data Management, Morgan & Claypool Publishers, ISBN: 978-3-031-01897-8, 2013

Consistent thinning of large geographical data for map visualization.
ACM Trans. Database Syst., 2013

Upper and Lower Bounds on the Cost of a Map-Reduce Computation.
Proc. VLDB Endow., 2013

Optimal hashing schemes for entity matching.
Proceedings of the 22nd International World Wide Web Conference, 2013

SIGMOD 2013 new researcher symposium.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Finding connected components in map-reduce in logarithmic rounds.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

2012
Vision Paper: Towards an Understanding of the Limits of Map-Reduce Computation
CoRR, 2012

Understanding cyclic trends in social choices.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

Efficient spatial sampling of large geographical tables.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Finding related tables.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Extracting Information from Google Fusion Tables.
Proceedings of the Search Computing - Broadening Web Search, 2012

Fuzzy Joins Using MapReduce.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Designing good algorithms for MapReduce and beyond.
Proceedings of the ACM Symposium on Cloud Computing, SOCC '12, 2012

An automatic blocking mechanism for large-scale de-duplication tasks.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Dynamic covering for recommendation systems.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Human-assisted graph search: it's okay to ask questions.
Proc. VLDB Endow., 2011

REX: Explaining Relationships between Entity Pairs.
Proc. VLDB Endow., 2011

CBLOCK: An Automatic Blocking Mechanism for Large-Scale De-duplication Tasks
CoRR, 2011

Dynamic relationship and event discovery.
Proceedings of the Fourth International Conference on Web Search and Web Data Mining, 2011

Data integration with dependent sources.
Proceedings of the EDBT 2011, 2011

CoScan: cooperative scan sharing in the cloud.
Proceedings of the ACM Symposium on Cloud Computing in conjunction with SOSP 2011, 2011

Building a generic debugger for information extraction pipelines.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Ibis: A Provenance Manager for Multi-Layer Systems.
Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, 2011

Uncertainty in Data Integration and Dataspace Support Platforms.
Proceedings of the Schema Matching and Mapping, 2011

2010
Managing uncertain data.
PhD thesis, 2010

Foundations of Uncertain-Data Integration.
Proc. VLDB Endow., 2010

Understanding Fashion Cycles as a Social Choice
CoRR, 2010

PROBER: Ad-Hoc Debugging of Extraction and Integration Pipelines
CoRR, 2010

Ranking mechanisms in twitter-like forums.
Proceedings of the Third International Conference on Web Search and Web Data Mining, 2010

LIVE: A Lineage-Supported Versioned DBMS.
Proceedings of the Scientific and Statistical Database Management, 2010

I4E: interactive investigation of iterative information extraction.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Synthesizing view definitions from data.
Proceedings of the Database Theory, 2010

2009
Representing uncertain data: models, properties, and algorithms.
VLDB J., 2009

Space Constrained Dynamic Covering
CoRR, 2009

Functional Dependency Generation and Applications in Pay-As-You-Go Data Integration Systems.
Proceedings of the 12th International Workshop on the Web and Databases, 2009

Sailing the Information Ocean with Awareness of Currents: Discovery and Application of Source Dependence.
Proceedings of the Fourth Biennial Conference on Innovative Data Systems Research, 2009

Data Modeling in Dataspace Support Platforms.
Proceedings of the Conceptual Modeling: Foundations and Applications, 2009

Schema Design for Uncertain Databases.
Proceedings of the 3rd Alberto Mendelzon International Workshop on Foundations of Data Management, 2009

2008
Databases with uncertainty and lineage.
VLDB J., 2008

Bootstrapping pay-as-you-go data integration systems.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Towards Special-Purpose Indexes and Statistics for Uncertain Data.
Proceedings of the International Workshop on Quality in Databases and Management of Uncertain Data, 2008

Exploiting Lineage for Confidence Computation in Uncertain and Probabilistic Databases.
Proceedings of the 24th International Conference on Data Engineering, 2008

08421 Working Group: Classification, Representation and Modeling.
Proceedings of the Uncertainty Management in Information Systems, 12.10. - 17.10.2008, 2008

08421 Working Group: Lineage/Provenance.
Proceedings of the Uncertainty Management in Information Systems, 12.10. - 17.10.2008, 2008

2007
Detecting near-duplicates for web crawling.
Proceedings of the 16th International Conference on World Wide Web, 2007

Leveraging aggregate constraints for deduplication.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Trio-One: Layering Uncertainty and Lineage on a Conventional DBMS (Demo).
Proceedings of the Third Biennial Conference on Innovative Data Systems Research, 2007

2006
An Introduction to ULDBs and the Trio System.
IEEE Data Eng. Bull., 2006

ULDBs: Databases with Uncertainty and Lineage.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

Trio: A System for Data, Uncertainty, and Lineage.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

Working Models for Uncertain Data.
Proceedings of the 22nd International Conference on Data Engineering, 2006

2004
Generic Text Summarization Using WordNet.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

A Decomposition Based Approach for Design of Supply Aggregation and Demand Aggregation Exchanges.
Proceedings of the Applying Formal Methods: Testing, Performance and M/ECommerce, 2004

