Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

DI-2022: The Third Document Intelligence Workshop.

Ani Nenkova

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2021

Glean: Structured Extractions from Templatic Documents.

Proc. VLDB Endow., 2021

Simplified DOM Trees for Transferable Attribute Extraction from the Web.

CoRR, 2021

DI-2021: The Second Document Intelligence Workshop.

Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

2020

Active Learning for Skewed Data Sets.

CoRR, 2020

FreeDOM: A Transferable Neural Architecture for Structured Information Extraction on Web Documents.

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Improving Recommendation Quality in Google Drive.

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Migrating a Privacy-Safe Information Extraction System to a Software 2.0 Design.

Proceedings of the 10th Conference on Innovative Data Systems Research, 2020

Representation Learning for Information Extraction from Form-like Documents.

Bodhisattwa Prasad Majumder

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Online Template Induction for Machine-Generated Emails.

Proc. VLDB Endow., 2019

RiSER: Learning Better Representations for Richly Structured Emails.

Proceedings of the World Wide Web Conference, 2019

ItemSuggest: A Data Management Platform for Machine Learned Ranking Services.

Proceedings of the 9th Biennial Conference on Innovative Data Systems Research, 2019

2018

Query Languages and Evaluation Techniques for Biological Sequence Data.

Sandeep Tata

Jignesh M. Patel

Encyclopedia of Database Systems, Second Edition, 2018

Hidden in Plain Sight: Classifying Emails Using Embedded Image Contents.

Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Anatomy of a Privacy-Safe Large-Scale Information Extraction System Over Email.

Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Recommendations for All: Solving Thousands of Recommendation Problems Daily.

Bhargav Kanagal

Sandeep Tata

Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

2017

Quick Access: Building a Smart Experience for Google Drive.

Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

2014

Diff-Index: Differentiated Index in Distributed Log-Structured Data Stores.

Proceedings of the 17th International Conference on Extending Database Technology, 2014

2013

Toward a scale-out data-management middleware for low-latency enterprise computing.

IBM J. Res. Dev., 2013

A platform for eXtreme Analytics.

IBM J. Res. Dev., 2013

BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters.

Hailiang Huang

Sandeep Tata

Robert J. Prill

Bioinform., 2013

Sparkler: supporting large-scale matrix factorization.

Boduo Li

Sandeep Tata

Yannis Sismanis

Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

2012

Clydesdale: structured data processing on Hadoop.

Andrey Balmin

Tim Kaldewey

Sandeep Tata

Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Clydesdale: structured data processing on MapReduce.

Tim Kaldewey

Eugene J. Shekita

Sandeep Tata

Proceedings of the 15th International Conference on Extending Database Technology, 2012

2011

Efficient and Accurate Discovery of Patterns in Sequence Data Sets.

Avrilia Floratou

Sandeep Tata

Jignesh M. Patel

IEEE Trans. Knowl. Data Eng., 2011

Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore.

Jun Rao

Eugene J. Shekita

Sandeep Tata

Proc. VLDB Endow., 2011

Column-Oriented Storage Techniques for MapReduce.

Proc. VLDB Endow., 2011

2010

Efficient and accurate discovery of patterns in sequence datasets.

Avrilia Floratou

Sandeep Tata

Jignesh M. Patel

Proceedings of the 26th International Conference on Data Engineering, 2010

2009

Query Languages and Evaluation Techniques for Biological Sequence Data.

Sandeep Tata

Jignesh M. Patel

Encyclopedia of Database Systems, 2009

Towards a Scalable Enterprise Content Analytics Platform.

Kevin S. Beyer

Vuk Ercegovac

Rajasekar Krishnamurthy

Shivakumar Vaithyanathan

Huaiyu Zhu

IEEE Data Eng. Bull., 2009

Leveraging a scalable row store to build a distributed text index.

Proceedings of the First International CIKM Workshop on Cloud Data Management, 2009

2008

SQAK: doing more with keywords.

Sandeep Tata

Guy M. Lohman

Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

On common tools for databases - The case for a client-based index advisor.

Sandeep Tata

Lin Qiao

Guy M. Lohman

Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

FLAME: Shedding Light on Hidden Frequent Patterns in Sequence Datasets.

Sandeep Tata

Jignesh M. Patel

Proceedings of the 24th International Conference on Data Engineering, 2008

2007

Declarative Querying For Biological Sequences.

Sandeep Tata

PhD thesis, 2007

Estimating the selectivity of tf-idf based cosine similarity predicates.

Sandeep Tata

Jignesh M. Patel

SIGMOD Rec., 2007

Periscope/SQ: Interactive Exploration of Biological Sequence Databases.

Sandeep Tata

Willis Lang

Jignesh M. Patel

Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

2006

Declarative Querying for Biological Sequences.

Proceedings of the 22nd International Conference on Data Engineering, 2006

2005

Practical methods for constructing suffix trees.

VLDB J., 2005

2004

Practical Suffix Tree Construction.

Sandeep Tata

Richard A. Hankins

Jignesh M. Patel

(e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

2003

PiQA: An Algebra for Querying Protein Data Sets.

Sandeep Tata

Jignesh M. Patel

Proceedings of the 15th International Conference on Scientific and Statistical Database Management (SSDBM 2003), 2003

Sandeep Tata
