Venkatesh Ganti

Affiliations:
  • Microsoft Research


According to our database, Venkatesh Ganti authored at least 43 papers between 1995 and 2018.

Bibliography

2018
Data Cleaning.
Encyclopedia of Database Systems, Second Edition, 2018

2013
Data Cleaning: A Practical Perspective
Synthesis Lectures on Data Management, Morgan & Claypool Publishers, ISBN: 978-3-031-01897-8, 2013

Crawling deep web entity pages.
Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, 2013

2011
Interval-based pruning for top-k processing over compressed lists.
Proceedings of the 27th International Conference on Data Engineering, 2011

2010
Keyword++: A Framework to Improve Keyword Search Over Entity Databases.
Proc. VLDB Endow., 2010

Precomputing search features for fast and accurate query classification.
Proceedings of the Third International Conference on Web Search and Web Data Mining, 2010

Query portals: dynamically generating portals for entity-oriented web queries.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

2009
Data Cleaning.
Encyclopedia of Database Systems, 2009

Mining Document Collections to Facilitate Accurate Approximate Entity Matching.
Proc. VLDB Endow., 2009

Exploiting web search to generate synonyms for entities.
Proceedings of the 18th International Conference on World Wide Web, 2009

Exploiting web search engines to search structured databases.
Proceedings of the 18th International Conference on World Wide Web, 2009

2008
Scalable ad-hoc entity extraction from text collections.
Proc. VLDB Endow., 2008

An efficient filter for approximate membership checking.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Entity categorization over large document collections.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

2007
Example-driven design of efficient record matching queries.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Leveraging aggregate constraints for deduplication.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

2006
Data Debugger: An Operator-Centric Approach for Data Quality Solutions.
IEEE Data Eng. Bull., 2006

Efficient Exact Set-Similarity Joins.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

Ranking objects based on relationships.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

A Primitive Operator for Similarity Joins in Data Cleaning.
Proceedings of the 22nd International Conference on Data Engineering, 2006

2005
Data cleaning in Microsoft SQL Server 2005.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Robust Identification of Fuzzy Duplicates.
Proceedings of the 21st International Conference on Data Engineering, 2005

2004
Data management technology for decision support systems.
Adv. Comput., 2004

Mining reference tables for automatic text segmentation.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Selectivity Estimation for String Predicates: Overcoming the Underestimation Problem.
Proceedings of the 20th International Conference on Data Engineering, 2004

2003
Robust and Efficient Fuzzy Match for Online Data Cleaning.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

2002
Mining Data Streams under Block Evolution.
SIGKDD Explor., 2002

A Framework for Measuring Differences in Data Characteristics.
J. Comput. Syst. Sci., 2002

Eliminating Fuzzy Duplicates in Data Warehouses.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

2001
DEMON: Mining and Monitoring Evolving Data.
IEEE Trans. Knowl. Data Eng., 2001

Database Technology for Decision Support Systems.
Computer, 2001

2000
RainForest - A Framework for Fast Decision Tree Construction of Large Datasets.
Data Min. Knowl. Discov., 2000

ICICLES: Self-Tuning Samples for Approximate Query Answering.
Proceedings of the VLDB 2000, 2000

1999
Approximate Query Answering using Histograms.
IEEE Data Eng. Bull., 1999

Mining Very Large Databases.
Computer, 1999

Fast Approximate Answers to Aggregate Queries on a Data Cube.
Proceedings of the 11th International Conference on Scientific and Statistical Database Management, 1999

BOAT-Optimistic Decision Tree Construction.
Proceedings of the SIGMOD 1999, 1999

A Framework for Measuring Changes in Data Characteristics.
Proceedings of the Eighteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, May 31, 1999

CACTUS - Clustering Categorical Data Using Summaries.
Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1999

Fast Approximate Query Answering Using Precomputed Statistics.
Proceedings of the 15th International Conference on Data Engineering, 1999

Clustering Large Datasets in Arbitrary Metric Spaces.
Proceedings of the 15th International Conference on Data Engineering, 1999

1996
Implementation of a Real-Time Database System.
Inf. Syst., 1996

1995
Design, Implementation and Performance of a Real-Time Version of a Commercial RDBMS.
Proceedings of the Advances in Data Management, 1995

