Ihab F. Ilyas

According to our database1, Ihab F. Ilyas authored at least 126 papers between 2001 and 2019.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2019
Distributed Implementations of Dependency Discovery Algorithms.
PVLDB, 2019

Approximate Inference in Structured Instances with Noisy Categorical Observations.
CoRR, 2019

Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation.
CoRR, 2019

Principles of Progress Indicators for Database Repairing.
CoRR, 2019

HoloDetect: Few-Shot Learning for Error Detection.
CoRR, 2019

Matching Entities Across Different Knowledge Graphs with Graph Embeddings.
CoRR, 2019

Distributed Dependency Discovery.
CoRR, 2019

Approximate Inference in Structured Instances with Noisy Categorical Observations.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

HoloDetect: Few-Shot Learning for Error Detection.
Proceedings of the 2019 International Conference on Management of Data, 2019

APEx: Accuracy-Aware Differentially Private Data Exploration.
Proceedings of the 2019 International Conference on Management of Data, 2019

A Formal Framework for Probabilistic Unclean Databases.
Proceedings of the 22nd International Conference on Database Theory, 2019

Distributed Discovery of Functional Dependencies.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

A Semi-Supervised Framework of Clustering Selection for De-Duplication.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Unsupervised String Transformation Learning for Entity Consolidation.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Building Scalable Machine Learning Solutions for Data Cleaning.
Proceedings of the Datenbanksysteme für Business, 2019

Semi-supervised clustering for de-duplication.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Data unification at scale: data tamer.
Proceedings of the Making Databases Work: the Pragmatic Wisdom of Michael Stonebraker, 2019

2018
Top-k Queries.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Rank-Join.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Rank-Aware Query Processing.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Data Integration: The Current Status and the Way Forward.
IEEE Data Eng. Bull., 2018

Semi-supervised clustering for de-duplication.
CoRR, 2018

A Formal Framework For Probabilistic Unclean Databases.
CoRR, 2018

Building Data Civilizer Pipelines with an Advanced Workflow Engine.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Farewell Freebase: Migrating the SimpleQuestions Dataset to DBpedia.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Smart Meter Data Analytics: Systems, Algorithms, and Benchmarking.
ACM Trans. Database Syst., 2017

Data Quality: The Role of Empiricism.
SIGMOD Record, 2017

HoloClean: Holistic Data Repairs with Probabilistic Inference.
PVLDB, 2017

Private Exploration Primitives for Data Cleaning.
CoRR, 2017

Entity Consolidation: The Golden Record Problem.
CoRR, 2017

HoloClean: Holistic Data Repairs with Probabilistic Inference.
CoRR, 2017

A Demo of the Data Civilizer System.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017


2016
Distributed Data Deduplication.
PVLDB, 2016

Qualitative Data Cleaning.
PVLDB, 2016

Detecting Data Errors: Where are we and what needs to be done?
PVLDB, 2016

Learning to identify relevant studies for systematic reviews using random forest and external information.
Machine Learning, 2016

Editorial: Special Issue on Web Data Quality.
J. Data and Information Quality, 2016

Effective Data Cleaning with Continuous Evaluation.
IEEE Data Eng. Bull., 2016

CLAMS: Bringing Quality to Data Lakes.
Proceedings of the 2016 International Conference on Management of Data, 2016

Data Cleaning: Overview and Emerging Challenges.
Proceedings of the 2016 International Conference on Management of Data, 2016

LONLIES: Estimating Property Values for Long Tail Entities.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Dark Data: Are we solving the right problems?
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

DataXFormer: A robust transformation discovery system.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

2015
KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing.
PVLDB, 2015

Trends in Cleaning Relational Data: Consistency and Deduplication.
Foundations and Trends in Databases, 2015

DataXFormer: An Interactive Data Transformation Tool.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

BigDansing: A System for Big Data Cleansing.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

SMAS: A smart meter data analytics system.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Benchmarking Smart Meter Data Analytics.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

Dataxformer: Leveraging the Web for Semantic Transformations.
Proceedings of the CIDR 2015, 2015

2014
Sampling from repairs of conditional functional dependency violations.
VLDB J., 2014

Top-k Nearest Neighbor Search In Uncertain Data Series.
PVLDB, 2014

NADEEF/ER: generic and interactive entity resolution.
Proceedings of the International Conference on Management of Data, 2014

Descriptive and prescriptive data cleaning.
Proceedings of the International Conference on Management of Data, 2014

RuleMiner: Data quality rules discovery.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

2013
Probabilistic Web Data Management.
World Wide Web, 2013

NADEEF: A Generalized Data Cleaning System.
PVLDB, 2013

Discovering Denial Constraints.
PVLDB, 2013

We are drowning in a sea of least publishable units (LPUs).
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

NADEEF: a commodity data cleaning system.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Holistic data cleaning: Putting violations into context.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

On the relative trust between inconsistent data and inaccurate constraints.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Data Curation at Scale: The Data Tamer System.
Proceedings of the CIDR 2013, 2013

2012
The data analytics group at the qatar computing research institute.
SIGMOD Record, 2012

On the Relative Trust between Inconsistent Data and Inaccurate Constraints
CoRR, 2012

Just-in-time information extraction using extraction views.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Interpreting keyword queries over web knowledge bases.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Probabilistic Ranking Techniques in Relational Databases
Synthesis Lectures on Data Management, Morgan & Claypool Publishers, 2011

Guided data repair.
PVLDB, 2011

Guided Data Repair
CoRR, 2011

Ranking with uncertain scoring functions: semantics and sensitivity measures.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

2010
Supporting ranking queries on uncertain and incomplete data.
VLDB J., 2010

Building Ranked Mashups of Unstructured Sources with Uncertain Information.
PVLDB, 2010

QUICK: Expressive and Flexible Search over Knowledge Bases and Text Collections.
PVLDB, 2010

Sampling the Repairs of Functional Dependency Violations under Hard Constraints.
PVLDB, 2010

Expressive and flexible access to web-extracted data: a keyword-based structured query language.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Trends in Rank Join.
Proceedings of the Search Computing, 2010

Uncertainty in Rank Join.
Proceedings of the Search Computing, 2010

MashRank: Towards uncertainty-aware and rank-aware mashups.
Proceedings of the 26th International Conference on Data Engineering, 2010

ProbClean: A probabilistic duplicate detection system.
Proceedings of the 26th International Conference on Data Engineering, 2010

2009
Discovering and Exploiting Statistical Properties for Query Optimization in Relational Databases: A Survey.
Statistical Analysis and Data Mining, 2009

Creating Competitive Products.
PVLDB, 2009

StatAdvisor: Recommending Statistical Views.
PVLDB, 2009

Modeling and Querying Possible Repairs in Duplicate Detection.
PVLDB, 2009

Guest editorial: special issue on ranking in databases.
Distributed and Parallel Databases, 2009

Rank-Join Algorithms for Search Computing.
Proceedings of the Search Computing: Challenges and Directions [outcome of the first SeCO Workshop on Search Computing Challenges and Directions, 2009

PSALM: Cardinality Estimation inthe Presence of Fine-Grained Access Controls.
Proceedings of the 25th International Conference on Data Engineering, 2009

Ranking with Uncertain Scores.
Proceedings of the 25th International Conference on Data Engineering, 2009

2008
Probabilistic top-k and ranking-aggregate queries.
ACM Trans. Database Syst., 2008

Efficient search for the top-k probable nearest neighbors in uncertain databases.
PVLDB, 2008

A survey of top-k query processing techniques in relational database systems.
ACM Comput. Surv., 2008

Message from the DBRANK'08 program co-chairs.
Proceedings of the 24th International Conference on Data Engineering Workshops, 2008

08421 Working Group: Classification, Representation and Modeling.
Proceedings of the Uncertainty Management in Information Systems, 12.10. - 17.10.2008, 2008

08421 Working Group: Lineage/Provenance.
Proceedings of the Uncertainty Management in Information Systems, 12.10. - 17.10.2008, 2008

2007
Report on the First International Workshop on Ranking in Databases (DBRank'07).
SIGMOD Record, 2007

URank: formulation and efficient evaluation of top-k queries in uncertain databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Finding Skyline and Top-k Bargaining Solutions.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Top-k Query Processing in Uncertain Databases.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Collecting and Maintaining Just-in-Time Statistics.
Proceedings of the 23rd International Conference on Data Engineering, 2007

2006
Adaptive rank-aware query optimization in relational databases.
ACM Trans. Database Syst., 2006

FIX: Feature-based Indexing Technique for XML Documents.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

InterJoin: Exploiting Indexes and Materialized Views in XPath Evaluation.
Proceedings of the 18th International Conference on Scientific and Statistical Database Management, 2006

Supporting ad-hoc ranking aggregates.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

XSEED: Accurate and Fast Cardinality Estimation for XPath Queries.
Proceedings of the 22nd International Conference on Data Engineering, 2006

2005
RankSQL: Supporting Ranking Queries in Relational Database Management Systems.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

RankSQL: Query Algebra and Optimization for Relational Top-k Queries.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Rank-Aware Query Processing and Optimization.
Proceedings of the 21st International Conference on Data Engineering, 2005

2004
Supporting top-k join queries in relational databases.
VLDB J., 2004

Reminiscences on Influential Papers.
SIGMOD Record, 2004

VDBMS: A testbed facility for research in video database benchmarking.
Multimedia Syst., 2004

CORDS: Automatic Generation of Correlation Statistics in DB2.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

Rank-aware Query Optimization.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Nile: A Query Processing Engine for Data Streams.
Proceedings of the 20th International Conference on Data Engineering, 2004

Automatic Relationship Discovery in Self-Managing Database Systems.
Proceedings of the 1st International Conference on Autonomic Computing (ICAC 2004), 2004

2003
Supporting Top-k Join Queries in Relational Databases.
Proceedings of 29th International Conference on Very Large Data Bases, 2003

Estimating Compilation Time of a Query Optimizer.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

Video query processing in the VDBMS testbed for video database research.
Proceedings of the First ACM International Workshop on Multimedia Databases, 2003

2002
Joining Ranked Inputs in Practice.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

A Video Database Management System for Advancing Video Database Research.
Proceedings of the MIS 2002, International Workshop on Multimedia Information Systems, October 10, 2002

A Distributed Database Server for Continuous Media.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

2001
SP-GiST: An Extensible Database Index for Supporting Space Partitioning Trees.
J. Intell. Inf. Syst., 2001

An Extensible Index for Spatial Databases.
Proceedings of the 13th International Conference on Scientific and Statistical Database Management, 2001


  Loading...