Wang Chiew Tan

Orcid: 0009-0008-4174-7545

Affiliations:
  • Meta, USA
  • University of California, Santa Cruz, USA (former)


According to our database, Wang Chiew Tan authored at least 132 papers between 1996 and 2023.

Awards

ACM Fellow

ACM Fellow 2015, "For contributions to data provenance and to the foundations of information integration."

Bibliography

2023
Effective entity matching with transformers.
VLDB J., November, 2023

Diversity, Equity and Inclusion Activities in Database Conferences: A 2022 Report.
SIGMOD Rec., June, 2023

Unstructured and structured data: Can we have the best of both worlds with large language models?
IEEE Data Eng. Bull., 2023

Personal Data for Personal Use: Vision or Reality?
Proceedings of the Companion of the 2023 International Conference on Management of Data, 2023

TimelineQA: A Benchmark for Question Answering over Timelines.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Reimagining Retrieval Augmented Language Models for Answering Queries.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Diversity and Inclusion Activities in Database Conferences: A 2021 Report.
SIGMOD Rec., 2022

Annotating Columns with Pre-trained Language Models.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

2021
Querying subjective data.
VLDB J., 2021

Data Augmentation for ML-driven Data Preparation and Integration.
Proc. VLDB Endow., 2021

Deep Entity Matching: Challenges and Opportunities.
ACM J. Data Inf. Qual., 2021

Constructing Explainable Opinion Graphs from Reviews.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Deep Data Integration.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Adaptive Rule Discovery for Labeling Text Data.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Convex Aggregation for Opinion Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
Advice from SIGMOD/PODS 2020.
SIGMOD Rec., 2020

Sato: Contextual Semantic Type Detection in Tables.
Proc. VLDB Endow., 2020

Deep or Simple Models for Semantic Tagging? It Depends on your Data.
Proc. VLDB Endow., 2020

Deep Entity Matching with Pre-Trained Language Models.
Proc. VLDB Endow., 2020

Deep or Simple Models for Semantic Tagging? It Depends on your Data [Experiments].
CoRR, 2020

ExplainIt: Explainable Review Summarization with Opinion Causality Graphs.
CoRR, 2020

Enhancing Review Comprehension with Domain-Specific Commonsense.
CoRR, 2020

Towards Productionizing Subjective Search Systems.
CoRR, 2020

Technical perspective: Entity matching with Magellan.
Commun. ACM, 2020

ExtremeReader: An interactive explorer for customizable and explainable review summarization.
Proceedings of the Companion of The 2020 Web Conference 2020, 2020

Snippext: Semi-supervised Opinion Mining with Augmented Data.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Unleashing the Power of Subjective Data: Managing Experiences as First-Class Citizens.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

SubjQA: A Dataset for Subjectivity and Review Comprehension.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Teddy: A System for Interactive Review Analysis.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

Sampo: Unsupervised Knowledge Base Construction for Opinions and Implications.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

OpinionDigest: A Simple Framework for Opinion Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Emu: Enhancing Multilingual Sentence Embeddings with Semantic Specialization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
What I probably did right and what I think I could have done better.
Proc. VLDB Endow., 2019

Subjective Databases.
Proc. VLDB Endow., 2019

Expressive power of entity-linking frameworks.
J. Comput. Syst. Sci., 2019

Jo: The Smart Journal.
CoRR, 2019

Voyageur: An Experiential Travel Search Engine.
Proceedings of the World Wide Web Conference, 2019

Essentia: Mining Domain-specific Paraphrases with Word-Alignment Graphs.
Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing, 2019

Open Information Extraction from Question-Answer Pairs.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Enabling Search by Experience.
Proceedings of the 13th Alberto Mendelzon International Workshop on Foundations of Data Management, 2019

Happiness Entailment: Automating Suggestions for Well-Being.
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction, 2019

2018
Schema Mapping Composition.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Provenance in Databases.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Provenance in Scientific Databases.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Technical Perspective: Toward Building Entity Matching Management Systems.
SIGMOD Rec., 2018

Technical Perspective: A Relational Framework for Classifier Engineering.
SIGMOD Rec., 2018

Data Provenance: What next?
SIGMOD Rec., 2018

Koko: A System for Scalable Semantic Querying of Text.
Proc. VLDB Endow., 2018

Scalable Semantic Querying of Text.
Proc. VLDB Endow., 2018

GOLDRUSH: Rule Sharing System for Fraud Detection.
Proc. VLDB Endow., 2018

BigGorilla: An Open-Source Ecosystem for Data Preparation and Integration.
IEEE Data Eng. Bull., 2018

Active Learning of GAV Schema Mappings.
Proceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, 2018

HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Rule Sharing for Fraud Detection via Adaptation.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Interactive Rule Refinement for Fraud Detection.
Proceedings of the 21st International Conference on Extending Database Technology, 2018

FrameIt: Ontology Discovery for Noisy User-Generated Text.
Proceedings of the 4th Workshop on Noisy User-generated Text, 2018

2017
Approximation Algorithms for Schema-Mapping Discovery from Data Examples.
ACM Trans. Database Syst., 2017

Data Integration: After the Teenage Years.
Proceedings of the 36th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, 2017

2016
A Declarative Framework for Linking Entities.
ACM Trans. Database Syst., 2016

Technical Perspective: Attacking the Problem of Consistent Query Answering.
SIGMOD Rec., 2016

A Time Machine for Information: Looking Back to Look Forward.
SIGMOD Rec., 2016

Database Theory Column: Report on PODS 2016.
SIGACT News, 2016

Rudolf: Interactive Rule Refinement System for Fraud Detection.
Proc. VLDB Endow., 2016

Temporal Data Exchange.
CoRR, 2016

2015
A Time Machine for Information: Looking Back to Look Forward.
Proc. VLDB Endow., 2015

QOCO: A Query Oriented Data Cleaning System with Oracles.
Proc. VLDB Endow., 2015

Foreword: Special Issue on Database Theory.
Theory Comput. Syst., 2015

Linking Temporal Records for Profiling Entities.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

Query-Oriented Data Cleaning with Oracles.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

High-Level Why-Not Explanations using Ontologies.
Proceedings of the 34th ACM Symposium on Principles of Database Systems, 2015

2014
Federation in Cloud Data Management: Challenges and Opportunities.
IEEE Trans. Knowl. Data Eng., 2014

Preference-aware Integration of Temporal Data.
Proc. VLDB Endow., 2014

A hybrid machine-crowdsourcing system for matching web tables.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

2013
Efficient Querying of Inconsistent Databases with Binary Integer Programming.
Proc. VLDB Endow., 2013

Schema mappings and data examples.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

Data Integration and Data Exchange: It's Really About Time.
Proceedings of the Sixth Biennial Conference on Innovative Data Systems Research, 2013

A New Framework for Designing Schema Mappings.
Proceedings of the In Search of Elegance in the Theory and Practice of Computation, 2013

2012
Splash: a platform for analysis and simulation of health.
Proceedings of the ACM International Health Informatics Symposium, 2012

Asking the Right Questions in Crowd Data Sourcing.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Splash: Simulation optimization in complex systems of systems.
Proceedings of the 50th Annual Allerton Conference on Communication, 2012

2011
Reverse data exchange: Coping with nulls.
ACM Trans. Database Syst., 2011

Characterizing schema mappings via data examples.
ACM Trans. Database Syst., 2011

Data is Dead... Without What-If Models.
Proc. VLDB Endow., 2011

EIRENE: Interactive Design and Refinement of Schema Mappings via Data Examples.
Proc. VLDB Endow., 2011

Information technology for healthcare transformation.
IBM J. Res. Dev., 2011

Letter from the Special Issue Editors.
IEEE Data Eng. Bull., 2011

Designing and refining schema mappings via data examples.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

On the tractability and intractability of consistent conjunctive query answering.
Proceedings of the 2011 Joint EDBT/ICDT Ph.D. Workshop, Uppsala, Sweden, March 25, 2011, 2011

Schema Mapping Evolution Through Composition and Inversion.
Proceedings of the Schema Matching and Mapping, 2011

2010
MapMerge: Correlating Independent Schema Mappings.
Proc. VLDB Endow., 2010

Letter from the Special Issue Editor.
IEEE Data Eng. Bull., 2010

Characterizing schema mappings via data examples.
Proceedings of the Twenty-Ninth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, 2010

Database Constraints and Homomorphism Dualities.
Proceedings of the Principles and Practice of Constraint Programming - CP 2010, 2010

2009
Schema Mapping Composition.
Proceedings of the Encyclopedia of Database Systems, 2009

Provenance.
Proceedings of the Encyclopedia of Database Systems, 2009

Provenance in Scientific Databases.
Proceedings of the Encyclopedia of Database Systems, 2009

Artemis: A System for Analyzing Missing Answers.
Proc. VLDB Endow., 2009

Laconic Schema Mappings: Computing the Core with SQL Queries.
Proc. VLDB Endow., 2009

Provenance in Databases: Why, How, and Where.
Found. Trends Databases, 2009

Laconic schema mappings: computing core universal solutions by means of SQL queries.
CoRR, 2009

2008
Quasi-inverses of schema mappings.
ACM Trans. Database Syst., 2008

Data exchange with data-metadata translations.
Proc. VLDB Endow., 2008

Comparing and evaluating mapping systems with STBenchmark.
Proc. VLDB Endow., 2008

STBenchmark: towards a benchmark for mapping systems.
Proc. VLDB Endow., 2008

Muse: a system for understanding and designing mappings.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Curated databases.
Proceedings of the Twenty-Seventh ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, 2008

Muse: Mapping Understanding and deSign by Example.
Proceedings of the 24th International Conference on Data Engineering, 2008

2007
Provenance in Databases: Past, Current, and Future.
IEEE Data Eng. Bull., 2007

Provenance in databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

2006
Peer data exchange.
ACM Trans. Database Syst., 2006

Debugging Schema Mappings with Routes.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

SPIDER: a Schema mapPIng DEbuggeR.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

The complexity of data exchange.
Proceedings of the Twenty-Fifth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 2006

2005
An annotation management system for relational databases.
VLDB J., 2005

Composing schema mappings: Second-order dependencies to the rescue.
ACM Trans. Database Syst., 2005

DBNotes: a post-it system for relational databases based on provenance.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

2004
Archiving scientific data.
ACM Trans. Database Syst., 2004

Research Problems in Data Provenance.
IEEE Data Eng. Bull., 2004

2003
Reasoning about keys for XML.
Inf. Syst., 2003

Containment of Relational Queries with Annotation Propagation.
Proceedings of the Database Programming Languages, 9th International Workshop, 2003

2002
SilkRoute: A framework for publishing relational data in XML.
ACM Trans. Database Syst., 2002

Keys for XML.
Comput. Networks, 2002

On Propagation of Deletions and Annotations Through Views.
Proceedings of the Twenty-first ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 2002

2001
Publishing Relational Data in XML: the SilkRoute Approach.
IEEE Data Eng. Bull., 2001

On Computing Functions with Uncertainty.
Proceedings of the Twentieth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 2001

Why and Where: A Characterization of Data Provenance.
Proceedings of the Database Theory, 2001

2000
SilkRoute: trading between relations and XML.
Comput. Networks, 2000

Towards a Query Language for Annotation Graphs.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Data Provenance: Some Basic Issues.
Proceedings of the Foundations of Software Technology and Theoretical Computer Science, 2000

1998
A Graphical Interface to Genome Multidatabases.
J. Database Manag., 1998

Beyond XML Query Languages.
Proceedings of the Query Languages Workshop, Boston, 1998

1996
QUICK: Graphical User Interface to Multiple Databases.
Proceedings of the Seventh International Workshop on Database and Expert Systems Applications, 1996

