Haixun Wang

ORCID: 0000-0002-1378-4241

Affiliations:
  • Instacart
  • Google (former)
  • University of California, Los Angeles, USA (former)


According to our database, Haixun Wang authored at least 250 papers between 1998 and 2024.

Bibliography

2024
Multilingual spatial domain natural language interface to databases.
GeoInformatica, January, 2024

2023
Rethinking E-Commerce Search.
SIGIR Forum, December, 2023

Will LLMs reshape, supercharge, or kill data science?
Proc. VLDB Endow., 2023

Mitigating Pooling Bias in E-commerce Search via False Negative Estimation.
CoRR, 2023

Dynamic Embedding-based Retrieval for Personalized Item Recommendations at Instacart.
Companion Proceedings of the ACM Web Conference 2023, 2023

Graph-Structured Gaussian Processes for Transferable Graph Learning.
Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Modeling Sequential Collaborative User Behaviors For Seller-Aware Next Basket Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
An Embedding-Based Grocery Search Model at Instacart.
CoRR, 2022

Rethink e-Commerce Search.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Extraction of Reliable and Actionable Information from Social Media During Emergencies.
Proceedings of the IEEE Global Humanitarian Technology Conference, 2022

Adversarial Robustness through Bias Variance Decomposition: A New Perspective for Federated Learning.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

A Short Survey on the User Cold Start Problem in Recommender Systems: Metadata and Meta-Learning Methods.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
From Intrinsic to Counterfactual: On the Explainability of Contextualized Recommender Systems.
CoRR, 2021

Tensor-based Complementary Product Recommendation.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Letter from the Editor-in-Chief.
IEEE Data Eng. Bull., 2020

A Natural Language Interface for Database: Achieving Transfer-learnability Using Adversarial Method for Question Understanding.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

2019
SpatialNLI: A Spatial Domain Natural Language Interface to Databases Using Spatial Comprehension.
Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, 2019

2018
Diagnosing and Minimizing Semantic Drift in Iterative Bootstrapping Extraction.
IEEE Trans. Knowl. Data Eng., 2018

Answering Natural Language Questions by Subgraph Matching over Knowledge Graphs.
IEEE Trans. Knowl. Data Eng., 2018

Employing Semantic Context for Sparse Information Extraction Assessment.
ACM Trans. Knowl. Discov. Data, 2018

A Transfer-Learnable Natural Language Interface for Databases.
CoRR, 2018

Answering Natural Language Questions by Subgraph Matching over Knowledge Graphs (Extended Abstract).
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

2017
Semantic Bootstrapping: A Theoretical Perspective.
IEEE Trans. Knowl. Data Eng., 2017

Probase+: Inferring Missing Links in Conceptual Taxonomies.
IEEE Trans. Knowl. Data Eng., 2017

Understand Short Texts by Harvesting and Analyzing Semantic Knowledge.
IEEE Trans. Knowl. Data Eng., 2017

Scaling Up Markov Logic Probabilistic Inference for Social Graphs.
IEEE Trans. Knowl. Data Eng., 2017

KBQA: Learning Question Answering over QA Corpora and Knowledge Bases.
Proc. VLDB Endow., 2017

Letter from the Special Issue Editor.
IEEE Data Eng. Bull., 2017

Trinity Graph Engine and its Applications.
IEEE Data Eng. Bull., 2017

Entity Suggestion with Conceptual Explanation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Automatic Navbox Generation by Interpretable Clustering over Linked Entities.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

On the Transitivity of Hypernym-Hyponym Relations in Data-Driven Lexical Taxonomies.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Graph-Based Wrong IsA Relation Detection in a Large-Scale Lexical Taxonomy.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Understanding Short Texts through Semantic Enrichment and Hashing.
IEEE Trans. Knowl. Data Eng., 2016

Proxies for Shortest Path and Distance Queries.
IEEE Trans. Knowl. Data Eng., 2016

Unsupervised Head-Modifier Detection in Search Queries.
ACM Trans. Knowl. Discov. Data, 2016

G-SQL: Fast Query Processing via Graph Exploration.
Proc. VLDB Endow., 2016

Learning Defining Features for Categories.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Syntactic Parsing of Web Queries.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Fine-Grained Semantic Conceptualization of FrameNet.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Verb Pattern: A Probabilistic Semantic Representation on Verbs.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Extending Relational Query Languages for Data Streams.
Proceedings of the Data Stream Management - Processing High-Speed Data Streams, 2016

2015
G-Index Model: A generic model of index schemes for top-k spatial-keyword queries.
World Wide Web, 2015

Graph similarity search on large uncertain graph databases.
VLDB J., 2015

Automatic Taxonomy Construction from Keywords via Scalable Bayesian Rose Trees.
IEEE Trans. Knowl. Data Eng., 2015

A Large Probabilistic Semantic Network Based Approach to Compute Term Similarity.
IEEE Trans. Knowl. Data Eng., 2015

Guest Editorial: Special Issue on Managing and Mining Massive Graphs.
Distributed Parallel Databases, 2015

Entity Suggestion by Example using a Conceptual Taxonomy.
CoRR, 2015

Learning Knowledge Bases for Multimedia in 2015.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Learning Term Embeddings for Hypernymy Identification.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Query Understanding through Knowledge-Based Conceptualization.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

On Conceptual Labeling of a Bag of Words.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Open Domain Short Text Conceptualization: A Generative + Descriptive Modeling Approach.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Short text understanding through lexical-semantic analysis.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Inferencing in information extraction: Techniques and applications.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

An Inference Approach to Basic Level of Categorization.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
Efficient processing of k-hop reachability queries.
VLDB J., 2014

A Generic Framework for Top-k Pairs and Top-k Objects Queries over Sliding Windows.
IEEE Trans. Knowl. Data Eng., 2014

A Unified Framework for Answering k Closest Pairs Queries and Variants.
IEEE Trans. Knowl. Data Eng., 2014

Guest Editorial: Big Social Data Analysis.
Knowl. Based Syst., 2014

Semantic Multidimensional Scaling for Open-Domain Sentiment Analysis.
IEEE Intell. Syst., 2014

The Links Have It: Infobox Generation by Summarization over Linked Entities.
CoRR, 2014

Distance Landmarks Revisited for Road Graphs.
CoRR, 2014

Natural language question answering over RDF: a graph data driven approach.
Proceedings of the International Conference on Management of Data, 2014

Local search of communities in large graphs.
Proceedings of the International Conference on Management of Data, 2014

Learning Knowledge Bases for Text and Multimedia.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

How to partition a billion-node graph.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Head, modifier, and constraint detection in short texts.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Overcoming Semantic Drift in Information Extraction.
Proceedings of the 17th International Conference on Extending Database Technology, 2014

WiiCluster: a Platform for Wikipedia Infobox Generation.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

Transfer Understanding from Head Queries to Tail Queries.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

2013
RASIM: a rank-aware separate index method for answering top-k spatial keyword queries.
World Wide Web, 2013

Asymmetric signature schemes for efficient exact edit similarity query processing.
ACM Trans. Database Syst., 2013

Efficient Keyword Search on Uncertain Graph Data.
IEEE Trans. Knowl. Data Eng., 2013

A Bayesian Inference-Based Framework for RFID Data Cleansing.
IEEE Trans. Knowl. Data Eng., 2013

Data-Driven Metaphor Recognition and Explanation.
Trans. Assoc. Comput. Linguistics, 2013

A Distributed Graph Engine for Web Scale RDF Data.
Proc. VLDB Endow., 2013

Toward a Distance Oracle for Billion-Node Graphs.
Proc. VLDB Endow., 2013

Preface.
J. Comput. Sci. Technol., 2013

A query integrity assurance scheme for accessing outsourced spatial databases.
GeoInformatica, 2013

Statistical Approaches to Concept-Level Sentiment Analysis.
IEEE Intell. Syst., 2013

Knowledge-Based Approaches to Concept-Level Sentiment Analysis.
IEEE Intell. Syst., 2013

Hub-Accelerator: Fast and Exact Shortest Path Computation in Large Social Networks.
CoRR, 2013

Identifying users' topical tasks in web search.
Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, 2013

Trinity: a distributed graph engine on a memory cloud.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

An online cost sensitive decision-making method in crowdsourcing systems.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Online search of overlapping communities.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Query Suggestion by Concept Instantiation.
Proceedings of the ISWC 2013 Posters & Demonstrations Track, 2013

Context-Dependent Conceptualization.
Proceedings of the IJCAI 2013, 2013

On Anomalous Hotspot Discovery in Graph Streams.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Automatic extraction of top-k lists from the web.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Attribute extraction and scoring: A probabilistic approach.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

LinkProbe: Probabilistic inference on large-scale social networks.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Shallow Information Extraction for the knowledge Web.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Semantic queries by example.
Proceedings of the Joint 2013 EDBT/ICDT Conferences, 2013

Computing term similarity by large probabilistic isA knowledge.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Assessing sparse information extraction using semantic contexts.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Wikification via link co-occurrence.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Understanding Short Texts.
Proceedings of the Web Technologies and Applications - 15th Asia-Pacific Web Conference, 2013

2012
Efficient Computation of Range Aggregates against Uncertain Location-Based Queries.
IEEE Trans. Knowl. Data Eng., 2012

Efficient Subgraph Similarity Search on Large Probabilistic Graph Databases.
Proc. VLDB Endow., 2012

Efficient Subgraph Matching on Billion Node Graphs.
Proc. VLDB Endow., 2012

K-Reach: Who is in Your Small World.
Proc. VLDB Endow., 2012

Beyond ten blue links: enabling user click modeling in federated web search.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

Probase: a probabilistic taxonomy for text understanding.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Managing and mining large graphs: systems and implementations.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Optimizing index for taxonomy keyword search.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

A system for extracting top-K lists from the web.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Automatic taxonomy construction from keywords.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Efficiently Monitoring Top-k Pairs over Sliding Windows.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Understanding Tables on the Web.
Proceedings of the Conceptual Modeling, 2012

Concept-Based Web Search.
Proceedings of the Conceptual Modeling, 2012

2011
Relational languages and data models for continuous queries on sequences and data streams.
ACM Trans. Database Syst., 2011

Path-tree: An efficient reachability indexing scheme for large directed graphs.
ACM Trans. Database Syst., 2011

A conversation with MSRA researchers.
SIGKDD Explor., 2011

Efficient Subgraph Search over Large Uncertain Graphs.
Proc. VLDB Endow., 2011

Web Scale Taxonomy Cleansing.
Proc. VLDB Endow., 2011

Distance-Constraint Reachability Computation in Uncertain Graphs.
Proc. VLDB Endow., 2011

Querying uncertain data with aggregate constraints.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Finding semantics in time series.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Short Text Conceptualization Using a Probabilistic Knowledgebase.
Proceedings of the IJCAI 2011, 2011

Tracking and Connecting Topics via Incremental Hierarchical Dirichlet Processes.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Isanette: A Common and Common Sense Knowledge Base for Opinion Mining.
Proceedings of the Data Mining Workshops (ICDMW), 2011

A unified approach for computing top-k pairs in multidimensional space.
Proceedings of the 27th International Conference on Data Engineering, 2011

On dimensionality reduction of massive graphs for indexing and retrieval.
Proceedings of the 27th International Conference on Data Engineering, 2011

Link-based hidden attribute discovery for objects on Web.
Proceedings of the EDBT 2011, 2011

Challenges in Managing and Mining Large, Heterogeneous Data.
Proceedings of the Database Systems for Advanced Applications, 2011

Finding information nebula over large networks.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Text Mining in Social Networks.
Proceedings of the Social Network Data Analytics, 2011

2010
A Survey of Algorithms for Keyword Search on Graph Data.
Proceedings of the Managing and Mining Graph Data, 2010

A Survey of Clustering Algorithms for Graph Data.
Proceedings of the Managing and Mining Graph Data, 2010

Graph Data Management and Mining: A Survey of Algorithms and Applications.
Proceedings of the Managing and Mining Graph Data, 2010

An Introduction to Graph Data.
Proceedings of the Managing and Mining Graph Data, 2010

Mining Concept-Drifting Data Streams.
Proceedings of the Data Mining and Knowledge Discovery Handbook, 2nd ed., 2010

An Integrated Data-Driven Framework for Computing System Management.
IEEE Trans. Syst. Man Cybern. Part A, 2010

Report on the first international workshop on cloud data management (CloudDB 2009).
SIGMOD Rec., 2010

Optimizing content freshness of relations extracted from the web using keyword search.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

MapDupReducer: detecting near duplicates over massive datasets.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

An algorithmic approach to event summarization.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Computing label-constraint reachability in graph databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Leveraging spatio-temporal redundancy for RFID data cleansing.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Incorporating post-click behaviors into a click model.
Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Adaptive system anomaly prediction for large-scale hosting infrastructures.
Proceedings of the 29th Annual ACM Symposium on Principles of Distributed Computing, 2010

Cleansing uncertain databases leveraging aggregate constraints.
Workshops Proceedings of the 26th International Conference on Data Engineering, 2010

2009
Query Integrity Assurance of Location-Based Services Accessing Outsourced Spatial Databases.
Proceedings of the Advances in Spatial and Temporal Databases, 2009

Inverse Time Dependency in Convex Regularized Learning.
Proceedings of the ICDM 2009, 2009

Weighted Proximity Best-Joins for Information Retrieval.
Proceedings of the 25th International Conference on Data Engineering, 2009

Online Anomaly Prediction for Robust Cluster Systems.
Proceedings of the 25th International Conference on Data Engineering, 2009

Concept Clustering of Evolving Data.
Proceedings of the 25th International Conference on Data Engineering, 2009

CloudDB workshop summary.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Semantic queries in databases: problems and challenges.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Learning to rank with a novel kernel perceptron method.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Location-Based Spatial Query Processing in Wireless Broadcast Environments.
IEEE Trans. Mob. Comput., 2008

Clustering by Pattern Similarity.
J. Comput. Sci. Technol., 2008

Time-Stamp Management and Query Execution in Data Stream Management Systems.
IEEE Internet Comput., 2008

Lock-free consistency control for web 2.0 applications.
Proceedings of the 17th International Conference on World Wide Web, 2008

Efficiently answering reachability queries on very large directed graphs.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

A Sampling-Based Approach to Information Recovery.
Proceedings of the 24th International Conference on Data Engineering, 2008

Fast Graph Pattern Matching.
Proceedings of the 24th International Conference on Data Engineering, 2008

Stop Chasing Trends: Discovering High Order Models in Evolving Data.
Proceedings of the 24th International Conference on Data Engineering, 2008

Modeling and Querying E-Commerce Data in Hybrid Relational-XML DBMSs.
Proceedings of the Conceptual Modeling, 2008

Providing freshness guarantees for outsourced databases.
Proceedings of the EDBT 2008, 2008

Fast computing reachability labelings for large graphs with high compression rate.
Proceedings of the EDBT 2008, 2008

Dual encryption for query integrity assurance.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007
A Low-Granularity Classifier for Data Streams with Concept Drifts and Biased Class Distribution.
IEEE Trans. Knowl. Data Eng., 2007

Integrity Auditing of Outsourced Data.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Challenges and Experience in Prototyping a Multi-Modal Stream Analytic and Monitoring Application on System S.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Unifying Data and Domain Knowledge Using Virtual Views.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Supporting ranking and clustering as generalized order-by and group-by.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

BLINKS: ranked keyword searches on graphs.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Load Shedding in Classifying Multi-Source Streaming Data: A Bayes Risk Approach.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

Event summarization for system management.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Computing Compressed Multidimensional Skyline Cubes Efficiently.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Semantic Data Management: Towards Querying Data with their Meaning.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Location-based Spatial Queries with Data Sharing in Wireless Broadcast Environments.
Proceedings of the 23rd International Conference on Data Engineering, 2007

GString: A Novel Approach for Efficient Search in Graph Databases.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Adaptive Load Diffusion for Multiway Windowed Stream Joins.
Proceedings of the 23rd International Conference on Data Engineering, 2007

A Flexible Query Graph Based Model for the Efficient Execution of Continuous Queries.
Proceedings of the 23rd International Conference on Data Engineering Workshops, 2007

Optimizing Timestamp Management in Data Stream Management Systems.
Proceedings of the 23rd International Conference on Data Engineering, 2007

2006
Discovering Frequent Closed Partial Orders from Strings.
IEEE Trans. Knowl. Data Eng., 2006

Catch the moment: maintaining closed frequent itemsets over a data stream sliding window.
Knowl. Inf. Syst., 2006

Finding global icebergs over distributed data sets.
Proceedings of the Twenty-Fifth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 2006

Suppressing model overfitting in mining concept-drifting data streams.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

LOCI: Load Shedding through Class-Preserving Data Acquisition.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

Fast Relevance Discovery in Time Series.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

A Balanced Ensemble Approach to Weighting Classifiers for Text Classification.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

Dual Labeling: Answering Graph Reachability Queries in Constant Time.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Fast Computation of Reachability Labeling for Large Graphs.
Proceedings of the Advances in Database Technology, 2006

A data stream language and system designed for power and extensibility.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

2005
Demand-driven frequent itemset mining using pattern structures.
Knowl. Inf. Syst., 2005

Preference-Based Frequent Pattern Mining.
Int. J. Data Warehous. Min., 2005

An Improved Biclustering Method for Analyzing Gene Expression Profiles.
Int. J. Artif. Intell. Tools, 2005

Loadstar: Load Shedding in Data Stream Mining.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

A native extension of SQL for mining data streams.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Near-Neighbor Search in Pattern Distance Spaces.
Proceedings of the 2005 SIAM International Conference on Data Mining, 2005

Loadstar: A Load Shedding Scheme for Classifying Data Streams.
Proceedings of the 2005 SIAM International Conference on Data Mining, 2005

A Random Method for Quantifying Changing Distributions in Data Streams.
Proceedings of the Knowledge Discovery in Databases: PKDD 2005, 2005

Pattern-based similarity search for microarray data.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Efficiently Mining Frequent Closed Partial Orders.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Online Mining of Data Streams: Applications, Techniques and Progress.
Proceedings of the 21st International Conference on Data Engineering, 2005

On the Sequencing of Tree Structures for XML Indexing.
Proceedings of the 21st International Conference on Data Engineering, 2005

Stay Current and Relevant in Data Mining Research.
Proceedings of the Database Systems for Advanced Applications, 2005

Compact reachability labeling for graph-structured data.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Mining Data Streams.
Proceedings of the Data Mining and Knowledge Discovery Handbook, 2005

2004
Estimating the Selectivity of XML Path Expression with Predicates by Histograms.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

Query Languages and Data Models for Database Sequences and Data Streams.
Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

A Fast Algorithm for Subspace Clustering by Pattern Similarity.
Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 2004

XSeq: An Index Infrastructure for Tree Pattern Queries.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Active Mining of Data Streams.
Proceedings of the Fourth SIAM International Conference on Data Mining, 2004

Moment: Maintaining Closed Frequent Itemsets over a Stream Sliding Window.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

Mining Extremely Skewed Trading Anomalies.
Proceedings of the Advances in Database Technology, 2004

2003
The Deductive Database System LDL++.
Theory Pract. Log. Program., 2003

Recent Progress on Selected Topics in Database Research - A Report by Nine Young Chinese Researchers Working in the United States.
J. Comput. Sci. Technol., 2003

ATLAS: A Small but Complete SQL Extension for Data Mining and Data Streams.
Proceedings of the 29th International Conference on Very Large Data Bases, 2003

ViST: A Dynamic Index Method for Querying XML Data by Tree Structures.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

Incompleteness of Database Languages for Data Streams and Data Mining: the Problem and the Cure.
Proceedings of the Eleventh Italian Symposium on Advanced Database Systems, 2003

ATLaS: A Native Extension of SQL for Data Mining.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

Mining concept-drifting data streams using ensemble classifiers.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Inductive Learning in Less Than One Sequential Data Scan.
Proceedings of the IJCAI-03, 2003

MaPle: A Fast Algorithm for Maximal Pattern-based Clustering.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

Is random model better? On its accuracy and efficiency.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

Indexing Weighted-Sequences in Large Databases.
Proceedings of the 19th International Conference on Data Engineering, 2003

Enhanced Biclustering on Expression Data.
Proceedings of the 3rd IEEE International Symposium on BioInformatics and BioEngineering (BIBE 2003), 2003

2002
Discovery in multi-attribute data with user-defined constraints.
SIGKDD Explor., 2002

Clustering by pattern similarity in large data sets.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

A Framework for Scalable Cost-sensitive Learning Based on Combining Probabilities and Benefits.
Proceedings of the Second SIAM International Conference on Data Mining, 2002

Mining Associations by Pattern Structure in Large Relational Tables.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

User-directed Exploration of Mining Space with Multiple Attributes.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

Progressive Modeling.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

delta-Clusters: Capturing Subspace Correlation in a Large Data Set.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

The ATLaS System and Its Powerful Database Language Based on Simple Extensions of SQL.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

A Fully Distributed Framework for Cost-Sensitive Data Mining.
Proceedings of the 22nd International Conference on Distributed Computing Systems (ICDCS'02), 2002

Extending SQL for Decision Support Applications.
Proceedings of the Design and Management of Data Warehouses 2002, 2002

An Index Structure for Pattern Similarity Searching in DNA Microarray Data.
Proceedings of the 1st IEEE Computer Society Bioinformatics Conference, 2002

Pruning and Dynamic Scheduling of Cost-Sensitive Ensembles.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2001
The S²-Tree: An Index Structure for Subsequence Matching of Spatial Objects.
Proceedings of the Knowledge Discovery and Data Mining, 2001

SSDT: A Scalable Subspace-Splitting Classifier for Biased Data.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

FARM: A Framework for Exploring Mining Spaces with Multiple Attributes.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

2000
Using SQL to Build New Aggregates and Extenders for Object-Relational Systems.
Proceedings of the VLDB 2000, 2000

CMP: A Fast Decision Tree Classifier Using Multivariate Predictions.
Proceedings of the 16th International Conference on Data Engineering, San Diego, California, USA, February 28, 2000

User Defined Aggregates in Object-Relational Systems.
Proceedings of the 16th International Conference on Data Engineering, San Diego, California, USA, February 28, 2000

Landmarks: a New Model for Similarity-based Pattern Querying in Time Series Databases.
Proceedings of the 16th International Conference on Data Engineering, San Diego, California, USA, February 28, 2000

Database System Extensions for Decision Support: the AXL Approach.
Proceedings of the 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2000

1999
User-Defined Aggregates for Datamining.
Proceedings of the 1999 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 1999

User-Defined Aggregates in Database Languages.
Proceedings of the Research Issues in Structured and Semistructured Database Programming, 1999

Logic-Based User-Defined Aggregates for the Next Generation of Database Systems.
Proceedings of the Logic Programming Paradigm - A 25-Year Perspective, 1999

1998
User Defined Aggregates for Logical Data Languages.
Proceedings of the 6th International Workshop on Deductive Databases and Logic Programming (DDLP'98). In Conjunction with JICSLP'98, 1998

