Jian Pei

According to our database1, Jian Pei authored at least 334 papers between 2000 and 2019.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2015, "For contributions to the foundation, methodology and applications of data mining.".

IEEE Fellow

IEEE Fellow 2014, "For contributions to data mining and knowledge discovery".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2019
A Survey on Network Embedding.
IEEE Trans. Knowl. Data Eng., 2019

2018
Association Rules.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

High-Order Proximity Preserved Embedding for Dynamic Networks.
IEEE Trans. Knowl. Data Eng., 2018

Cleaning Crowdsourced Labels Using Oracles For Statistical Classification.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Front Matter.
PVLDB, 2018

Subspace multi-clustering: a review.
Knowl. Inf. Syst., 2018

Online Compact Convexified Factorization Machine.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics.
Proceedings of the 2018 International Conference on Management of Data, 2018

Arbitrary-Order Proximity Preserved Network Embedding.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Sketched Follow-The-Regularized-Leader for Online Factorization Machine.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Exact and Consistent Interpretation for Piecewise Linear Neural Networks: A Closed Form Solution.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Finding Maximal Significant Linear Representation between Long Time Series.
Proceedings of the IEEE International Conference on Data Mining, 2018

Mining Density Contrast Subgraphs.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Skyline Diagram: Finding the Voronoi Counterpart for Skyline Queries.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

TIMERS: Error-Bounded SVD Restart on Dynamic Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Tracking Influential Individuals in Dynamic Networks.
IEEE Trans. Knowl. Data Eng., 2017

Activity Maximization by Effective Information Diffusion in Social Networks.
IEEE Trans. Knowl. Data Eng., 2017

Editorial.
IEEE Trans. Knowl. Data Eng., 2017

Efficient Mining of Regional Movement Patterns in Semantic Trajectories.
PVLDB, 2017

Front Matter.
PVLDB, 2017

Front Matter.
PVLDB, 2017

Front Matter.
PVLDB, 2017

Front Matter.
PVLDB, 2017

Measuring in-network node similarity based on neighborhoods: a unified parametric approach.
Knowl. Inf. Syst., 2017

Multidimensional Business Benchmarking Analysis on Data Warehouses.
IJDWM, 2017

JASIST special issue on biomedical information retrieval.
JASIST, 2017

Multidimensional benchmarking in data warehouses.
Intell. Data Anal., 2017

Preference-driven similarity join.
Proceedings of the International Conference on Web Intelligence, 2017

Schemaless Join for Result Set Preferences.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

Secure Skyline Queries on Cloud Platform.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Principal Patern Mining on Graphs.
Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, Sydney, Australia, July 31, 2017

Community Preserving Network Embedding.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Online Visual Analytics of Text Streams.
IEEE Trans. Vis. Comput. Graph., 2016

EIC Editorial.
IEEE Trans. Knowl. Data Eng., 2016

State of the Journal Editorial.
IEEE Trans. Knowl. Data Eng., 2016

Scalable and Accurate Online Feature Selection for Big Data.
TKDD, 2016

Continuous similarity search for evolving queries.
Knowl. Inf. Syst., 2016

Efficient discovery of contrast subspaces for object explanation and characterization.
Knowl. Inf. Syst., 2016

Preface.
J. Comput. Sci. Technol., 2016

Discovering outlying aspects in large datasets.
Data Min. Knowl. Discov., 2016

Using Computer Intelligence for Depression Diagnosis and Crowdsourcing.
IEEE Computer, 2016

Preface.
Big Data Research, 2016

Continuous Influence Maximization: What Discounts Should We Offer to Social Network Users?
Proceedings of the 2016 International Conference on Management of Data, 2016

Asymmetric Transitivity Preserving Graph Embedding.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

When Social Influence Meets Item Inference.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Finding Gangs in War from Signed Networks.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Finding the minimum spatial keyword cover.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Urban Traffic Prediction through the Second Use of Inexpensive Big Data from Buildings.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Tradeoffs between density and size in extracting dense subgraphs: A unified framework.
Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2016

2015
Editorial.
IEEE Trans. Knowl. Data Eng., 2015

State of the Journal Editorial.
IEEE Trans. Knowl. Data Eng., 2015

Classification with Streaming Features: An Emerging-Pattern Mining Approach.
TKDD, 2015

Proximity-Aware Local-Recoding Anonymization with MapReduce for Scalable Big Data Privacy Preservation in Cloud.
IEEE Trans. Computers, 2015

Finding Pareto Optimal Groups: Group-based Skyline.
PVLDB, 2015

ALID: Scalable Dominant Cluster Detection.
PVLDB, 2015

Preface.
J. Comput. Sci. Technol., 2015

Mining multidimensional contextual outliers from categorical relational data.
Intell. Data Anal., 2015

Mining outlying aspects on numeric data.
Data Min. Knowl. Discov., 2015

Scalable Outlying-Inlying Aspects Discovery via Feature Ranking.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

Reliable Early Classification on Multivariate Time Series with Numerical and Categorical Attributes.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Tornado Forecasting with Multiple Markov Boundaries.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Finding Multiple Stable Clusterings.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Cleaning structured event logs: A graph repair approach.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Mining Frequent Co-occurrence Patterns across Multiple Data Streams.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

Efficiently Computing Top-K Shortest Path Join.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

2014
Mining most frequently changing component in evolving graphs.
World Wide Web, 2014

Malicious URL detection by dynamically mining patterns without pre-defined elements.
World Wide Web, 2014

Consensus-Based Ranking of Multivalued Objects: A Generalized Borda Count Approach.
IEEE Trans. Knowl. Data Eng., 2014

EIC Editorial.
IEEE Trans. Knowl. Data Eng., 2014

Editorial [State of the Transactions].
IEEE Trans. Knowl. Data Eng., 2014

Email mining: tasks, common techniques, and tools.
Knowl. Inf. Syst., 2014

A spatiotemporal compression based approach for efficient big data processing on Cloud.
J. Comput. Syst. Sci., 2014

Managing Data-Intensive Applications in the Cloud.
IEEE Computer, 2014

Shortest Unique Queries on Strings.
Proceedings of the String Processing and Information Retrieval, 2014

Efficient Matching of Substrings in Uncertain Sequences.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

How Can I Index My Thousands of Photos Effectively and Automatically? An Unsupervised Feature Selection Approach.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

An Iterative Fusion Approach to Graph-Based Semi-Supervised Learning from Multiple Views.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2014

Mining Contrast Subspaces.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2014

Structure-Aware Distance Measures for Comparing Clusterings in Graphs.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2014

Distance metric learning using dropout: a structured regularization approach.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Towards Scalable and Accurate Online Feature Selection for Big Data.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

SNOC: Streaming Network Node Classification.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

An Appliance-Driven Approach to Detection of Corrupted Load Curve Data.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Within-Network Classification Using Radius-Constrained Neighborhood Patterns.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Do neighbor buddies make a difference in reblog likelihood? An analysis on SINA Weibo data.
Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2014

Pattern-Growth Methods.
Proceedings of the Frequent Pattern Mining, 2014

2013
Finding email correspondents in online social networks.
World Wide Web, 2013

A vlHMM approach to context-aware search.
TWEB, 2013

Editorial [2012 & 2013 Associate Editors].
IEEE Trans. Knowl. Data Eng., 2013

Clustering Uncertain Data Based on Probability Distribution Similarity.
IEEE Trans. Knowl. Data Eng., 2013

Introduction to the Special Issue ACM SIGKDD 2012.
TKDD, 2013

Mining search and browse logs for web search: A Survey.
ACM TIST, 2013

More is Simpler: Effectively and Efficiently Assessing Node-Pair Similarities Based on Hyperlinks.
PVLDB, 2013

A Data-adaptive and Dynamic Segmentation Index for Whole Matching on Time Series.
PVLDB, 2013

Skyline distance: a measure of multidimensional competence.
Knowl. Inf. Syst., 2013

Recommendations for two-way selections using skyline view queries.
Knowl. Inf. Syst., 2013

What distinguish one from its peers in social networks?
Data Min. Knowl. Discov., 2013

Mining multidimensional contextual outliers from categorical relational data.
Proceedings of the Conference on Scientific and Statistical Database Management, 2013

Some New Progress in Analyzing and Mining Uncertain and Probabilistic Data for Big Data Analytics.
Proceedings of the Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, 2013

Parallel field alignment for cross media retrieval.
Proceedings of the ACM Multimedia Conference, 2013

Price Information Patterns in Web Search Advertising: An Empirical Case Study on Accommodation Industry.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Mining Statistically Significant Sequential Patterns.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Mining Probabilistic Frequent Spatio-Temporal Sequential Patterns with Gap Constraints from Uncertain Databases.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

On shortest unique substring queries.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Towards Cohesive Anomaly Mining.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays.
IEEE Trans. Parallel Distrib. Syst., 2012

Aggregate keyword search on large relational databases.
Knowl. Inf. Syst., 2012

Early classification on time series.
Knowl. Inf. Syst., 2012

Probabilistic skylines on uncertain data: model and bounding-pruning-refining methods.
J. Intell. Inf. Syst., 2012

Efficient and Effective Aggregate Keyword Search on Relational Databases.
IJDWM, 2012

Clustering in applications with multiple data sources - A mutual subspace clustering approach.
Neurocomputing, 2012

Top-10 Data Mining Case Studies.
International Journal of Information Technology and Decision Making, 2012

Multi-level relationship outlier detection.
IJBIDM, 2012

Mining query subtopics from search log data.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

A practical method for estimating performance degradation on multicore processors, and its application to HPC workloads.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Community Preserving Lossy Compression of Social Networks.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Random Error Reduction in Similarity Search on Time Series: A Statistical Approach.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Aggregate queries on probabilistic record linkages.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

On compressing weighted time-evolving graphs.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Ranking Queries on Uncertain Data
Advances in Database Systems 42, Kluwer, ISBN: 978-1-4419-9379-3, 2011

Ranking queries on uncertain data.
VLDB J., 2011

Can the Utility of Anonymized Data be Used for Privacy Breaches?
TKDD, 2011

Mining Concept Sequences from Large-Scale Search Logs for Context-Aware Query Suggestion.
ACM TIST, 2011

On Pruning for Top-K Ranking in Uncertain Databases.
PVLDB, 2011

Best papers from the Fifth International Conference on Advanced Data Mining and Applications (ADMA 2009).
Knowl. Inf. Syst., 2011

The k-anonymity and l-diversity approaches for privacy preservation in social networks against neighborhood attacks.
Knowl. Inf. Syst., 2011

Ranking uncertain sky: The probabilistic top-k skyline operator.
Inf. Syst., 2011

Publishing anonymous survey rating data.
Data Min. Knowl. Discov., 2011

Multidimensional mining of large-scale search logs: a topic-concept cube approach.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

Citation recommendation without author supervision.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

On k-skip shortest paths.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Privacy-aware data management in information networks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Enhancing web search by mining search and browse logs.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Extracting Interpretable Features for Early Classification on Time Series.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Towards bounding sequential patterns.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Outlier detection on uncertain data: Objects, instances, and inferences.
Proceedings of the 27th International Conference on Data Engineering, 2011

Early Classification on Temporal Sequences.
Proceedings of the Extraction et gestion des connaissances (EGC'2011), 2011

Data Mining: Concepts and Techniques, 3rd edition
Morgan Kaufmann, ISBN: 978-0123814791, 2011

2010
Mining discriminative items in multiple data streams.
World Wide Web, 2010

Threshold-based probabilistic top-k dominating queries.
VLDB J., 2010

Superseding Nearest Neighbor Search on Uncertain Spatial Databases.
IEEE Trans. Knowl. Data Eng., 2010

Probabilistic Reverse Nearest Neighbor Queries on Uncertain Data.
IEEE Trans. Knowl. Data Eng., 2010

A brief survey on sequence classification.
SIGKDD Explorations, 2010

Special issue on the best papers of SDM'10.
Statistical Analysis and Data Mining, 2010

Computing Closed Skycubes.
PVLDB, 2010

A binary decision diagram based approach for mining frequent subsequences.
Knowl. Inf. Syst., 2010

Exploring Disease Association from the NHANES Data: Data Mining, Pattern Summarization, and Visual Analytics.
IJDWM, 2010

Towards Progressive and Load Balancing Distributed Computation: A Case Study on Skyline Analysis.
J. Comput. Sci. Technol., 2010

Document clustering of scientific texts using citation contexts.
Inf. Retr., 2010

Web search/browse log mining: challenges, methods, and applications.
Proceedings of the 19th International Conference on World Wide Web, 2010

Context-aware citation recommendation.
Proceedings of the 19th International Conference on World Wide Web, 2010

Logging every footstep: quantile summaries for the entire history.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Context-aware ranking in web search.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Search and browse log mining for web information retrieval: challenges, methods, and applications.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Neighbor query friendly compression of social networks.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

Probabilistic Inference Protection on Anonymized Data.
Proceedings of the ICDM 2010, 2010

Correlation hiding by independence masking.
Proceedings of the 26th International Conference on Data Engineering, 2010

Probabilistic path queries in road networks: traffic uncertainty aware path selection.
Proceedings of the EDBT 2010, 2010

2009
Association Rules.
Proceedings of the Encyclopedia of Database Systems, 2009

Top-k typicality queries and efficient query answering methods on large databases.
VLDB J., 2009

Anonymization-based attacks in privacy-preserving data publishing.
ACM Trans. Database Syst., 2009

Online Skyline Analysis with Dynamic Preferences on Nominal Attributes.
IEEE Trans. Knowl. Data Eng., 2009

Continuous K-Means Monitoring with Low Reporting Cost in Sensor Networks.
IEEE Trans. Knowl. Data Eng., 2009

Link spam target detection using page farms.
TKDD, 2009

Mining frequent cross-graph quasi-cliques.
TKDD, 2009

Summary of the first ACM SIGKDD workshop on knowledge discovery from uncertain data (U'09).
SIGKDD Explorations, 2009

PADS: a simple yet effective pattern-aware dynamic search method for fast maximal frequent pattern mining.
Knowl. Inf. Syst., 2009

Continuously monitoring top-k uncertain data streams: a probabilistic threshold method.
Distributed and Parallel Databases, 2009

OrthoClusterDB: an online platform for synteny blocks.
BMC Bioinformatics, 2009

News article extraction with template-independent wrapper.
Proceedings of the 18th International Conference on World Wide Web, 2009

Towards context-aware search by learning a very large variable length hidden markov model from search logs.
Proceedings of the 18th International Conference on World Wide Web, 2009

MobileMiner: a real world case study of data mining in mobile communication.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

Understanding Importance of Collaborations in Co-authorship Networks: A Supportiveness Analysis Approach.
Proceedings of the SIAM International Conference on Data Mining, 2009

Debt Detection in Social Security by Sequence Classification Using Both Positive and Negative Patterns.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Hierarchical Distributed Data Classification in Wireless Sensor Networks.
Proceedings of the IEEE 6th International Conference on Mobile Adhoc and Sensor Systems, 2009

OLAP on search logs: an infrastructure supporting data-driven applications in search engines.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Can we learn a template-independent wrapper for news article extraction from a single training site?
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Early Prediction on Time Series: A Nearest Neighbor Approach.
Proceedings of the IJCAI 2009, 2009

Distance-Based Representative Skyline.
Proceedings of the 25th International Conference on Data Engineering, 2009

Privacy Preserving Publishing on Multiple Quasi-identifiers.
Proceedings of the 25th International Conference on Data Engineering, 2009

Online Interval Skyline Queries on Time Series.
Proceedings of the 25th International Conference on Data Engineering, 2009

Answering aggregate keyword queries on relational databases using minimal group-bys.
Proceedings of the EDBT 2009, 2009

Continuous privacy preserving publishing of data streams.
Proceedings of the EDBT 2009, 2009

Efficiently indexing shortest paths by exploiting symmetry in graphs.
Proceedings of the EDBT 2009, 2009

Personalizing entity detection and recommendation with a fusion of web log mining techniques.
Proceedings of the EDBT 2009, 2009

MAPO: Mining and Recommending API Usage Patterns.
Proceedings of the ECOOP 2009, 2009

Detecting topic evolution in scientific literature: how can citations help?
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Towards Web Search Engine Scale Data Mining.
Proceedings of the Eighth Australasian Data Mining Conference, AusDM 2009, Melbourne, 2009

2008
Privacy-Preserving Data Stream Classification.
Proceedings of the Privacy-Preserving Data Mining - Models and Algorithms, 2008

A Survey of Utility-based Privacy-Preserving Data Transformation Methods.
Proceedings of the Privacy-Preserving Data Mining - Models and Algorithms, 2008

Anonymization by Local Recoding in Data with Attribute Hierarchical Taxonomies.
IEEE Trans. Knowl. Data Eng., 2008

A brief survey on anonymization techniques for privacy preserving publishing of social network data.
SIGKDD Explorations, 2008

Advances in information and knowledge management.
SIGIR Forum, 2008

Efficient skyline querying with variable user preferences on nominal attributes.
PVLDB, 2008

Clustering by Pattern Similarity.
J. Comput. Sci. Technol., 2008

Managing Uncertain Data: Probabilistic Approaches.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

PLEDS: A Personalized Entity Detection System Based on Web Log Mining Techniques.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

Query answering techniques on uncertain and probabilistic data: tutorial summary.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Ranking queries on uncertain data: a probabilistic threshold approach.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

DiMaC: a system for cleaning disguised missing data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

A Spamicity Approach to Web Spam Detection.
Proceedings of the SIAM International Conference on Data Mining, 2008

Mining Sequence Classifiers for Early Prediction.
Proceedings of the SIAM International Conference on Data Mining, 2008

Fast and quality-guaranteed data streaming in resource-constrained sensor networks.
Proceedings of the 9th ACM Interational Symposium on Mobile Ad Hoc Networking and Computing, 2008

Mining preferences from superior and inferior examples.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

DiMaC: a disguised missing data cleaning tool.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Context-aware query suggestion by mining click-through and session data.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Publishing Sensitive Transactions for Itemset Utility.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Preserving Privacy in Social Networks Against Neighborhood Attacks.
Proceedings of the 24th International Conference on Data Engineering, 2008

Efficiently Answering Probabilistic Threshold Top-k Queries on Uncertain Data.
Proceedings of the 24th International Conference on Data Engineering, 2008

OrthoCluster: a new tool for mining synteny blocks and applications in comparative genomics.
Proceedings of the EDBT 2008, 2008

Anonymity for continuous data publishing.
Proceedings of the EDBT 2008, 2008

2007
Multi-Dimensional Analysis of Data Streams Using Stream Cubes.
Proceedings of the Data Streams - Models and Algorithms, 2007

Sequence Data Mining
Advances in Database Systems 33, Kluwer, ISBN: 978-0-387-69936-3, 2007

An Energy-Efficient Data Collection Framework for Wireless Sensor Networks by Exploiting Spatiotemporal Correlation.
IEEE Trans. Parallel Distrib. Syst., 2007

Efficient Skyline and Top-k Retrieval in Subspaces.
IEEE Trans. Knowl. Data Eng., 2007

Introduction to the special issue on data mining for health informatics.
SIGKDD Explorations, 2007

Mining gene-sample-time microarray data: a coherent gene cluster discovery approach.
Knowl. Inf. Syst., 2007

Answering ad hoc aggregate queries from data streams using prefix aggregate trees.
Knowl. Inf. Syst., 2007

Constraint-based sequential pattern mining: the pattern-growth methods.
J. Intell. Inf. Syst., 2007

Active Rules Termination Analysis Through Conditional Formula Containing Updatable Variable.
Proceedings of the Advances in Data and Web Management, 2007

(alpha, k)-anonymity Based Privacy Preservation by Lossy Join.
Proceedings of the Advances in Data and Web Management, 2007

Minimality Attack in Privacy Preserving Data Publishing.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Probabilistic Skylines on Uncertain Data.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Efficiently Answering Top-k Typicality Queries on Large Databases.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Maintaining K-Anonymity against Incremental Updates.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

Mining API patterns as partial orders from source code: from usage scenarios to specifications.
Proceedings of the 6th joint meeting of the European Software Engineering Conference and the ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2007

Sketching Landscapes of Page Farms.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

WAT: Finding Top-K Discords in Time Series Database.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays.
Proceedings of the Fifth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom 2007), 2007

Mining favorable facets.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Cleaning disguised missing data: a heuristic approach.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Mining Software Engineering Data.
Proceedings of the 29th International Conference on Software Engineering (ICSE 2007), 2007

Computing Compressed Multidimensional Skyline Cubes Efficiently.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Ix-cubes: iceberg cubes for data warehousing and olap on xml data.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

TS-Trees: A Non-Alterable Search Tree Index for Trustworthy Databases on Write-Once-Read-Many (WORM) Storage.
Proceedings of the 21st International Conference on Advanced Information Networking and Applications (AINA 2007), 2007

2006
Towards multidimensional subspace skyline analysis.
ACM Trans. Database Syst., 2006

Closed Constrained Gradient Mining in Retail Databases.
IEEE Trans. Knowl. Data Eng., 2006

Discovering Frequent Closed Partial Orders from Strings.
IEEE Trans. Knowl. Data Eng., 2006

Regression Cubes with Lossless Compression and Aggregation.
IEEE Trans. Knowl. Data Eng., 2006

Utility-based anonymization for privacy preservation with less information loss.
SIGKDD Explorations, 2006

Mining changing regions from access-constrained snapshots: a cluster-embedded decision tree approach.
J. Intell. Inf. Syst., 2006

Mining Co-Location Patterns with Rare Events from Spatial Data Sets.
GeoInformatica, 2006

An Erratum on "Pushing Convertible Constraints in Frequent Itemset Mining".
Data Min. Knowl. Discov., 2006

Using High Dimensional Indexes to Support Relevance Feedback Based Interactive Images Retrival.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

MAPO: mining API usages from open source repositories.
Proceedings of the 2006 International Workshop on Mining Software Repositories, 2006

Utility-based anonymization using local recoding.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Suppressing model overfitting in mining concept-drifting data streams.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

On privacy preservation against adversarial data mining.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

An effective approach to entity resolution problem using quasi-clique and its application to digital libraries.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2006

Improving Grouped-Entity Resolution Using Quasi-Cliques.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

SUBSKY: Efficient Computation of Skylines in Subspaces.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Granularity Adaptive Density Estimation and on Demand Clustering of Concept-Drifting Data Streams.
Proceedings of the Data Warehousing and Knowledge Discovery, 8th International Conference, 2006

Achieving k-Anonymity by Clustering in Attribute Hierarchical Structures.
Proceedings of the Data Warehousing and Knowledge Discovery, 8th International Conference, 2006

Classification spanning correlated data streams.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

Minimum Description Length Principle: Generators Are Preferable to Closed Patterns.
Proceedings of the Proceedings, 2006

2005
An Interactive Approach to Mining Gene Expression Data.
IEEE Trans. Knowl. Data Eng., 2005

Preference-Based Frequent Pattern Mining.
IJDWM, 2005

Book Review on "Out of Their Minds: The Lives and Discoveries of 15 Great Computer Scientists".
J. Comput. Sci. Technol., 2005

Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams.
Distributed and Parallel Databases, 2005

A Stratification-Based Approach to Accurate and Fast Image Annotation.
Proceedings of the Advances in Web-Age Information Management, 2005

Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Mining Most General Multidimensional Summarization of Probably Groups in Data Warehouses.
Proceedings of the 17th International Conference on Scientific and Statistical Database Management, 2005

GraphMiner: a structural pattern-mining system for large disk-based graph databases and its applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

A dynamic clustering and scheduling approach to energy saving in data collection from wireless sensor networks.
Proceedings of the Second Annual IEEE Communications Society Conference on Sensor, 2005

Cross Table Cubing: Mining Iceberg Cubes from Data Warehouses.
Proceedings of the 2005 SIAM International Conference on Data Mining, 2005

A Random Method for Quantifying Changing Distributions in Data Streams.
Proceedings of the Knowledge Discovery in Databases: PKDD 2005, 2005

Pattern-based similarity search for microarray data.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

On mining cross-graph quasi-cliques.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Efficiently Mining Frequent Closed Partial Orders.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Online Mining of Data Streams: Applications, Techniques and Progress.
Proceedings of the 21st International Conference on Data Engineering, 2005

Mining Cross-Graph Quasi-Cliques in Gene Expression and Protein Interaction Data.
Proceedings of the 21st International Conference on Data Engineering, 2005

A General Approach to Mining Quality Pattern-Based Clusters from Microarray Data.
Proceedings of the Database Systems for Advanced Applications, 2005

Mining Succinct Systems of Minimal Generators of Formal Concepts.
Proceedings of the Database Systems for Advanced Applications, 2005

2004
Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach.
IEEE Trans. Knowl. Data Eng., 2004

Mining Constrained Gradients in Large Databases.
IEEE Trans. Knowl. Data Eng., 2004

Mining Condensed Frequent-Pattern Bases.
Knowl. Inf. Syst., 2004

From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach.
J. Comput. Sci. Technol., 2004

Pushing Convertible Constraints in Frequent Itemset Mining.
Data Min. Knowl. Discov., 2004

Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach.
Data Min. Knowl. Discov., 2004

GPX: Interactive Mining of Gene Expression Data.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, Toronto, Canada, August 31, 2004

A Fast Algorithm for Subspace Clustering by Pattern Similarity.
Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 2004

Efficient Pattern-Growth Methods for Frequent Tree Pattern Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2004

Scalable mining of large disk-based graph databases.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Mining coherent gene clusters from gene-sample-time microarray data.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

A rank sum test method for informative gene discovery.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Data Mining for Intrusion Detection: Techniques, Applications and Systems.
Proceedings of the 20th International Conference on Data Engineering, 2004

Preface to CoMWIM 2004.
Proceedings of the Conceptual Modeling for Advanced Application Domains, 2004


2003
Towards interactive exploration of gene expression patterns.
SIGKDD Explorations, 2003

Recent Progress on Selected Topics in Database Research - A Report by Nine Young Chinese Researchers Working in the United States.
J. Comput. Sci. Technol., 2003

Efficacious Data Cube Exploration by Semantic Summarization and Compression.
Proceedings of the VLDB 2003, 2003

SOCQET: Semantic OLAP with Compressed Cube and Summarization.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

QC-Trees: An Efficient Summary Structure for Semantic OLAP.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

ApproxMAP: Approximate Mining of Consensus Sequential Patterns.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

Mining Confident Colocation Rules without A Support Threshold.
Proceedings of the 2003 ACM Symposium on Applied Computing (SAC), 2003

CLOSET+: searching for the best strategies for mining frequent closed itemsets.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Mining phenotypes and informative genes from gene expression data.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Interactive exploration of coherent patterns in time-series gene expression data.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

MaPle: A Fast Algorithm for Maximal Pattern-based Clustering.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

A General Model for Online Analytical Processing of Complex Data.
Proceedings of the Conceptual Modeling, 2003

DHC: A Density-Based Hierarchical Clustering Method for Time Series Gene Expression Dat.
Proceedings of the 3rd IEEE International Symposium on BioInformatics and BioEngineering (BIBE 2003), 2003

2002
Constrained frequent pattern mining: a pattern-growth view.
SIGKDD Explorations, 2002

Quotient Cube: How to Summarize the Semantics of a Data Cube.
Proceedings of the VLDB 2002, 2002

COMMIX: towards effective web information extraction, integration and query answering.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

CubeExplorer: online exploration of data cubes.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

On Computing Condensed Frequent Pattern Bases.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

Online Analytical Processing Stream Data: Is It Feasible?
Proceedings of the 2002 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2002

Mining sequential patterns with constraints in large databases.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002

2001
Mining Multi-Dimensional Constrained Gradients in Data Cubes.
Proceedings of the VLDB 2001, 2001

DNA-Miner: A System Prototype for Mining DNA Sequences.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Efficient Computation of Iceberg Cubes with Complex Measures.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Scalable frequent-pattern mining methods: an overview.
KDD Tutorials, 2001

H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth.
Proceedings of the 17th International Conference on Data Engineering, 2001

Mining Frequent Item Sets with Convertible Constraints.
Proceedings of the 17th International Conference on Data Engineering, 2001

Fault-Tolerant Frequent Pattern Mining: Problems and Challenges.
Proceedings of the 2001 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2001

Multi-Dimensional Sequential Pattern Mining.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

2000
Mining Frequent Patterns by Pattern-Growth: Methodology and Implications.
SIGKDD Explorations, 2000

Towards Data Mining Benchmarking: A Testbed for Performance Study of Frequent Pattern Mining.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

Mining Frequent Patterns without Candidate Generation.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

Mining Access Patterns Efficiently from Web Logs.
Proceedings of the Knowledge Discovery and Data Mining, 2000

Can we push more constraints into frequent pattern mining?
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

FreeSpan: frequent pattern-projected sequential pattern mining.
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets.
Proceedings of the 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2000


  Loading...