Mohammed J. Zaki

According to our database1, Mohammed J. Zaki
  • authored at least 184 papers between 1995 and 2017.
  • has a "Dijkstra number"2 of three.

Awards

IEEE Fellow

IEEE Fellow 2017, "For contributions to knowledge discovery and data mining".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2017
Preface: Selected Papers from the Workshop Bioinformatics and Artificial Intelligence Joined with the International Joint Conference on Artificial Intelligence.
Journal of Computational Biology, 2017

KATE: K-Competitive Autoencoder for Text.
CoRR, 2017

Graph Data Mining with Arabesque.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

KATE: K-Competitive Autoencoder for Text.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

2016
A distributed approach for graph mining in massive networks.
Data Min. Knowl. Discov., 2016

Sampling frequent and minimal boolean patterns: theory and application in classification.
Data Min. Knowl. Discov., 2016

A Query-oriented Approach for Relevance in Citation Networks.
Proceedings of the 25th International Conference on World Wide Web, 2016

Investigating bank failures using text mining.
Proceedings of the 2016 IEEE Symposium Series on Computational Intelligence, 2016

Rheem: Enabling Multi-Platform Task Execution.
Proceedings of the 2016 International Conference on Management of Data, 2016

Road to Freedom in Big Data Analytics.
Proceedings of the 19th International Conference on Extending Database Technology, 2016

Parallel graph mining with dynamic load balancing.
Proceedings of the 2016 IEEE International Conference on Big Data, 2016

2015
Knowledge Discovery Using Big Data in Biomedical Systems.
IEEE/ACM Trans. Comput. Biology Bioinform., 2015

Learning sequential classifiers from long and noisy discrete-event sequences efficiently.
Data Min. Knowl. Discov., 2015

Arabesque: A System for Distributed Graph Mining - Extended version.
CoRR, 2015

Arabesque: a system for distributed graph mining.
Proceedings of the 25th Symposium on Operating Systems Principles, 2015

2014
Parallel Graph Mining with GPUs.
Proceedings of the 3rd International Workshop on Big Data, 2014

Reachability Queries in Very Large Graphs: A Fast Refined Online Search Approach.
Proceedings of the 17th International Conference on Extending Database Technology, 2014

Data Mining and Analysis: Fundamental Concepts and Algorithms.
Cambridge University Press, ISBN: 9780521766333, 2014

2013
DAGGER: A Scalable Index for Reachability Queries in Large Dynamic Graphs
CoRR, 2013

Trends in computer science research.
Commun. ACM, 2013

Stochastic subspace search for top-k multi-view clustering.
Proceedings of the 4th MultiClust Workshop on Multiple Clusterings, 2013

ProfileRank: finding relevant content and influential users based on information diffusion.
Proceedings of the 7th Workshop on Social Network Mining and Analysis, 2013

Approximate graph mining with label costs.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Clustering Biological Data.
Data Clustering: Algorithms and Applications, 2013

2012
GRAIL: a scalable index for reachability queries in very large graphs.
VLDB J., 2012

Biological knowledge discovery and data mining.
Scientific Programming, 2012

Effective graph classification based on topological and label attributes.
Statistical Analysis and Data Mining, 2012

Mining Attribute-structure Correlated Patterns in Large Attributed Graphs.
PVLDB, 2012

Graph mining for discovering infrastructure patterns in configuration management databases.
Knowl. Inf. Syst., 2012

Towards a Better Quality Metric for Graph Cluster Evaluation.
JIDM, 2012

BitPath -- Label Order Constrained Reachability Queries over Large Graphs
CoRR, 2012

Mining Attribute-structure Correlated Patterns in Large Attributed Graphs
CoRR, 2012

Sampling minimal frequent boolean (DNF) patterns.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Characterizing the effectiveness of twitter hashtags to detect and track online population sentiment.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2012

2011
SimClus: an effective algorithm for clustering with a lower bound on similarity.
Knowl. Inf. Syst., 2011

Calibrated lazy associative classification.
Inf. Sci., 2011

Data Integration via Constrained Clustering: An Application to Enzyme Clustering.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

ABACUS: Mining Arbitrary Shaped Clusters from Large Datasets based on Backbone Identification.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Is There a Best Quality Metric for Graph Clusters?
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

Infrastructure Pattern Discovery in Configuration Management Databases via Large Sparse Graph Mining.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

A Survey of Link Prediction in Social Networks.
Proceedings of the Social Network Data Analytics, 2011

2010
VOGUE: A variable order hidden Markov model with duration based on frequent sequence mining.
TKDD, 2010

GRAIL: Scalable Reachability Index for Large Graphs.
PVLDB, 2010

Learning Dissimilarities for Categorical Symbols.
Proceedings of the Fourth International Workshop on Feature Selection in Data Mining, 2010

Prism: An effective approach for frequent sequence mining via prime-block encoding.
J. Comput. Syst. Sci., 2010

Mining Frequent Boolean Expressions: Application to Gene Expression and Regulatory Modeling.
IJKDB, 2010

FlexSnap: Flexible Non-sequential Protein Structure Alignment.
Algorithms for Molecular Biology, 2010

Matrix "Bit" loaded: a scalable lightweight join query processor for RDF data.
Proceedings of the 19th International Conference on World Wide Web, 2010

Structural correlation pattern mining for large graphs.
Proceedings of the Eighth Workshop on Mining and Learning with Graphs, 2010

Graph indexing for reachability queries.
Proceedings of the Workshops Proceedings of the 26th International Conference on Data Engineering, 2010

Practical Graph Mining.
Proceedings of the Conceptual Structures: From Information to Intelligence, 2010

Pattern mining : the past, present, and future.
Proceedings of the Extraction et gestion des connaissances (EGC'2010), 2010

Mining Complex Boolean Expressions for Sequential Equivalence Checking.
Proceedings of the 19th IEEE Asian Test Symposium, 2010

2009
Closed Itemset Mining and Non-redundant Association Rule Mining.
Proceedings of the Encyclopedia of Database Systems, 2009

Novel tools to streamline the conference review process: experiences from SIGKDD'09.
SIGKDD Explorations, 2009

Competence-conscious associative classification.
Statistical Analysis and Data Mining, 2009

Output Space Sampling for Graph Patterns.
PVLDB, 2009

Robust partitional clustering by outlier and density insensitive seeding.
Pattern Recognition Letters, 2009

SPARCL: an effective and efficient algorithm for mining arbitrary shape-based clusters.
Knowl. Inf. Syst., 2009

Iterative Non-Sequential protein Structural Alignment.
J. Bioinformatics and Computational Biology, 2009

FlexSnap: Flexible Non-sequential Protein Structure Alignment.
Proceedings of the Algorithms in Bioinformatics, 9th International Workshop, 2009

The Metric Dilemma: Competence-Conscious Associative Classification.
Proceedings of the SIAM International Conference on Data Mining, 2009

MUSK: Uniform Sampling of k Maximal Patterns.
Proceedings of the SIAM International Conference on Data Mining, 2009

Clustering with Lower Bound on Similarity.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

graphOnt: An ontology based library for conversion from semantic graphs to JUNG.
Proceedings of the IEEE International Conference on Intelligence and Security Informatics, 2009

2008
Introduction to special issue on bioinformatics.
TKDD, 2008

Biological data mining.
Scientific Programming, 2008

BIOKDD 2008: a workshop report on data mining in bioinformatics.
SIGKDD Explorations, 2008

Special Issue on the Best Papers of SDM'08.
Statistical Analysis and Data Mining, 2008

ORIGAMI: A Novel and Effective Approach for Mining Representative Orthogonal Graph Patterns.
Statistical Analysis and Data Mining, 2008

PSIST: A scalable approach to indexing protein structures using suffix trees.
J. Parallel Distrib. Comput., 2008

The ParTriCluster Algorithm for Gene Expression Analysis.
International Journal of Parallel Programming, 2008

An integrated, generic approach to pattern mining: data mining template library.
Data Min. Knowl. Discov., 2008

Calibrated Lazy Associative Classification.
Proceedings of the XXIII Simpósio Brasileiro de Banco de Dados, 2008

TRELLIS+: An Effective Approach for Indexing Genome-Scale Sequences Using Suffix Trees.
Proceedings of the Biocomputing 2008, 2008

SPARCL: Efficient and Effective Shape-Based Clustering.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

2007
Clicks: An effective algorithm for mining subspace clusters in categorical datasets.
Data Knowl. Eng., 2007

Data Mining in Bioinformatics (BIOKDD).
Algorithms for Molecular Biology, 2007

Genome-scale disk-based suffix tree indexing.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Data Clustering Paradigms.
Proceedings of the XXII Simpósio Brasileiro de Banco de Dados, 2007

Multi-label Lazy Associative Classification.
Proceedings of the Knowledge Discovery in Databases: PKDD 2007, 2007

Xproj: a framework for projected structural clustering of xml documents.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

ORIGAMI: Mining Representative Orthogonal Graph Patterns.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Prism: A Primal-Encoding Approach for Frequent Sequence Mining.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

2006
BIOKDD06: data mining in Bioinformatics.
SIGKDD Explorations, 2006

What are the grand challenges for data mining?: KDD-2006 panel report.
SIGKDD Explorations, 2006

The Complexity of Finding Top-Toda-Equivalence-Class Members.
Theory Comput. Syst., 2006

XRules: An effective algorithm for structural classification of XML data.
Machine Learning, 2006

Mining Multiple Data Sources: Local Pattern Analysis.
Data Min. Knowl. Discov., 2006

SMOTIF: efficient structured pattern and profile motif search.
Algorithms for Molecular Biology, 2006

EXMOTIF: efficient structured motif extraction.
Algorithms for Molecular Biology, 2006

VOGUE: A Novel Variable Order-Gap State Machine for Modeling Sequences.
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006

BLOSOM: a framework for mining arbitrary boolean expressions.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Is there a grand challenge or X-prize for data mining?
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Finding Hidden Group Structure in a Stream of Communications.
Proceedings of the Intelligence and Security Informatics, 2006

Lazy Associative Classification.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

Multi-evidence, multi-criteria, lazy associative document classification.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

2005
Efficient Algorithms for Mining Closed Itemsets and Their Lattice Structure.
IEEE Trans. Knowl. Data Eng., 2005

Efficiently Mining Frequent Trees in a Forest: Algorithms and Applications.
IEEE Trans. Knowl. Data Eng., 2005

BIOKDD 2005 workshop report.
SIGKDD Explorations, 2005

Open source data mining: workshop report.
SIGKDD Explorations, 2005

SCHISM: a new approach to interesting subspace mining.
IJBIDM, 2005

Efficiently Mining Frequent Embedded Unordered Trees.
Fundam. Inform., 2005

MicroCluster: Efficient Deterministic Biclustering of Microarray Data.
IEEE Intelligent Systems, 2005

GenMax: An Efficient Algorithm for Mining Maximal Frequent Itemsets.
Data Min. Knowl. Discov., 2005

TriCluster: An Effective Algorithm for Mining Coherent Clusters in 3D Microarray Data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Towards Generic Pattern Mining.
Proceedings of the Pattern Recognition and Machine Intelligence, 2005

Reasoning about sets using redescription mining.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

CLICKS: an effective algorithm for mining subspace clusters in categorical datasets.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Distribution-Based Synthetic Database Generation Techniques for Itemset Mining.
Proceedings of the Ninth International Database Engineering and Applications Symposium (IDEAS 2005), 2005

Towards Generic Pattern Mining.
Proceedings of the Formal Concept Analysis, Third International Conference, 2005

CLICKS: Mining Subspace Clusters in Categorical Data via K-partite Maximal Cliques.
Proceedings of the 21st International Conference on Data Engineering, 2005

PSIST: Indexing Protein Structures Using Suffix Trees.
Proceedings of the Fourth International IEEE Computer Society Computational Systems Bioinformatics Conference, 2005

Predicting Protein Folding Pathways.
Proceedings of the Data Mining in Bioinformatics, 2005

Introduction to Data Mining in Bioinformatics.
Proceedings of the Data Mining in Bioinformatics, 2005

2004
Report on BIOKDD04: workshop on data mining in Bioinformatics.
SIGKDD Explorations, 2004

Advances in frequent itemset mining implementations: report on FIMI'03.
SIGKDD Explorations, 2004

Mining Non-Redundant Association Rules.
Data Min. Knowl. Discov., 2004

Visual web mining.
Proceedings of the 13th international conference on World Wide Web, 2004

The Complexity of Finding Top-Toda-Equivalence-Class Members.
Proceedings of the LATIN 2004: Theoretical Informatics, 2004

Predicting protein folding pathways.
Proceedings of the Proceedings Twelfth International Conference on Intelligent Systems for Molecular Biology/Third European Conference on Computational Biology 2004, 2004

SCHISM: A New Approach for Interesting Subspace Mining.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

Topic 17: High Performance Bioinformatics.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Generic Pattern Mining Via Data Mining Template Library.
Proceedings of the Constraint-Based Mining and Inductive Databases, 2004

2003
Mining residue contacts in proteins using local structure predictions.
IEEE Trans. Systems, Man, and Cybernetics, Part B, 2003

Data mining in bioinformatics: report on BIOKDD'03.
SIGKDD Explorations, 2003

A novel approach to determine normal variation in gene expression data.
SIGKDD Explorations, 2003

Special issue on data management in bioinformatics.
Inf. Syst., 2003

Feasible itemset distributions in data mining: theory and application.
Proceedings of the Twenty-Second ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, 2003

Opening Remarks.
Proceedings of the 3nd ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD 2003), 2003

Fast vertical mining using diffsets.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

XRules: an effective structural classifier for XML data.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Improving spatial locality of programs via data mining.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Carpenter: finding closed patterns in long biological datasets.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Advances in Frequent Itemset Mining Implementations: Introduction to FIMI03.
Proceedings of the FIMI '03, 2003

2002
BIOKDD 2002: Recent Advanced in Data Minig for Bioinformatics.
SIGKDD Explorations, 2002

BIOKDD01: Workshop on Data Mining in Bioinformatics.
SIGKDD Explorations, 2002

Editorial: Online, Interactive, and Anytime Data Mining.
SIGKDD Explorations, 2002

Introduction: Recent Developments in Parallel and Distributed Data Mining.
Distributed and Parallel Databases, 2002

CHARM: An Efficient Algorithm for Closed Itemset Mining.
Proceedings of the Second SIAM International Conference on Data Mining, 2002

Mining Frequent Itemsets in Evolving Databases.
Proceedings of the Second SIAM International Conference on Data Mining, 2002

Efficiently Mining Approximate Models of Associations in Evolving Databases.
Proceedings of the Principles of Data Mining and Knowledge Discovery, 2002

Foreword.
Proceedings of the 2nd ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD 2002), 2002

Efficiently mining frequent trees in a forest.
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002

ADMIT: anomaly-based data mining for intrusions.
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002

Mining Protein Contact Maps.
Proceedings of the 2nd ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD 2002), 2002

Performance Mining of Large-Scale Data-Intensive Applications.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Indexing and Data Access Methods for Database Mining.
Proceedings of the 2002 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2002

2001
SPADE: An Efficient Algorithm for Mining Frequent Sequences.
Machine Learning, 2001

Parallel Data Mining for Association Rules on Shared-Memory Systems.
Knowl. Inf. Syst., 2001

Parallel Sequence Mining on Shared-Memory Machines.
J. Parallel Distrib. Comput., 2001

LOGML - XML Language for Web Usage Mining.
Proceedings of the Poster Proceedings of the Tenth International World Wide Web Conference, 2001

Sequence Mining in Categorical Domains: Algorithms and Applications.
Proceedings of the Sequence Learning - Paradigms, Algorithms, and Applications, 2001

LOGML: Log Markup Language for Web Usage Mining.
Proceedings of the WEBKDD 2001, 2001

Efficiently Mining Maximal Frequent Itemsets.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

2000
Scalable Algorithms for Association Mining.
IEEE Trans. Knowl. Data Eng., 2000

KDD-99 Workshop on Large-Scale Parallel KDD Systems.
SIGKDD Explorations, 2000

Systems Support for Scalable Data Mining.
SIGKDD Explorations, 2000

Scalable Feature Mining for Sequential Data.
IEEE Intelligent Systems, 2000

PlanMine: Predicting Plan Failures Using Sequence Mining.
Artif. Intell. Rev., 2000

Generating non-redundant association rules.
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

A Requirements Analysis for Parallel KDD Systems.
Proceedings of the Parallel and Distributed Processing, 2000

Sequence Mining in Categorical Domains: Incorporating Constraints.
Proceedings of the 2000 ACM CIKM International Conference on Information and Knowledge Management, 2000

Mining Residue Contacts in Proteins.
Proceedings of the 1st IEEE International Symposium on Bioinformatics and Biomedical Engineering, 2000

1999
Parallel and distributed association mining: a survey.
IEEE Concurrency, 1999

Parallel Sequence Mining on Shared-Memory Machines.
Proceedings of the Large-Scale Parallel Data Mining, 1999

Parallel and Distributed Data Mining: An Introduction.
Proceedings of the Large-Scale Parallel Data Mining, 1999

Mining Features for Sequence Classification.
Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1999

Parallel Classification for Data Mining on Shared-Memory Multiprocessors.
Proceedings of the 15th International Conference on Data Engineering, 1999

Incremental and Interactive Sequence Mining.
Proceedings of the 1999 ACM CIKM International Conference on Information and Knowledge Management, 1999

1998
PlanMine: Sequence Mining for Plan Failures.
Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98), 1998

Memory Placement Techniques for Parallel Association Mining.
Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98), 1998

Efficient Enumeration of Frequent Sequences.
Proceedings of the 1998 ACM CIKM International Conference on Information and Knowledge Management, 1998

1997
Customized Dynamic Load Balancing for a Network of Workstations.
J. Parallel Distrib. Comput., 1997

Parallel Algorithms for Discovery of Association Rules.
Data Min. Knowl. Discov., 1997

Compile-Time Scheduling Algorithms for a Heterogeneous Network of Workstations.
Comput. J., 1997

A Localized Algorithm for Parallel Association Mining.
SPAA, 1997

Evaluation of Sampling for Data Mining of Association Rules.
Proceedings of the 7th International Workshop on Research Issues in Data Engineering (RIDE '97) High Performance Database Management for Large-Scale Applications, 1997

New Algorithms for Fast Discovery of Association Rules.
Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD-97), 1997

Arithmetic and logic operations with DNA.
Proceedings of the DNA Based Computers, 1997

1996
Compile-time inter-query dependence analysis.
Proceedings of the Eighth IEEE Symposium on Parallel and Distributed Processing, 1996

Parallel Data Mining for Association Rules on Shared-Memory Multi-Processors.
Proceedings of the 1996 ACM/IEEE Conference on Supercomputing, 1996

Customized Dynamic Load Balancing for a Network of Workstations.
Proceedings of the 5th International Symposium on High Performance Distributed Computing (HPDC '96), 1996

1995
Loop Scheduling for Heterogeneity.
Proceedings of the 4th International Symposium on High Performance Distributed Computing (HPDC '95), 1995


  Loading...