Arthur Zimek

Orcid: 0000-0001-7713-4208

Affiliations:
  • University of Southern Denmark, Odense, Denmark
  • Ludwig Maximilian University of Munich, Germany


According to our database, Arthur Zimek authored at least 120 papers between 2004 and 2024.

Bibliography

2024
Dimensionality-Aware Outlier Detection: Theoretical and Experimental Analysis.
CoRR, 2024

2023
Anomaly detection in streaming data: A comparison and evaluation study.
Expert Syst. Appl., December, 2023

On the evaluation of outlier detection and one-class classification: a comparative study of algorithms, model selection, and ensembles.
Data Min. Knowl. Discov., 2023

Sharing is CAIRing: Characterizing Principles and Assessing Properties of Universal Privacy Evaluation for Synthetic Tabular Data.
CoRR, 2023

Explaining text classifiers through progressive neighborhood approximation with realistic samples.
CoRR, 2023

SDOclust: Clustering with Sparse Data Observers.
Proceedings of the Similarity Search and Applications - 16th International Conference, 2023

An Interpretable Measure of Dataset Complexity for Imbalanced Classification Problems.
Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

2022
Similarity-Based Unsupervised Evaluation of Outlier Detection.
Proceedings of the Similarity Search and Applications - 15th International Conference, 2022

Analyzing Passing Sequences for the Prediction of Goal-Scoring Opportunities.
Proceedings of the Machine Learning and Data Mining for Sports Analytics, 2022

Evaluation of Probability Distribution Distance Metrics in Traffic Flow Outlier Detection.
Proceedings of the 23rd IEEE International Conference on Mobile Data Management, 2022

Power of Explanations: Towards automatic debiasing in hate speech detection.
Proceedings of the 9th IEEE International Conference on Data Science and Advanced Analytics, 2022

Unsupervised Representation Learning on Attributed Multiplex Network.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

A Simple Meta-path-free Framework for Heterogeneous Network Embedding.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Clustering refinement.
Int. J. Data Sci. Anal., 2021

Non-parametric Semi-supervised Learning by Bayesian Label Distribution Propagation.
Proceedings of the Similarity Search and Applications - 14th International Conference, 2021

Handling Class Imbalance in k-Nearest Neighbor Classification by Balancing Prior Probabilities.
Proceedings of the Similarity Search and Applications - 14th International Conference, 2021

Detecting Wandering Behavior of People with Dementia.
Proceedings of the 2021 International Conference on Data Mining, 2021

XPROAX-Local explanations for text classification with progressive neighborhood approximation.
Proceedings of the 8th IEEE International Conference on Data Science and Advanced Analytics, 2021

2020
Density-based clustering.
WIREs Data Mining Knowl. Discov., 2020

Internal Evaluation of Unsupervised Outlier Detection.
ACM Trans. Knowl. Discov. Data, 2020

Absolute Cluster Validity.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Correction to: A unified view of density-based methods for semi-supervised clustering and classification.
Data Min. Knowl. Discov., 2020

Call for Special Issue Papers: Evaluation and Experimental Design in Data Mining and Machine Learning.
Big Data, 2020

Explainable Detection of Zero Day Web Attacks.
Proceedings of the 3rd International Conference on Data Intelligence and Security, 2020

Improving Semantic Similarity of Words by Retrofitting Word Vectors in Sense Level.
Proceedings of the 12th International Conference on Agents and Artificial Intelligence, 2020

Matching Research Publications to the United Nations' Sustainable Development Goals by Multi-Label-Learning with Hierarchical Categories.
Proceedings of the 7th IEEE International Conference on Data Science and Advanced Analytics, 2020

Interpretability and Refinement of Clustering.
Proceedings of the 7th IEEE International Conference on Data Science and Advanced Analytics, 2020

2019
A unified view of density-based methods for semi-supervised clustering and classification.
Data Min. Knowl. Discov., 2019

Subspace Determination through Local Intrinsic Dimensional Decomposition: Theory and Experimentation.
CoRR, 2019

ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg".
CoRR, 2019

Outlier detection in graphs: A study on the impact of multiple graph models.
Comput. Sci. Inf. Syst., 2019

MDCGen: Multidimensional Dataset Generator for Clustering.
J. Classif., 2019

Subspace Determination Through Local Intrinsic Dimensional Decomposition.
Proceedings of the Similarity Search and Applications - 12th International Conference, 2019

1st Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning (EDML 2019).
Proceedings of the 1st Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning co-located with SIAM International Conference on Data Mining (SDM 2019), 2019

Are Network Attacks Outliers? A Study of Space Representations and Unsupervised Algorithms.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

2018
Outlier Detection.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Subspace Clustering Techniques.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

There and back again: Outlier detection between statistical reasoning and data mining algorithms.
WIREs Data Mining Knowl. Discov., 2018

Outlier Detection in Urban Traffic Data.
Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics, 2018

Outlier Detection in Graphs: On the Impact of Multiple Graph Models.
Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics, 2018

A unified framework of density-based clustering for semi-supervised classification.
Proceedings of the 30th International Conference on Scientific and Statistical Database Management, 2018

On the Correlation Between Local Intrinsic Dimensionality and Outlierness.
Proceedings of the Similarity Search and Applications - 11th International Conference, 2018

An Unsupervised Boosting Strategy for Outlier Detection Ensembles.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2018

Outlier Detection Based on Low Density Models.
Proceedings of the 2018 IEEE International Conference on Data Mining Workshops, 2018

Outlier Detection in Urban Traffic Flow Distributions.
Proceedings of the IEEE International Conference on Data Mining, 2018

2017
Dimensional Testing for Reverse k-Nearest Neighbor Search.
Proc. VLDB Endow., 2017

The (black) art of runtime evaluation: Are we comparing algorithms or implementations?
Knowl. Inf. Syst., 2017

Good and Bad Neighborhood Approximations for Outlier Detection Ensembles.
Proceedings of the Similarity Search and Applications - 10th International Conference, 2017

Redundancies in Data and their Effect on the Evaluation of Recommendation Systems: A Case Study on the Amazon Reviews Datasets.
Proceedings of the 2017 SIAM International Conference on Data Mining, 2017

2016
MultiClust 2013: Multiple Clusterings, Multiview Data, and Multisource Knowledge-driven Clustering: [Workshop Report].
SIGKDD Explor., 2016

On strategies for building effective ensembles of relative clustering validity criteria.
Knowl. Inf. Syst., 2016

On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study.
Data Min. Knowl. Discov., 2016

On the Evaluation of Outlier Detection and One-Class Classification Methods.
Proceedings of the 2016 IEEE International Conference on Data Science and Advanced Analytics, 2016

2015
Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection.
ACM Trans. Knowl. Discov. Data, 2015

Dimensionality and Scalability II: Hands-On Intrinsic Dimensionality (NII Shonan Meeting 2015-9).
NII Shonan Meet. Rep., 2015

A Framework for Clustering Uncertain Data.
Proc. VLDB Endow., 2015

The blind men and the elephant: on meeting the problem of multiple truths in data from clustering and pattern mining perspectives.
Mach. Learn., 2015

Outlier Detection and Trend Detection: Two Sides of the Same Coin.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

Fast and Scalable Outlier Detection with Approximate Nearest Neighbor Ensembles.
Proceedings of the Database Systems for Advanced Applications, 2015

2014
Local outlier detection reconsidered: a generalized view on locality with applications to spatial, video, and network outlier detection.
Data Min. Knowl. Discov., 2014

Data perturbation for outlier detection ensembles.
Proceedings of the Conference on Scientific and Statistical Database Management, 2014

Generalized Outlier Detection with Flexible Kernel Density Estimates.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Density-Based Clustering Validation.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Representative clustering of uncertain data.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Discriminative features for identifying and interpreting outliers.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Model Selection for Semi-Supervised Clustering.
Proceedings of the 17th International Conference on Extending Database Technology, 2014

Active Learning Strategies for Semi-Supervised DBSCAN.
Proceedings of the Advances in Artificial Intelligence, 2014

Frequent Pattern Mining Algorithms for Data Clustering.
Proceedings of the Frequent Pattern Mining, 2014

2013
Ensembles for unsupervised outlier detection: challenges and research questions a position paper.
SIGKDD Explor., 2013

Dimensionality and Scalability (NII Shonan Meeting 2013-4).
NII Shonan Meet. Rep., 2013

A survey on enhanced subspace clustering.
Data Min. Knowl. Discov., 2013

A framework for semi-supervised and unsupervised optimal extraction of clusters from hierarchies.
Data Min. Knowl. Discov., 2013

Geodetic Distance Queries on R-Trees for Indexing Geographic Data.
Proceedings of the Advances in Spatial and Temporal Databases, 2013

Interactive data mining with 3D-parallel-coordinate-trees.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Subsampling for efficient and effective unsupervised outlier detection ensembles.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Clustering High-Dimensional Data.
Proceedings of the Data Clustering: Algorithms and Applications, 2013

2012
Subspace clustering.
WIREs Data Mining Knowl. Discov., 2012

A survey on unsupervised outlier detection in high-dimensional numerical data.
Stat. Anal. Data Min., 2012

On Evaluation of Outlier Rankings and Outlier Scores.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Density-based Projected Clustering over High Dimensional Data Streams.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Outlier Detection in Arbitrarily Oriented Subspaces.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Evaluation of Clusterings - Metrics and Visual Support.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

2011
Density-based clustering.
WIREs Data Mining Knowl. Discov., 2011

Density Based Subspace Clustering over Dynamic Data.
Proceedings of the Scientific and Statistical Database Management, 2011

Quality of Similarity Rankings in Time Series.
Proceedings of the Advances in Spatial and Temporal Databases, 2011

Spatial Outlier Detection: Data, Algorithms, Visualizations.
Proceedings of the Advances in Spatial and Temporal Databases, 2011

Interpreting and Unifying Outlier Scores.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

When Pattern Met Subspace Cluster.
Proceedings of the 2nd MultiClust Workshop: Discovering, 2011

Evaluation of Multiple Clustering Solutions.
Proceedings of the 2nd MultiClust Workshop: Discovering, 2011

2010
A Study of Hierarchical and Flat Classification of Proteins.
IEEE ACM Trans. Comput. Biol. Bioinform., 2010

Investigating a Correlation between Subcellular Localization and Fold of Proteins.
J. Univers. Comput. Sci., 2010

Can Shared-Neighbor Distances Defeat the Curse of Dimensionality?
Proceedings of the Scientific and Statistical Database Management, 2010

Subspace Similarity Search: Efficient k-NN Queries in Arbitrary Subspaces.
Proceedings of the Scientific and Statistical Database Management, 2010

Towards subspace clustering on dynamic data: an incremental version of PreDeCon.
Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques, 2010

Subspace similarity search using the ideas of ranking and top-k retrieval.
Proceedings of the Workshops Proceedings of the 26th International Conference on Data Engineering, 2010

Similarity Search in Time Series of Dynamical Model-based Systems.
Proceedings of the Database and Expert Systems Applications, 2010

Visual Evaluation of Outlier Detection Models.
Proceedings of the Database Systems for Advanced Applications, 2010

2009
Subspace Clustering Techniques.
Proceedings of the Encyclopedia of Database Systems, 2009

Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering.
ACM Trans. Knowl. Discov. Data, 2009

Correlation clustering.
SIGKDD Explor., 2009

Subspace and projected clustering: experimental evaluation and analysis.
Knowl. Inf. Syst., 2009

Supervised Ensembles of Prediction Methods for Subcellular Localization.
J. Bioinform. Comput. Biol., 2009

ELKI in Time: ELKI 0.2 for the Performance Evaluation of Distance Measures for Time Series.
Proceedings of the Advances in Spatial and Temporal Databases, 2009

Outlier Detection in Axis-Parallel Subspaces of High Dimensional Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

LoOP: local outlier probabilities.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Global Correlation Clustering Based on the Hough Transform.
Stat. Anal. Data Min., 2008

Detecting clusters in moderate-to-high dimensional data: subspace clustering, pattern-based clustering, and correlation clustering.
Proc. VLDB Endow., 2008

A General Framework for Increasing the Robustness of PCA-Based Correlation Clustering Algorithms.
Proceedings of the Scientific and Statistical Database Management, 2008

ELKI: A Software System for Evaluation of Subspace Clustering Algorithms.
Proceedings of the Scientific and Statistical Database Management, 2008

Robust Clustering in Arbitrarily Oriented Subspaces.
Proceedings of the SIAM International Conference on Data Mining, 2008

Angle-based outlier detection in high-dimensional data.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

2007
Future trends in data mining.
Data Min. Knowl. Discov., 2007

On Exploring Complex Relationships of Correlation Clusters.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

Robust, Complete, and Efficient Correlation Clustering.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

Detection and Visualization of Subspace Cluster Hierarchies.
Proceedings of the Advances in Databases: Concepts, 2007

2006
Mining Hierarchies of Correlation Clusters.
Proceedings of the 18th International Conference on Scientific and Statistical Database Management, 2006

Finding Hierarchies of Subspace Clusters.
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006

Deriving quantitative models for correlation clusters.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

COSMIC: Conceptually Specified Multi-Instance Clusters.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

2004
Computing Clusters of Correlation Connected Objects.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

