Erich Schubert

Orcid: 0000-0001-9143-4880

Affiliations:
  • Technical University of Dortmund, Germany
  • Universität Heidelberg, Germany (former)


According to our database1, Erich Schubert authored at least 73 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Medoid Silhouette clustering with automatic cluster number selection.
Inf. Syst., February, 2024

2023
Stop using the elbow criterion for k-means and how to choose the number of clusters instead.
SIGKDD Explor., 2023

Sparse Partitioning Around Medoids.
CoRR, 2023

Data Aggregation for Hierarchical Clustering.
CoRR, 2023

An Alternating Optimization Scheme for Binary Sketches for Cosine Similarity Search.
Proceedings of the Similarity Search and Applications - 16th International Conference, 2023

Accelerating k-Means Clustering with Cover Trees.
Proceedings of the Similarity Search and Applications - 16th International Conference, 2023

Fast k-Nearest-Neighbor-Consistent Clustering.
Proceedings of the Lernen, 2023

Who Did What When? Discovering Complex Historical Interrelations in Immersive Virtual Reality.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2023

2022
Fast k-medoids Clustering in Rust and Python.
J. Open Source Softw., 2022

ABID: Angle Based Intrinsic Dimensionality - Theory and analysis.
Inf. Syst., 2022

BETULA: Fast clustering of large data with improved BIRCH CF-Trees.
Inf. Syst., 2022

EmbAssi: embedding assignment costs for similarity search in large graph databases.
Data Min. Knowl. Discov., 2022

LOSDD: Leave-Out Support Vector Data Description for Outlier Detection.
CoRR, 2022

On Projections to Linear Subspaces.
Proceedings of the Similarity Search and Applications - 15th International Conference, 2022

Automatic Indexing for Similarity Search in ELKI.
Proceedings of the Similarity Search and Applications - 15th International Conference, 2022

Clustering by Direct Optimization of the Medoid Silhouette.
Proceedings of the Similarity Search and Applications - 15th International Conference, 2022

Data Aggregation for Hierarchical Clustering.
Proceedings of the Machine Learning under Resource Constraints - Volume 1: Fundamentals, 2022

Sparse Partitioning Around Medoids.
Proceedings of the Machine Learning under Resource Constraints - Volume 1: Fundamentals, 2022

2021
Fast and eager k-medoids clustering: O(k) runtime improvement of the PAM, CLARA, and CLARANS algorithms.
Inf. Syst., 2021

MESS: Manifold Embedding Motivated Super Sampling.
Proceedings of the Similarity Search and Applications - 14th International Conference, 2021

Accelerating Spherical k-Means.
Proceedings of the Similarity Search and Applications - 14th International Conference, 2021

A Triangle Inequality for Cosine Similarity.
Proceedings of the Similarity Search and Applications - 14th International Conference, 2021

Metric Indexing for Graph Similarity Search.
Proceedings of the Similarity Search and Applications - 14th International Conference, 2021

CANDLE: Classification And Noise Detection With Local Embedding Approximations.
Proceedings of the LWDA 2021 Workshops: FGWM, 2021

HACAM: Hierarchical Agglomerative Clustering Around Medoids - and its Limitations.
Proceedings of the LWDA 2021 Workshops: FGWM, 2021

2020
Call for Special Issue Papers: Evaluation and Experimental Design in Data Mining and Machine Learning.
Big Data, 2020

ABID: Angle Based Intrinsic Dimensionality.
Proceedings of the Similarity Search and Applications - 13th International Conference, 2020

BETULA: Numerically Stable CF-Trees for BIRCH Clustering.
Proceedings of the Similarity Search and Applications - 13th International Conference, 2020

2019
Introduction to Special Issue of the 9th International Conference on Similarity Search and Applications (SISAP 2016).
Inf. Syst., 2019

ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg".
CoRR, 2019

Faster k-Medoids Clustering: Improving the PAM, CLARA, and CLARANS Algorithms.
Proceedings of the Similarity Search and Applications - 12th International Conference, 2019

1st Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning (EDML 2019).
Proceedings of the 1st Workshop on Evaluation and Experimental Design in Data Mining and Machine Learning co-located with SIAM International Conference on Data Mining (SDM 2019), 2019

2018
Outlier Detection.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Numerically stable parallel computation of (co-)variance.
Proceedings of the 30th International Conference on Scientific and Statistical Database Management, 2018

On the Correlation Between Local Intrinsic Dimensionality and Outlierness.
Proceedings of the Similarity Search and Applications - 11th International Conference, 2018

The Relationship of DBSCAN to Matrix Factorization and Spectral Clustering.
Proceedings of the Conference "Lernen, Wissen, Daten, Analysen", 2018

Improving the Cluster Structure Extracted from OPTICS Plots.
Proceedings of the Conference "Lernen, Wissen, Daten, Analysen", 2018

Exploring Significant Interactions in Live News.
Proceedings of the Second International Workshop on Recent Trends in News Information Retrieval co-located with 40th European Conference on Information Retrieval (ECIR 2018), 2018

2017
DBSCAN Revisited, Revisited: Why and How You Should (Still) Use DBSCAN.
ACM Trans. Database Syst., 2017

Dimensional Testing for Reverse k-Nearest Neighbor Search.
Proc. VLDB Endow., 2017

The (black) art of runtime evaluation: Are we comparing algorithms or implementations?
Knowl. Inf. Syst., 2017

Semantic Word Clouds with Background Corpus Normalization and t-distributed Stochastic Neighbor Embedding.
CoRR, 2017

Intrinsic t-Stochastic Neighbor Embedding for Visualization and Outlier Detection - A Remedy Against the Curse of Dimensionality?
Proceedings of the Similarity Search and Applications - 10th International Conference, 2017

Good and Bad Neighborhood Approximations for Outlier Detection Ensembles.
Proceedings of the Similarity Search and Applications - 10th International Conference, 2017

2016
On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study.
Data Min. Knowl. Discov., 2016

SPOTHOT: Scalable Detection of Geo-spatial Events in Large Textual Streams.
Proceedings of the 28th International Conference on Scientific and Statistical Database Management, 2016

2015
A Framework for Clustering Uncertain Data.
Proc. VLDB Endow., 2015

Outlier Detection and Trend Detection: Two Sides of the Same Coin.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

Fast and Scalable Outlier Detection with Approximate Nearest Neighbor Ensembles.
Proceedings of the Database Systems for Advanced Applications, 2015

2014
Local outlier detection reconsidered: a generalized view on locality with applications to spatial, video, and network outlier detection.
Data Min. Knowl. Discov., 2014

Generalized Outlier Detection with Flexible Kernel Density Estimates.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

SigniTrend: scalable detection of emerging topics in textual streams by hashed significance thresholds.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Discriminative features for identifying and interpreting outliers.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

2013
Generalized and efficient outlier detection for spatial, temporal, and high-dimensional data mining.
PhD thesis, 2013

Geodetic Distance Queries on R-Trees for Indexing Geographic Data.
Proceedings of the Advances in Spatial and Temporal Databases, 2013

Interactive data mining with 3D-parallel-coordinate-trees.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

2012
A survey on unsupervised outlier detection in high-dimensional numerical data.
Stat. Anal. Data Min., 2012

On Evaluation of Outlier Rankings and Outlier Scores.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Outlier Detection in Arbitrarily Oriented Subspaces.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Evaluation of Clusterings - Metrics and Visual Support.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

2011
Quality of Similarity Rankings in Time Series.
Proceedings of the Advances in Spatial and Temporal Databases, 2011

Spatial Outlier Detection: Data, Algorithms, Visualizations.
Proceedings of the Advances in Spatial and Temporal Databases, 2011

Interpreting and Unifying Outlier Scores.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Evaluation of Multiple Clustering Solutions.
Proceedings of the 2nd MultiClust Workshop: Discovering, 2011

2010
Can Shared-Neighbor Distances Defeat the Curse of Dimensionality?
Proceedings of the Scientific and Statistical Database Management, 2010

Subspace Similarity Search: Efficient k-NN Queries in Arbitrary Subspaces.
Proceedings of the Scientific and Statistical Database Management, 2010

Subspace similarity search using the ideas of ranking and top-k retrieval.
Proceedings of the Workshops Proceedings of the 26th International Conference on Data Engineering, 2010

Visual Evaluation of Outlier Detection Models.
Proceedings of the Database Systems for Advanced Applications, 2010

2009
ELKI in Time: ELKI 0.2 for the Performance Evaluation of Distance Measures for Time Series.
Proceedings of the Advances in Spatial and Temporal Databases, 2009

Outlier Detection in Axis-Parallel Subspaces of High Dimensional Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

LoOP: local outlier probabilities.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
A General Framework for Increasing the Robustness of PCA-Based Correlation Clustering Algorithms.
Proceedings of the Scientific and Statistical Database Management, 2008

2005
Structure-Preserving Difference Search for XML Documents.
Proceedings of the Extreme Markup Languages® 2005 Conference, 2005


  Loading...