Anne-Laure Boulesteix

Ben Van Calster

CoRR, 2024

2023

A white paper on good research practices in benchmarking: The case of cluster analysis.

[BibT_eX]

[DOI]

Iven Van Mechelen

WIREs Data. Mining. Knowl. Discov., November, 2023

Over-optimistic evaluation and reporting of novel cluster algorithms: an illustrative study.

[BibT_eX]

[DOI]

Theresa Ullmann

Anna Beer

Maximilian Hünemörder

Thomas Seidl

Adv. Data Anal. Classif., March, 2023

Over-optimism in unsupervised microbiome analysis: Insights from network learning and clustering.

[BibT_eX]

[DOI]

Theresa Ullmann

Stefanie Peschel

Philipp F. M. Baumann

Christian L. Müller

PLoS Comput. Biol., January, 2023

Hyperparameter optimization: Foundations, algorithms, best practices, and open challenges.

[BibT_eX]

[DOI]

Difan Deng

Marius Lindauer

WIREs Data. Mining. Knowl. Discov., 2023

Evaluating machine learning models in non-standard settings: An overview and new findings.

[BibT_eX]

[DOI]

CoRR, 2023

Prediction approaches for partly missing multi-omics covariate data: A literature review and an empirical comparison study.

[BibT_eX]

[DOI]

Frederik Ludwigs

Jonas Hagenberg

CoRR, 2023

2022

Validation of cluster analysis results on validation data: A systematic framework.

[BibT_eX]

[DOI]

Theresa Ullmann

Christian Hennig

WIREs Data Mining Knowl. Discov., 2022

Over-optimism in benchmark studies and the multiplicity of design and analysis options when interpreting their results.

[BibT_eX]

[DOI]

WIREs Data Mining Knowl. Discov., 2022

Interaction forests: Identifying and exploiting interpretable quantitative and qualitative interaction effects.

[BibT_eX]

[DOI]

Comput. Stat. Data Anal., 2022

2021

Improved Outcome Prediction Across Data Sources Through Robust Parameter Tuning.

[BibT_eX]

[DOI]

Nicole Ellenbach

Bernd Bischl

Kristian Unger

J. Classif., 2021

NetCoMi: network construction and comparison for microbiome data in R.

[BibT_eX]

[DOI]

Stefanie Peschel

Christian L. Müller

Erika von Mutius

Martin Depner

Briefings Bioinform., 2021

Large-scale benchmark study of survival prediction methods using multi-omics data.

[BibT_eX]

[DOI]

Briefings Bioinform., 2021

2020

Combining clinical and molecular data in regression prediction models: insights from a simulation study.

[BibT_eX]

[DOI]

Axel Benner

Natalia Becker

Willi Sauerbrei

Briefings Bioinform., 2020

2019

Hyperparameters and tuning strategies for random forest.

[BibT_eX]

[DOI]

Marvin N. Wright

WIREs Data Mining Knowl. Discov., 2019

Tunability: Importance of Hyperparameters of Machine Learning Algorithms.

[BibT_eX]

[DOI]

Bernd Bischl

J. Mach. Learn. Res., 2019

2018

On the choice and influence of the number of boosting steps for high-dimensional linear Cox-models.

[BibT_eX]

[DOI]

Heidi Seibold

Comput. Stat., 2018

Priority-Lasso: a simple hierarchical approach to the prediction of clinical outcome using multi-omics data.

[BibT_eX]

[DOI]

BMC Bioinform., 2018

Random forest versus logistic regression: a large-scale benchmark experiment.

[BibT_eX]

[DOI]

Raphaël Couronné

BMC Bioinform., 2018

A computationally fast variable importance test for random forests for high-dimensional data.

[BibT_eX]

[DOI]

Ender Celik

Adv. Data Anal. Classif., 2018

2017

To Tune or Not to Tune the Number of Trees in Random Forest.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2017

Detection of influential points as a byproduct of resampling-based variable selection procedures.

[BibT_eX]

[DOI]

Willi Sauerbrei

Comput. Stat. Data Anal., 2017

IPF-LASSO: Integrative L<sup>1</sup>-Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data.

[BibT_eX]

[DOI]

Xiaoyu Jiang

Mathias Fuchs

Comput. Math. Methods Medicine, 2017

Improving cross-study prediction through addon batch effect adjustment or addon normalization.

[BibT_eX]

[DOI]

David Causeur

Bioinform., 2017

2016

Random forest for ordinal responses: Prediction and variable selection.

[BibT_eX]

[DOI]

Gerhard Tutz

Comput. Stat. Data Anal., 2016

Combining location-and-scale batch effect adjustment with data cleaning by latent factor adjustment.

[BibT_eX]

[DOI]

David Causeur

BMC Bioinform., 2016

2015

Ten Simple Rules for Reducing Overoptimistic Reporting in Methodological Computational Research.

[BibT_eX]

[DOI]

PLoS Comput. Biol., 2015

Letter to the Editor: On the term 'interaction' and related phrases in the literature on Random Forests.

[BibT_eX]

[DOI]

Alexander Hapfelmeier

Kristel Van Steen

Briefings Bioinform., 2015

Letter to the Editor: On Reviews and Papers on New Methods.

[BibT_eX]

[DOI]

Briefings Bioinform., 2015

2014

Cross-study validation for the assessment of prediction algorithms.

[BibT_eX]

[DOI]

Markus Riester

Bioinform., 2014

2013

On the Simultaneous Analysis of Clinical and Omics Data: A Comparison of Globalboosttest and Pre-validation Techniques.

[BibT_eX]

[DOI]

Margret-Ruth Oelker

Proceedings of the Statistical Models for Data Analysis, 2013

Complexity Selection with Cross-validation for Lasso and Sparse Partial Least Squares Using High-Dimensional Data.

[BibT_eX]

[DOI]

Adrian Richter

Proceedings of the Algorithms from and for Nature and Life, 2013

An AUC-based permutation variable importance measure for random forests.

[BibT_eX]

[DOI]

BMC Bioinform., 2013

On representative and illustrative comparisons with real data in bioinformatics: response to the letter to the editor by Smith <i>et al.</i>.

[BibT_eX]

[DOI]

Bioinform., 2013

2012

Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics.

[BibT_eX]

[DOI]

Jochen Kruppa

Inke R. König

WIREs Data Mining Knowl. Discov., 2012

A Plea for Neutral Comparison Studies in Computational Sciences

[BibT_eX]

[DOI]

Manuel J. A. Eugster

CoRR, 2012

Random forest Gini importance favours SNPs with large minor allele frequency: impact, sources and recommendations.

[BibT_eX]

[DOI]

Andreas Bender

Justo Lorenzo Bermejo

Briefings Bioinform., 2012

2011

Added predictive value of high-throughput molecular data to clinical data and its validation.

[BibT_eX]

[DOI]

Willi Sauerbrei

Briefings Bioinform., 2011

Editorial.

[BibT_eX]

[DOI]

Briefings Bioinform., 2011

2010

Testing the additional predictive value of high-dimensional molecular data.

[BibT_eX]

[DOI]

Torsten Hothorn

BMC Bioinform., 2010

Over-optimism in bioinformatics: an illustration.

[BibT_eX]

[DOI]

Bioinform., 2010

Over-optimism in bioinformatics research.

[BibT_eX]

[DOI]

Bioinform., 2010

2009

Survival prediction using gene expression data: A review and comparison.

[BibT_eX]

[DOI]

Wessel N. van Wieringen

David Kun

Regina Hampel

Comput. Stat. Data Anal., 2009

Regularized estimation of large-scale gene association networks using graphical Gaussian models.

[BibT_eX]

[DOI]

Nicole Krämer

Juliane Schäfer

BMC Bioinform., 2009

Stability and aggregation of ranked gene lists.

[BibT_eX]

[DOI]

Martin Slawski

Briefings Bioinform., 2009

2008

Conditional variable importance for random forests.

[BibT_eX]

[DOI]

Thomas Kneib

Thomas Augustin

Achim Zeileis

BMC Bioinform., 2008

CMA - a comprehensive Bioconductor package for supervised classification with high dimensional data.

[BibT_eX]

[DOI]

Martin Slawski

Martin Daumer

BMC Bioinform., 2008

Microarray-based classification and clinical predictors: on combined classifiers and additional predictive value.

[BibT_eX]

[DOI]

Christine Porzelius

Martin Daumer

Bioinform., 2008

2007

Unbiased split selection for classification trees based on the Gini Index.

[BibT_eX]

[DOI]

Thomas Augustin

Comput. Stat. Data Anal., 2007

Maximally selected Chi-squared statistics and non-monotonic associations: An exact approach based on two cutpoints.

[BibT_eX]

[DOI]

Comput. Stat. Data Anal., 2007

Bias in random forest variable importance measures: Illustrations, sources and a solution.

[BibT_eX]

[DOI]

Achim Zeileis

Torsten Hothorn

BMC Bioinform., 2007

WilcoxCV: an R package for fast variable selection in cross-validation.

[BibT_eX]

[DOI]

Bioinform., 2007

Partial least squares: a versatile tool for the analysis of high-dimensional genomic data.

[BibT_eX]

[DOI]

Korbinian Strimmer

Briefings Bioinform., 2007

2006

Identification of interaction patterns and classification with applications to microarray data.

[BibT_eX]

[DOI]

Gerhard Tutz

Comput. Stat. Data Anal., 2006

2003

A CART-based approach to discover emerging patterns in microarray data.

[BibT_eX]

[DOI]