Showing 1 - 10 of 97
Persistent link: https://www.econbiz.de/10005172603
The Gini gain is one of the most common variable selection criteria in machine learning. We derive the exact distribution of the maximally selected Gini gain in the context of binary classification using continuous predictors by means of a combinatorial approach. This distribution provides a...
Persistent link: https://www.econbiz.de/10003310038
Persistent link: https://www.econbiz.de/10010722387
Binary outcomes that depend on an ordinal predictor in a nonmonotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of...
Persistent link: https://www.econbiz.de/10010266135
Variable importance measures for random forests have been receiving increased attention as a means of variable selection in many classification tasks in bioinformatics and related scientific fields, for instance to select a subset of genetic markers relevant for the prediction of a certain...
Persistent link: https://www.econbiz.de/10010280795
Most genetic diseases are complex, i.e. associated to combinations of SNPs rather than individual SNPs. In the last few years, this topic has often been addressed in terms of SNP-SNP interaction patterns given as expressions linked by logical operators. Methods for multiple testing in...
Persistent link: https://www.econbiz.de/10005585092
Persistent link: https://www.econbiz.de/10005165617
Binary outcomes that depend on an ordinal predictor in a nonmonotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of...
Persistent link: https://www.econbiz.de/10003377879
Variable importance measures for random forests have been receiving increased attention as a means of variable selection in many classification tasks in bioinformatics and related scientific fields, for instance to select a subset of genetic markers relevant for the prediction of a certain...
Persistent link: https://www.econbiz.de/10003378498
Persistent link: https://www.econbiz.de/10010947003