Showing 1 - 10 of 12
Various supervised learning and gene selection methods have been used for cancer diagnosis. Most of these methods do not consider interactions between genes, although this might be interesting biologically and improve classification accuracy. Here we introduce a new CART-based method to discover...
Persistent link: https://www.econbiz.de/10010265643
PLS dimension reduction is known to give good prediction accuracy in the context of classification with high-dimensional microarray data. In this paper, PLS is compared with some of the best state-of-the-art classification methods. In addition, a simple procedure to choose the number of...
Persistent link: https://www.econbiz.de/10010266131
Binary outcomes that depend on an ordinal predictor in a nonmonotonic way are common in medical data analysis. Such patterns can be addressed in terms of cutpoints: for example, one looks for two cutpoints that define an interval in the range of the ordinal predictor for which the probability of...
Persistent link: https://www.econbiz.de/10010266135
We present a stochastic model for single cell gel electrophoresis (COMET-assay) data. Essential is the use of point process structures, renewal theory and reduction to intensity histograms for further data analysis.
Persistent link: https://www.econbiz.de/10010266141
The study of the network between transcription factors and their targets is important for understanding the complex regulatory mechanisms in a cell. However, due to post-translational modifications the regulator transcription levels (as measured, e.g., by microarray expression arrays) generally...
Persistent link: https://www.econbiz.de/10010266142
Partial Least Squares (PLS) is a highly efficient statistical regression technique that is well suited for the analysis of high-dimensional genomic data. In this paper we review the theory and applications of PLS both under methodological and biological points of view. Focusing on microarray...
Persistent link: https://www.econbiz.de/10010266169
We propose a novel method to model nonlinear regression problems by adapting the principle of penalization to Partial Least Squares (PLS). Starting with a generalized additive model, we expand the additive component of each variable in terms of a generous amount of B-Splines basis functions. In...
Persistent link: https://www.econbiz.de/10010266177
This work is motivated by a mobility study conducted in the city of Munich, Germany. The variable of interest is a binary response, which indicates whether public transport has been utilized or not. One of the central questions is to identify areas of low/high utilization of public transport...
Persistent link: https://www.econbiz.de/10010266198
The Gini gain is one of the most common variable selection criteria in machine learning. We derive the exact distribution of the maximally selected Gini gain in the context of binary classification using continuous predictors by means of a combinatorial approach. This distribution provides a...
Persistent link: https://www.econbiz.de/10010266219
We address the problem of maximally selected chi-square statistics in the case of a binary Y variable and a nominal X variable with several categories. The distribution of the maximally selected chi-square statistic has already been derived when the best cutpoint is chosen from a continuous or...
Persistent link: https://www.econbiz.de/10010266224