For the problem of model selection, full cross-validation has been proposed as an alternative criterion to traditional cross-validation, particularly in cases where the latter is not well defined. To justify the use of the new proposal, we show that under some conditions, both criteria share...
Persistent link: https://www.econbiz.de/10010310761
Machine learning is increasingly applied to time series data, as it constitutes an attractive alternative to forecasts based on traditional time series models. For independent and identically distributed observations, cross-validation is the prevalent scheme for estimating out-of-sample...
Persistent link: https://www.econbiz.de/10012142940
Current statistical inference problems in genomic data analysis involve parameter estimation for high-dimensional multivariate distributions, with typically unknown and intricate correlation patterns among variables. Addressing these inference questions satisfactorily requires: (i) an intensive...
Persistent link: https://www.econbiz.de/10005459073
Cawley et al. (2004) have recently mapped the locations of binding sites for three transcription factors along human chromosomes 21 and 22 using ChIP-Chip experiments. ChIP-Chip experiments are a new approach to the genome-wide identification of transcription factor binding sites and consist of...
Persistent link: https://www.econbiz.de/10005459074
Suppose that we observe a sample of independent and identically distributed realizations of a random variable. Assume that the parameter of interest can be defined as the minimizer, over a suitably defined parameter space, of the expectation (with respect to the distribution of the random...
Persistent link: https://www.econbiz.de/10005459075
Information theory offers a coherent, intuitive view of model selection. This perspective arises from thinking of a statistical model as a code, an algorithm for compressing data into a sequence of bits. The description length is the length of this code for the data plus the length of a...
Persistent link: https://www.econbiz.de/10010789425
For the problem of model selection, full cross-validation has been proposed as an alternative criterion to traditional cross-validation, particularly in cases where the latter is not well defined. To justify the use of the new proposal, we show that under some conditions, both criteria share...
Persistent link: https://www.econbiz.de/10010956413
Estimates of the level of inequality of opportunity have traditionally been interpreted as lower bounds due to the downward bias resulting from the partial observability of circumstances that affect individual outcomes. We show that such estimates may also suffer from upward bias as a consequence...
Persistent link: https://www.econbiz.de/10011873409
van der Laan and Dudoit (2003) provide a road map for estimation and performance assessment where a parameter of interest is defined as the risk minimizer for a suitable loss function and candidate estimators are generated using a loss function. After briefly reviewing this approach, this...
Persistent link: https://www.econbiz.de/10005046582
The results of analyzing experimental data using a parametric model may heavily depend on the chosen model. In this paper we propose procedures for the adequate selection of nonlinear regression models if the intended use of the model is among the following: prediction of future values of the...
Persistent link: https://www.econbiz.de/10005008409