Similar Search Results

Asymptotic optimality of full cross-validation for selecting linear regression models

Droge, Bernd - 1997

For the problem of model selection, full cross-validation has been proposed as an alternative criterion to traditional cross-validation, particularly in cases where the latter is not well defined. To justify the use of the new proposal, we show that under some conditions, both criteria share...

Persistent link: https://www.econbiz.de/10010310761

A comparison of machine learning model validation schemes for non-stationary time series data

Schnaubelt, Matthias - 2019

Machine learning is increasingly applied to time series data, as it constitutes an attractive alternative to forecasts based on traditional time series models. For independent and identically distributed observations, cross-validation is the prevalent scheme for estimating out-of-sample...

Persistent link: https://www.econbiz.de/10012142940

Loss-Based Estimation with Cross-Validation: Applications to Microarray Data Analysis and Motif Finding

Dudoit, Sandrine; Laan, Mark van der; Keles, Sunduz; … - Berkeley Electronic Press - 2004

Current statistical inference problems in genomic data analysis involve parameter estimation for high-dimensional multivariate distributions, with typically unknown and intricate correlation patterns among variables. Addressing these inference questions satisfactorily requires: (i) an intensive...

Persistent link: https://www.econbiz.de/10005459073

Multiple Testing Methods For ChIP-Chip High Density Oligonucleotide Array Data

Keles, Sunduz; Laan, Mark van der; Dudoit, Sandrine; … - Berkeley Electronic Press - 2004

Cawley et al. (2004) have recently mapped the locations of binding sites for three transcription factors along human chromosomes 21 and 22 using ChIP-Chip experiments. ChIP-Chip experiments are a new approach to the genome-wide identification of transcription factor binding sites and consist of...

Persistent link: https://www.econbiz.de/10005459074

The Cross-Validated Adaptive Epsilon-Net Estimator

Laan, Mark van der; Dudoit, Sandrine; Vaart, Aad van der - Berkeley Electronic Press - 2004

Suppose that we observe a sample of independent and identically distributed realizations of a random variable. Assume that the parameter of interest can be defined as the minimizer, over a suitably defined parameter space, of the expectation (with respect to the distribution of the random...

Persistent link: https://www.econbiz.de/10005459075

Model Selection Using Information Theory and the MDL Principle

Stine, Robert A. - In: Sociological Methods & Research 33 (2004) 2, pp. 230-260

Information theory offers a coherent, intuitive view of model selection. This perspective arises from thinking of a statistical model as a code, an algorithm for compressing data into a sequence of bits. The description length is the length of this code for the data plus the length of a...

Persistent link: https://www.econbiz.de/10010789425

Asymptotic optimality of full cross-validation for selecting linear regression models

Droge, Bernd - Sonderforschungsbereich 373, Quantifikation und … - 1997

Persistent link: https://www.econbiz.de/10010956413

Upward and Downward Bias When Measuring Inequality of Opportunity

Brunori, Paolo; Peragine, Vito; Serlenga, Laura - 2018

Estimates of the level of inequality of opportunity have traditionally been interpreted as lower bounds due to the downward bias resulting from the partial observability of circumstances that affect individual outcome. We show that such estimates may also suffer from upward bias as a consequence...

Persistent link: https://www.econbiz.de/10011873409

Deletion/Substitution/Addition Algorithm in Learning with Applications in Genomics

Sinisi, Sandra; Laan, Mark van der - In: Statistical Applications in Genetics and Molecular Biology 3 (2009) 1, pp. 18-18

van der Laan and Dudoit (2003) provide a road map for estimation and performance assessment where a parameter of interest is defined as the risk minimizer for a suitable loss function and candidate estimators are generated using a loss function. After briefly reviewing this approach, this...

Persistent link: https://www.econbiz.de/10005046582

Model Selection and Variable Transformations in Nonlinear Regression

Bunke, Olaf; Droge, Bernd; Polzehl, Jörg - Center for Operations Research and Econometrics (CORE), … - 1993

The results of analyzing experimental data using a parametric model may heavily depend on the chosen model. In this paper we propose procedures for the adequate selection of nonlinear regression models if the intended use of the model is among the following: prediction of future values of the...

Persistent link: https://www.econbiz.de/10005008409