Overestimation of the receiver operating characteristic curve for logistic regression
Logistic regression is often used to find a linear combination of covariates which best discriminates between two groups or populations. The ROC, receiver operating characteristic, curve is a good way of assessing the performance of the resulting score, but using the same data both to fit the score and to calculate its ROC leads to an over-optimistic estimate of the performance which the score would give if it were to be validated on a sample of future cases. The paper studies the extent of this overestimation, and suggests a shrinkage correction for the ROC curve itself and for the area under the curve. The correction is consistent with Efron's formula for the bias in the error rate of a binary prediction rule. Two medical examples are discussed. Copyright Biometrika Trust 2002, Oxford University Press.
Year of publication: |
2002
|
---|---|
Authors: | Copas, J. B. |
Published in: |
Biometrika. - Biometrika Trust, ISSN 0006-3444. - Vol. 89.2002, 2, p. 315-331
|
Publisher: |
Biometrika Trust |
Saved in:
Saved in favorites
Similar items by person
-
Regression, prediction and shrinkage : read before the Royal Statistical Society, 12.1.1983
Copas, J. B., (1983)
-
Monte Carlo results for estimation in a stable markov time series
Copas, J. B., (1966)
-
Binary regression models for contaminated data
Copas, J. B., (1988)
- More ...