New methodology has been proposed in recent years for evaluating the improvement in prediction performance gained by adding a new predictor, Y, to a risk model containing a set of baseline predictors, X, for a binary outcome D. We prove theoretically that null hypotheses concerning no improvement in performance are equivalent to the simple null hypothesis that the coefficient for Y is zero in the risk model, P(D = 1 | X, Y). Therefore, testing for improvement in prediction performance is redundant if Y has already been shown to be a risk factor. We investigate properties of tests through simulation studies, focusing on the change in the area under the ROC curve (AUC). An unexpected finding is that standard testing procedures that do not adjust for variability in estimated regression coefficients are extremely conservative. This may explain why the AUC is widely considered insensitive to improvements in prediction performance, and it suggests that the problem of insensitivity has to do with the use of invalid procedures for inference rather than with the measure itself. To avoid redundant testing and the use of potentially problematic methods for inference, we recommend that hypothesis testing for no improvement be limited to evaluation of Y as a risk factor, for which methods are well developed and widely available. Analyses of measures of prediction performance should focus on estimation rather than on testing.
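The setup described above can be illustrated with a minimal simulation sketch (not the authors' code; the data-generating model, coefficient values, and sample size are assumptions for illustration). It generates D from a logistic model in X and Y, then compares the empirical AUC of the baseline risk score (X alone) with that of the score incorporating Y. Because the AUC is invariant to monotone transformations, the true linear predictors are used directly in place of fitted risk models:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000

# Hypothetical data-generating model: logistic in baseline predictor X
# and new predictor Y, with assumed coefficients.
X = rng.normal(size=n)
Y = rng.normal(size=n)
linpred = -1.0 + 1.0 * X + 1.0 * Y
D = rng.random(n) < 1.0 / (1.0 + np.exp(-linpred))

def auc(score, d):
    """Empirical AUC via the Mann-Whitney statistic:
    the proportion of case-control pairs where the case outscores the control."""
    ranks = score.argsort().argsort() + 1  # ranks 1..n (no ties for continuous scores)
    n1 = d.sum()
    n0 = (~d).sum()
    return (ranks[d].sum() - n1 * (n1 + 1) / 2) / (n1 * n0)

auc_base = auc(X, D)            # baseline model: X only
auc_full = auc(X + 1.0 * Y, D)  # model with Y added
delta_auc = auc_full - auc_base
```

In this setting the improvement delta_auc is positive exactly when Y's coefficient is nonzero, which is the equivalence the abstract refers to; the paper's point is that inference on delta_auc should account for variability in estimated coefficients, whereas the sketch above sidesteps estimation entirely by using the true coefficients.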
Pepe, Margaret S. PhD; Kerr, Kathleen F.; Longton, Gary M.; and Wang, Zheyu, "Testing for improvement in prediction model performance" (March 2012). UW Biostatistics Working Paper Series. Working Paper 379.