Minimum sample size for external validation of a clinical prediction model with a binary outcome
Riley, Richard D.; Debray, Thomas P.A.; Collins, Gary S.; Archer, Lucinda; Ensor, Joie; van Smeden, Maarten; Snell, Kym I.E.
(2021) Statistics in Medicine, volume 40, issue 19, pp. 4230 - 4251
(Article)
Abstract
In prediction model research, external validation is needed to examine an existing model's performance using data independent to that for model development. Current external validation studies often suffer from small sample sizes and consequently imprecise predictive performance estimates. To address this, we propose how to determine the minimum sample size
... read more
needed for a new external validation study of a prediction model for a binary outcome. Our calculations aim to precisely estimate calibration (Observed/Expected and calibration slope), discrimination (C-statistic), and clinical utility (net benefit). For each measure, we propose closed-form and iterative solutions for calculating the minimum sample size required. These require specifying: (i) target SEs (confidence interval widths) for each estimate of interest, (ii) the anticipated outcome event proportion in the validation population, (iii) the prediction model's anticipated (mis)calibration and variance of linear predictor values in the validation population, and (iv) potential risk thresholds for clinical decision-making. The calculations can also be used to inform whether the sample size of an existing (already collected) dataset is adequate for external validation. We illustrate our proposal for external validation of a prediction model for mechanical heart valve failure with an expected outcome event proportion of 0.018. Calculations suggest at least 9835 participants (177 events) are required to precisely estimate the calibration and discrimination measures, with this number driven by the calibration slope criterion, which we anticipate will often be the case. Also, 6443 participants (116 events) are required to precisely estimate net benefit at a risk threshold of 8%. Software code is provided.
show less
Download/Full Text
Keywords: binary outcomes, calibration, discrimination, external validation, minimum sample size, multivariable prediction model, net benefit, Epidemiology, Statistics and Probability
ISSN: 0277-6715
Publisher: John Wiley & Sons Inc.
Note: Funding Information: Gary Collins is supported by Cancer Research UK (program grant: C49297/A27294) and the NIHR Biomedical Research Centre, Oxford. Kym Snell is funded by the National Institute for Health Research School for Primary Care Research (NIHR SPCR). Thomas Debray is funded by the Netherlands Organisation for Health Research and Development (grant 91617050) and this project received funding from the European Union's Horizon 2020 research and innovation programme under ReCoDID grant agreement no. 825746. This publication presents independent research funded by the National Institute for Health Research (NIHR). The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health and Social Care. We would like to thank two anonymous reviewers for their constructive feedback and suggestions that helped us to improve the article upon revision. Funding Information: Cancer Research UK, C49297/A27294; European Union's Horizon 2020 Research and Innovation Programme, ReCoDID Grant Agreement no. 825746; National Institute for Health Research School for Primary Care Research (NIHR SPCR), Netherlands Organisation for Health Research and Development, grant 91617050; NIHR Biomedical Research Centre, Royal Marsden NHS Foundation Trust/Institute of Cancer Research, Oxford, NIHR Biomedical Research Centre, Oxford Funding information Publisher Copyright: © 2021 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
(Peer reviewed)