Minimum sample size calculations for external validation of a clinical prediction model with a time-to-event outcome

被引:38
|
作者
Riley, Richard D. [1 ]
Collins, Gary S. [2 ,3 ]
Ensor, Joie [1 ]
Archer, Lucinda [1 ]
Booth, Sarah [4 ]
Mozumder, Sarwar, I [4 ]
Rutherford, Mark J. [4 ]
van Smeden, Maarten [5 ]
Lambert, Paul C. [4 ,6 ]
Snell, Kym I. E. [1 ]
机构
[1] Keele Univ, Ctr Prognosis Res, Sch Med, Keele ST5 5BG, Staffs, England
[2] Univ Oxford, Ctr Stat Med, Nuffield Dept Orthopaed Rheumatol & Musculoskelet, Oxford, England
[3] Oxford Univ Hosp NHS Fdn Trust, NIHR Oxford Biomed Res Ctr, Oxford, England
[4] Univ Leicester, George Davies Ctr, Dept Hlth Sci, Biostat Res Grp, Leicester, Leics, England
[5] Univ Utrecht, Univ Med Ctr Utrecht, Julius Ctr Hlth Sci & Primary Care, Utrecht, Netherlands
[6] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
基金
美国国家卫生研究院;
关键词
calibration; external validation; prediction model; sample size; time-to-event & survival data; REGRESSION-ANALYSIS; PSEUDO-OBSERVATIONS; PROGNOSTIC MODEL; CALIBRATION; RISK;
D O I
10.1002/sim.9275
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Previous articles in Statistics in Medicine describe how to calculate the sample size required for external validation of prediction models with continuous and binary outcomes. The minimum sample size criteria aim to ensure precise estimation of key measures of a model's predictive performance, including measures of calibration, discrimination, and net benefit. Here, we extend the sample size guidance to prediction models with a time-to-event (survival) outcome, to cover external validation in datasets containing censoring. A simulation-based framework is proposed, which calculates the sample size required to target a particular confidence interval width for the calibration slope measuring the agreement between predicted risks (from the model) and observed risks (derived using pseudo-observations to account for censoring) on the log cumulative hazard scale. Precise estimation of calibration curves, discrimination, and net-benefit can also be checked in this framework. The process requires assumptions about the validation population in terms of the (i) distribution of the model's linear predictor and (ii) event and censoring distributions. Existing information can inform this; in particular, the linear predictor distribution can be approximated using the C-index or Royston's D statistic from the model development article, together with the overall event risk. We demonstrate how the approach can be used to calculate the sample size required to validate a prediction model for recurrent venous thromboembolism. Ideally the sample size should ensure precise calibration across the entire range of predicted risks, but must at least ensure adequate precision in regions important for clinical decision-making. Stata and R code are provided.
引用
收藏
页码:1280 / 1295
页数:16
相关论文
共 50 条
  • [1] Minimum sample size for external validation of a clinical prediction model with a continuous outcome
    Archer, Lucinda
    Snell, Kym I. E.
    Ensor, Joie
    Hudda, Mohammed T.
    Collins, Gary S.
    Riley, Richard D.
    [J]. STATISTICS IN MEDICINE, 2021, 40 (01) : 133 - 146
  • [2] Minimum sample size for external validation of a clinical prediction model with a binary outcome
    Riley, Richard D.
    Debray, Thomas P. A.
    Collins, Gary S.
    Archer, Lucinda
    Ensor, Joie
    van Smeden, Maarten
    Snell, Kym I. E.
    [J]. STATISTICS IN MEDICINE, 2021, 40 (19) : 4230 - 4251
  • [3] Minimum sample size for developing a multivariable prediction model: PART II - binary and time-to-event outcomes
    Riley, Richard D.
    Snell, Kym I. E.
    Ensor, Joie
    Burke, Danielle L.
    Harrell, Frank E., Jr.
    Moons, Karel G. M.
    Collins, Gary S.
    [J]. STATISTICS IN MEDICINE, 2019, 38 (07) : 1276 - 1296
  • [4] Aligning sample size calculations with estimands in clinical trials with time-to-event outcomes
    Fang, Yixin
    Jin, Man
    Wu, Chengqing
    [J]. STATISTICS AND ITS INTERFACE, 2024, 17 (01) : 63 - 68
  • [5] Statistical power and sample size calculations for time-to-event analysis
    Zurakowski, David
    Staffa, Steven J.
    [J]. JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2023, 166 (06): : 1542 - +
  • [6] Sample size calculations for noninferiority trials for time-to-event data using the concept of proportional time
    Phadnis, Milind A.
    Mayo, Matthew S.
    [J]. JOURNAL OF APPLIED STATISTICS, 2021, 48 (06) : 1009 - 1032
  • [7] Power and Sample Size Calculations for SNP Association Studies With Censored Time-to-Event Outcomes
    Owzar, Kouros
    Li, Zhiguo
    Cox, Nancy
    Jung, Sin-Ho
    [J]. GENETIC EPIDEMIOLOGY, 2012, 36 (06) : 538 - 548
  • [8] Power and sample-size analysis for the Royston Parmar combined test in clinical trials with a time-to-event outcome
    Royston, Patrick
    [J]. STATA JOURNAL, 2018, 18 (01): : 3 - 21
  • [9] Prioritising deteriorating patients using time-to-event analysis: prediction model development and internal-external validation
    Blythe, Robin
    Parsons, Rex
    Barnett, Adrian G.
    Cook, David
    McPhail, Steven M.
    White, Nicole M.
    [J]. CRITICAL CARE, 2024, 28 (01)
  • [10] Discrimination-based sample size calculations for multivariable prognostic models for time-to-event data
    Jinks, Rachel C.
    Royston, Patrick
    Parmar, Mahesh K. B.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2015, 15