Simpson's paradox in the integrated discrimination improvement

被引:12
|
作者
Chipman, J. [1 ]
Braun, D. [2 ,3 ]
机构
[1] Vanderbilt Univ Sch Med, Dept Biostat, 2525 West End Ave,Suite 11000, Nashville, TN 37203 USA
[2] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[3] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02215 USA
基金
美国国家卫生研究院;
关键词
integrated discrimination improvement; simpson's paradox; risk prediction; reclassification; BRCAPRO; NET RECLASSIFICATION IMPROVEMENT; CARRIER PROBABILITY ESTIMATION; CLINICALLY RELEVANT MEASURES; PREDICTION MODELS; PERFORMANCE; MARKER; BRCA2; RISK; CALIBRATION; CAUTION;
D O I
10.1002/sim.6862
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The integrated discrimination improvement (IDI) is commonly used to compare two risk prediction models; it summarizes the extent a new model increases risk in events and decreases risk in non-events. The IDI averages risks across events and non-events and is therefore susceptible to Simpson's paradox. In some settings, adding a predictive covariate to a well calibrated model results in an overall negative (positive) IDI. However, if stratified by that same covariate, the strata-specific IDIs are positive (negative). Meanwhile, the calibration (observed to expected ratio and Hosmer-Lemeshow Goodness of Fit Test), area under the receiver operating characteristic curve, and Brier score improve overall and by stratum. We ran extensive simulations to investigate the impact of an imbalanced covariate upon metrics (IDI, area under the receiver operating characteristic curve, Brier score, and R-2), provide an analytic explanation for the paradox in the IDI, and use an investigative metric, a Weighted IDI, to better understand the paradox. In simulations, all instances of the paradox occurred under stratum-specific mis-calibration, yet there were mis-calibrated settings in which the paradox did not occur. The paradox is illustrated on Cancer Genomics Network data by calculating predictions based on two versions of BRCAPRO, a Mendelian risk prediction model for breast and ovarian cancer. In both simulations and the Cancer Genomics Network data, overall model calibration did not guarantee stratum-level calibration. We conclude that the IDI should only assess model performance among a clinically relevant subset when stratum-level calibration is strictly met and recommend calculating additional metrics to confirm the direction and conclusions of the IDI. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:4468 / 4481
页数:14
相关论文
共 50 条
  • [21] How Likely Is Simpson's Paradox?
    Pavlides, Marios G.
    Perlman, Michael D.
    AMERICAN STATISTICIAN, 2009, 63 (03): : 226 - 233
  • [22] 102.54 Unravelling Simpson's paradox
    Beardon, A. F.
    MATHEMATICAL GAZETTE, 2018, 102 (555): : 534 - 535
  • [23] CONFIRMATION, CAUSATION, AND SIMPSON'S PARADOX
    Fitelson, Branden
    EPISTEME-A JOURNAL OF INDIVIDUAL AND SOCIAL EPISTEMOLOGY, 2017, 14 (03): : 297 - 309
  • [24] A geographical perspective on Simpson's paradox
    Sachdeva, Mehak
    Fotheringham, A. Stewart
    JOURNAL OF SPATIAL INFORMATION SCIENCE, 2023, (26): : 1 - 25
  • [25] The Simpson's paradox in quantum mechanics
    Selvitella, Alessandro
    JOURNAL OF MATHEMATICAL PHYSICS, 2017, 58 (03)
  • [26] An Applet for the Investigation of Simpson's Paradox
    Schneiter, Kady
    Symanzik, Juergen
    JOURNAL OF STATISTICS EDUCATION, 2013, 21 (01):
  • [27] Simpson's paradox, a tale of causality
    Chambaz, Antoine
    Drouet, Isabelle
    Memetea, Sonia
    JOURNAL OF THE SFDS, 2020, 161 (01): : 42 - 66
  • [28] Simpson's paradox beyond confounding
    Dong, Zili
    Cai, Weixin
    Zhao, Shimin
    EUROPEAN JOURNAL FOR PHILOSOPHY OF SCIENCE, 2024, 14 (03)
  • [29] Simpson's Paradox and Experimental Research
    Ameringer, Suzanne
    Serlin, Ronald C.
    Ward, Sandra
    NURSING RESEARCH, 2009, 58 (02) : 123 - 127
  • [30] Simpson's Paradox in Survival Models
    Di Serio, Clelia
    Rinott, Yosef
    Scarsini, Marco
    SCANDINAVIAN JOURNAL OF STATISTICS, 2009, 36 (03) : 463 - 480