Simpson's paradox in the integrated discrimination improvement

被引:12
|
作者
Chipman, J. [1 ]
Braun, D. [2 ,3 ]
机构
[1] Vanderbilt Univ Sch Med, Dept Biostat, 2525 West End Ave,Suite 11000, Nashville, TN 37203 USA
[2] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[3] Dana Farber Canc Inst, Dept Biostat & Computat Biol, Boston, MA 02215 USA
基金
美国国家卫生研究院;
关键词
integrated discrimination improvement; simpson's paradox; risk prediction; reclassification; BRCAPRO; NET RECLASSIFICATION IMPROVEMENT; CARRIER PROBABILITY ESTIMATION; CLINICALLY RELEVANT MEASURES; PREDICTION MODELS; PERFORMANCE; MARKER; BRCA2; RISK; CALIBRATION; CAUTION;
D O I
10.1002/sim.6862
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The integrated discrimination improvement (IDI) is commonly used to compare two risk prediction models; it summarizes the extent a new model increases risk in events and decreases risk in non-events. The IDI averages risks across events and non-events and is therefore susceptible to Simpson's paradox. In some settings, adding a predictive covariate to a well calibrated model results in an overall negative (positive) IDI. However, if stratified by that same covariate, the strata-specific IDIs are positive (negative). Meanwhile, the calibration (observed to expected ratio and Hosmer-Lemeshow Goodness of Fit Test), area under the receiver operating characteristic curve, and Brier score improve overall and by stratum. We ran extensive simulations to investigate the impact of an imbalanced covariate upon metrics (IDI, area under the receiver operating characteristic curve, Brier score, and R-2), provide an analytic explanation for the paradox in the IDI, and use an investigative metric, a Weighted IDI, to better understand the paradox. In simulations, all instances of the paradox occurred under stratum-specific mis-calibration, yet there were mis-calibrated settings in which the paradox did not occur. The paradox is illustrated on Cancer Genomics Network data by calculating predictions based on two versions of BRCAPRO, a Mendelian risk prediction model for breast and ovarian cancer. In both simulations and the Cancer Genomics Network data, overall model calibration did not guarantee stratum-level calibration. We conclude that the IDI should only assess model performance among a clinically relevant subset when stratum-level calibration is strictly met and recommend calculating additional metrics to confirm the direction and conclusions of the IDI. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:4468 / 4481
页数:14
相关论文
共 50 条
  • [1] SIMPSON PARADOX IN MODEL DISCRIMINATION
    FERRARIS, GB
    QUADERNI DELL INGEGNERE CHIMICO ITALIANO, 1979, 15 (1-2): : 10 - 12
  • [2] Simpson's Paradox: A Data Set and Discrimination Case Study Exercise
    Taylor, Stanley A.
    Mickel, Amy E.
    JOURNAL OF STATISTICS EDUCATION, 2014, 22 (01):
  • [3] Simpson's paradox
    Alin, Aylin
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (02) : 247 - 250
  • [4] Simpson's paradox
    Kunisaki, K
    CRITICAL CARE MEDICINE, 2005, 33 (07) : 1673 - 1674
  • [5] The Drug Titration Paradox is Simpson's Paradox
    Schamberg, Gabriel
    Brown, Emery N.
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2021, 110 (06) : 1424 - 1425
  • [6] The logic of Simpson's paradox
    Bandyoapdhyay, Prasanta S.
    Nelson, Davin
    Greenwood, Mark
    Brittan, Gordon
    Berwald, Jesse
    SYNTHESE, 2011, 181 (02) : 185 - 208
  • [7] A Question of Simpson's Paradox
    McLean, Thomas R.
    JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2018, 227 (06) : 628 - 628
  • [8] The logic of Simpson’s paradox
    Prasanta S. Bandyoapdhyay
    Davin Nelson
    Mark Greenwood
    Gordon Brittan
    Jesse Berwald
    Synthese, 2011, 181 : 185 - 208
  • [9] The Simpson's paradox unraveled
    Hernan, Miguel A.
    Clayton, David
    Keiding, Niels
    INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2011, 40 (03) : 780 - 785
  • [10] Simpson's paradox - Reply
    Topeli, A
    Laghi, F
    Cakir, B
    Tobin, MJ
    CRITICAL CARE MEDICINE, 2005, 33 (07) : 1674 - 1674