Robust Estimation of Area Under ROC Curve Using Auxiliary Variables in the Presence of Missing Biomarker Values

被引:18
|
作者
Long, Qi [1 ]
Zhang, Xiaoxi [2 ]
Johnson, Brent A. [1 ]
机构
[1] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[2] Pfizer Inc, New York, NY 11017 USA
关键词
Area under the curve; Biomarker; Doubly robust estimators; Missing at random; Missing not at random; Receiver operating characteristic curve; Sensitivity analysis; POSTNATAL DEPRESSION SCALE; VERIFICATION BIAS; NONRESPONSE; VALIDATION; MODELS;
D O I
10.1111/j.1541-0420.2010.01487.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In medical research, the receiver operating characteristic (ROC) curves can be used to evaluate the performance of biomarkers for diagnosing diseases or predicting the risk of developing a disease in the future. The area under the ROC curve (ROC AUC), as a summary measure of ROC curves, is widely utilized, especially when comparing multiple ROC curves. In observational studies, the estimation of the AUC is often complicated by the presence of missing biomarker values, which means that the existing estimators of the AUC are potentially biased. In this article, we develop robust statistical methods for estimating the ROC AUC and the proposed methods use information from auxiliary variables that are potentially predictive of the missingness of the biomarkers or the missing biomarker values. We are particularly interested in auxiliary variables that are predictive of the missing biomarker values. In the case of missing at random (MAR), that is, missingness of biomarker values only depends on the observed data, our estimators have the attractive feature of being consistent if one correctly specifies, conditional on auxiliary variables and disease status, either the model for the probabilities of being missing or the model for the biomarker values. In the case of missing not at random (MNAR), that is, missingness may depend on the unobserved biomarker values, we propose a sensitivity analysis to assess the impact of MNAR on the estimation of the ROC AUC. The asymptotic properties of the proposed estimators are studied and their finite-sample behaviors are evaluated in simulation studies. The methods are further illustrated using data from a study of maternal depression during pregnancy.
引用
收藏
页码:559 / 567
页数:9
相关论文
共 50 条
  • [21] Non-parametric interval estimation for the partial area under the ROC curve
    Qin, Gengsheng
    Jin, Xiaoping
    Zhou, Xiao-Hua
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2011, 39 (01): : 17 - 33
  • [22] Imputation-based empirical likelihood inference for the area under the ROC curve with missing data
    Wang, Binhuan
    Qin, Gengsheng
    [J]. STATISTICS AND ITS INTERFACE, 2012, 5 (03) : 319 - 329
  • [23] Statistical inference for the area under the ROC curve in the presence of random measurement error.
    Schisterman, EF
    Faraggi, D
    Reiser, B
    Trevisan, M
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 1999, 149 (11) : S16 - S16
  • [24] Bayesian ROC curve estimation under binormality using a rank likelihood
    Gu, Jiezhun
    Ghosal, Subhashis
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2009, 139 (06) : 2076 - 2083
  • [25] Doubly robust estimation of the area under the receiver-operating characteristic curve in the presence of verification bias
    Rotnitzky, Andrea
    Faraggi, David
    Schisterman, Enrique
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (475) : 1276 - 1288
  • [26] Mean estimation using robust quantile regression with two auxiliary variables
    Shahzad, U.
    Ahmad, I.
    Almanjahie, I. M.
    Al-Noor, N. H.
    Hanif, M.
    [J]. SCIENTIA IRANICA, 2023, 30 (03) : 1245 - 1254
  • [27] The estimation of gross flows in the presence of measurement error using auxiliary variables
    Pfeffermann, D
    Skinner, C
    Humphreys, K
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1998, 161 : 13 - 32
  • [28] Optimizing area under the ROC curve using semi-supervised learning
    Wang, Shijun
    Li, Diana
    Petrick, Nicholas
    Sahiner, Berkman
    Linguraru, Marius George
    Summers, Ronald M.
    [J]. PATTERN RECOGNITION, 2015, 48 (01) : 276 - 287
  • [29] Combining biomarkers linearly and nonlinearly for classification using the area under the ROC curve
    Fong, Youyi
    Yin, Shuxin
    Huang, Ying
    [J]. STATISTICS IN MEDICINE, 2016, 35 (21) : 3792 - 3809
  • [30] Using the area under an estimated ROC curve to test the adequacy of binary predictors
    Lieli, Robert P.
    Hsu, Yu-Chin
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2019, 31 (01) : 100 - 130