Combining biomarkers linearly and nonlinearly for classification using the area under the ROC curve

被引:17
|
作者
Fong, Youyi [1 ,2 ]
Yin, Shuxin [1 ]
Huang, Ying [1 ,2 ]
机构
[1] Fred Hutchinson Canc Res Ctr, Publ Hlth Sci, 1100 Fairview Ave N,M2-B500, Seattle, WA 98109 USA
[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
关键词
AUC; biomarker combination; classification; kernel; ramp loss; ROC curve; SUPPORT VECTOR MACHINE; VARIABLE SELECTION; MODELS;
D O I
10.1002/sim.6956
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In biomedical studies, it is often of interest to classify/predict a subject's disease status based on a variety of biomarker measurements. A commonly used classification criterion is based on area under the receiver operating characteristic curve (AUC). Many methods have been proposed to optimize approximated empirical AUC criteria, but there are two limitations to the existing methods. First, most methods are only designed to find the best linear combination of biomarkers, which may not perform well when there is strong nonlinearity in the data. Second, many existing linear combination methods use gradient-based algorithms to find the best marker combination, which often result in suboptimal local solutions. In this paper, we address these two problems by proposing a new kernel-based AUC optimization method called ramp AUC (RAUC). This method approximates the empirical AUC loss function with a ramp function and finds the best combination by a difference of convex functions algorithm. We show that as a linear combination method, RAUC leads to a consistent and asymptotically normal estimator of the linear marker combination when the data are generated from a semiparametric generalized linear model, just as the smoothed AUC method. Through simulation studies and real data examples, we demonstrate that RAUC outperforms smooth AUC in finding the best linear marker combinations, and can successfully capture nonlinear pattern in the data to achieve better classification performance. We illustrate our method with a dataset from a recent HIV vaccine trial. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:3792 / 3809
页数:18
相关论文
共 50 条
  • [31] Area Under the ROC Curve Maximization for Metric Learning
    Gajic, Bojana
    Amato, Ariel
    Baldrich, Ramon
    van de Weijer, Joost
    Gatta, Carlo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2806 - 2815
  • [32] The area under the ROC curve as a measure of clustering quality
    Pablo A. Jaskowiak
    Ivan G. Costa
    Ricardo J. G. B. Campello
    Data Mining and Knowledge Discovery, 2022, 36 : 1219 - 1245
  • [33] Area Under the ROC Curve of Enhanced Energy Detector
    Khalid, Syed Safwan
    Abrar, Shafayat
    2013 11TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT), 2013, : 131 - 135
  • [34] Feature Selection for Maximizing the Area Under the ROC Curve
    Wang, Rui
    Tang, Ke
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 400 - 405
  • [35] Ranking Instances by Maximizing the Area under ROC Curve
    Guvenir, H. Altay
    Kurtcephe, Murat
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (10) : 2356 - 2366
  • [36] A boosting method for maximization of the area under the ROC curve
    Komori, Osamu
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2011, 63 (05) : 961 - 979
  • [37] Active learning to maximize area under the ROC curve
    Culver, Matt
    Kun, Deng
    Scott, Stephen
    ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 149 - +
  • [38] Exact bootstrap variances of the area under ROC curve
    Bandos, Andriy I.
    Rockette, Howard E.
    Gur, David
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2007, 36 (13-16) : 2443 - 2461
  • [39] Optimizing area under the ROC curve using semi-supervised learning
    Wang, Shijun
    Li, Diana
    Petrick, Nicholas
    Sahiner, Berkman
    Linguraru, Marius George
    Summers, Ronald M.
    PATTERN RECOGNITION, 2015, 48 (01) : 276 - 287
  • [40] Biomarker selection for medical diagnosis using the partial area under the ROC curve
    Hsu M.-J.
    Chang Y.-C.I.
    Hsueh H.-M.
    BMC Research Notes, 7 (1)