HingeBoost: ROC-Based Boost for Classification and Variable Selection

被引:21
|
作者
Wang, Zhu [1 ,2 ]
机构
[1] Connecticut Childrens Med Ctr, Hartford, CT USA
[2] Univ Connecticut, Sch Med, Storrs, CT 06269 USA
来源
关键词
functional gradient descent; support vector machine; ROC; classification; misclassification costs; SUPPORT VECTOR MACHINES; PROSTATE-CANCER; PREDICTION; REGULARIZATION; ALGORITHMS; REGRESSION; SPLINES; MODELS; LASSO; PATH;
D O I
10.2202/1557-4679.1304
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In disease classification, a traditional technique is the receiver operative characteristic (ROC) curve and the area under the curve (AUC). With high-dimensional data, the ROC techniques are needed to conduct classification and variable selection. The current ROC methods do not explicitly incorporate unequal misclassification costs or do not have a theoretical grounding for optimizing the AUC. Empirical studies in the literature have demonstrated that optimizing the hinge loss can maximize the AUC approximately. In theory, minimizing the hinge rank loss is equivalent to minimizing the AUC in the asymptotic limit. In this article, we propose a novel nonparametric method HingeBoost to optimize a weighted hinge loss incorporating misclassification costs. HingeBoost can be used to construct linear and nonlinear classifiers. The estimation and variable selection for the hinge loss are addressed by a new boosting algorithm. Furthermore, the proposed twin HingeBoost can select more sparse predictors. Some properties of HingeBoost are studied as well. To compare HingeBoost with existing classification methods, we present empirical study results using data from simulations and a prostate cancer study with mass spectrometry-based proteomics.
引用
收藏
页数:31
相关论文
共 50 条
  • [1] ROC-based cost-sensitive classification with a reject option
    Dubos, Clement
    Bernard, Simon
    Adam, Sebastien
    Sabourin, Robert
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3320 - 3325
  • [2] ROC-Based Utility Function Maximization for Feature Selection and Classification with Applications to High-Dimensional Protease Data
    Liu, Zhenqiu
    Tan, Ming
    [J]. BIOMETRICS, 2008, 64 (04) : 1155 - 1161
  • [3] A ROC-based reject rule for dichotomizers
    Tortorella, F
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (02) : 167 - 180
  • [4] A ROC-based feature selection method for computer-aided detection and diagnosis
    Wang, Songyuan
    Zhang, Guopeng
    Liao, Qimei
    Zhang, Junying
    Jiao, Chun
    Lu, Hongbing
    [J]. MEDICAL IMAGING 2014: COMPUTER-AIDED DIAGNOSIS, 2014, 9035
  • [5] CENTRE INDEPENDENT DETECTION OF MALIGNANCY BY MEANS OF CLASSIFICATION WITH ROC-BASED DATA TRANSFORMATION
    Bitterlich, N.
    Muley, Th.
    Schneider, J.
    [J]. ANTICANCER RESEARCH, 2008, 28 (6B) : 4041 - 4042
  • [6] Optimal ROC-Based Classification and Performance Analysis under Bayesian Uncertainty Models
    Dalton, Lori A.
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (04) : 719 - 729
  • [7] Reducing the classification cost of support vector classifiers through an ROC-based reject rule
    Tortorella, F
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2004, 7 (02) : 128 - 143
  • [8] ROC-based calibration of flood inundation models
    Schumann, G. J. -P.
    Vernieuwe, H.
    De Baets, B.
    Verhoest, N. E. C.
    [J]. HYDROLOGICAL PROCESSES, 2014, 28 (22) : 5495 - 5502
  • [9] Reducing the classification cost of support vector classifiers through an ROC-based reject rule
    Francesco Tortorella
    [J]. Pattern Analysis and Applications, 2004, 7 : 128 - 143
  • [10] Cost-Sensitive Neural Network with ROC-Based Moving Threshold for Imbalanced Classification
    Krawczyk, Bartosz
    Wozniak, Michal
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 45 - 52