Measuring classifier performance: a coherent alternative to the area under the ROC curve

被引:649
|
作者
Hand, David J. [1 ,2 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Math, London, England
[2] Univ London Imperial Coll Sci Technol & Med, Inst Math Sci, London, England
关键词
ROC curves; Classification; AUC; Specificity; Sensitivity; Misclassification rate; Cost; Loss; Error rate;
D O I
10.1007/s10994-009-5119-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The area under the ROC curve (AUC) is a very widely used measure of performance for classification and diagnostic rules. It has the appealing property of being objective, requiring no subjective input from the user. On the other hand, the AUC has disadvantages, some of which are well known. For example, the AUC can give potentially misleading results if ROC curves cross. However, the AUC also has a much more serious deficiency, and one which appears not to have been previously recognised. This is that it is fundamentally incoherent in terms of misclassification costs: the AUC uses different misclassification cost distributions for different classifiers. This means that using the AUC is equivalent to using different metrics to evaluate different classification rules. It is equivalent to saying that, using one classifier, misclassifying a class 1 point is p times as serious as misclassifying a class 0 point, but, using another classifier, misclassifying a class 1 point is P times as serious, where p not equal P. This is nonsensical because the relative severities of different kinds of misclassifications of individual points is a property of the problem, not the classifiers which happen to have been chosen. This property is explored in detail, and a simple valid alternative to the AUC is proposed.
引用
收藏
页码:103 / 123
页数:21
相关论文
共 50 条
  • [11] On use of partial area under the ROC curve for evaluation of diagnostic performance
    Ma, Hua
    Bandos, Andriy I.
    Rockette, Howard E.
    Gur, David
    STATISTICS IN MEDICINE, 2013, 32 (20) : 3449 - 3458
  • [12] The area under an ROC curve with limited information
    van den Hout, WB
    MEDICAL DECISION MAKING, 2003, 23 (02) : 160 - 166
  • [13] THE AREA UNDER THE ROC CURVE AND ITS COMPETITORS
    HILDEN, J
    MEDICAL DECISION MAKING, 1991, 11 (02) : 95 - 101
  • [14] Generalization bounds for the area under the ROC curve
    Agarwal, S
    Graepel, T
    Herbrich, R
    Har-Peled, S
    Roth, D
    JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 393 - 425
  • [15] The partial area under the summary ROC curve
    Walter, SD
    STATISTICS IN MEDICINE, 2005, 24 (13) : 2025 - 2040
  • [16] Is the area under an ROC curve a valid measure of the performance of a screening or diagnostic test?
    Wald, N. J.
    Bestwick, J. P.
    JOURNAL OF MEDICAL SCREENING, 2014, 21 (01) : 51 - 56
  • [17] Performance of tests based on the area under the ROC curve for multireader diagnostic data
    Hwang, Yi-Ting
    Hsu, Ya-Ru
    Su, Nan-Cheng
    JOURNAL OF APPLIED STATISTICS, 2025, 52 (03) : 555 - 577
  • [18] On the limitations of the area under the ROC curve for NTCP modelling
    Bahn, Emanuel
    Alber, Markus
    RADIOTHERAPY AND ONCOLOGY, 2020, 144 : 148 - 151
  • [19] Regression analysis for the partial area under the ROC curve
    Cai, Tianxi
    Dodd, Lori E.
    STATISTICA SINICA, 2008, 18 (03) : 817 - 836
  • [20] The area under the ROC curve as a measure of clustering quality
    Jaskowiak, Pablo A.
    Costa, Ivan G.
    Campello, Ricardo J. G. B.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (03) : 1219 - 1245