Is the area under an ROC curve a valid measure of the performance of a screening or diagnostic test?

被引:48
|
作者
Wald, N. J. [1 ]
Bestwick, J. P. [1 ]
机构
[1] Barts & London Queen Marys Sch Med & Dent, Wolfson Inst Prevent Med, London EC1M 6BQ, England
关键词
ROC curve; AUC; screening test; diagnostic test;
D O I
10.1177/0969141313517497
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Objectives: The area under a receiver operating characteristic (ROC) curve (the AUC) is used as a measure of the performance of a screening or diagnostic test. We here assess the validity of the AUC. Methods: Assuming the test results follow Gaussian distributions in affected and unaffected individuals, standard mathematical formulae were used to describe the relationship between the detection rate (DR) (or sensitivity) and the false-positive rate (FPR) of a test with the AUC. These formulae were used to calculate the screening performance (DR for a given FPR, or FPR for a given DR) for different AUC values according to different standard deviations of the test result in affected and unaffected individuals. Results: The DR for a given FPR is strongly dependent on relative differences in the standard deviation of the test variable in affected and unaffected individuals. Consequently, two tests with the same AUC can have a different DR for the same FPR. For example, an AUC of 0.75 has a DR of 24% for a 5% FPR if the standard deviations are the same in affected and unaffected individuals, but 39% for the same 5% FPR if the standard deviation in affected individuals is 1.5 times that in unaffected individuals. Conclusion: The AUC is an unreliable measure of screening performance because in practice the standard deviation of a screening or diagnostic test in affected and unaffected individuals can differ. The problem is avoided by not using AUC at all, and instead specifying DRs for given FPRs or FPRs for given DRs.
引用
收藏
页码:51 / 56
页数:6
相关论文
共 50 条
  • [41] Active learning to maximize area under the ROC curve
    Culver, Matt
    Kun, Deng
    Scott, Stephen
    [J]. ICDM 2006: SIXTH INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2006, : 149 - +
  • [42] Exact bootstrap variances of the area under ROC curve
    Bandos, Andriy I.
    Rockette, Howard E.
    Gur, David
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2007, 36 (13-16) : 2443 - 2461
  • [43] A boosting method for maximization of the area under the ROC curve
    Komori, Osamu
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2011, 63 (05) : 961 - 979
  • [44] Feature Selection for Maximizing the Area Under the ROC Curve
    Wang, Rui
    Tang, Ke
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 400 - 405
  • [45] Area Under the ROC Curve of Enhanced Energy Detector
    Khalid, Syed Safwan
    Abrar, Shafayat
    [J]. 2013 11TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY (FIT), 2013, : 131 - 135
  • [46] Ranking Instances by Maximizing the Area under ROC Curve
    Guvenir, H. Altay
    Kurtcephe, Murat
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (10) : 2356 - 2366
  • [47] Score Fusion by Maximizing the Area under the ROC Curve
    Villegas, Mauricio
    Paredes, Roberto
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2009, 5524 : 473 - 480
  • [48] Exact Probability Distribution for the ROC Area under Curve
    Ekstrom, Joakim
    Akerren Ogren, Jim
    Sjoblom, Tobias
    [J]. CANCERS, 2023, 15 (06)
  • [49] A boosting method for maximization of the area under the ROC curve
    Osamu Komori
    [J]. Annals of the Institute of Statistical Mathematics, 2011, 63 : 961 - 979
  • [50] Equivalence of the statistics for replicability and area under the ROC curve
    Irwin, R. John
    [J]. BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2009, 62 : 485 - 487