Visualizing the decision rules behind the ROC curves: understanding the classification process

被引:8
|
作者
Perez-Fernandez, Sonia [1 ]
Martinez-Camblor, Pablo [2 ]
Filzmoser, Peter [3 ]
Corral, Norberto [1 ]
机构
[1] Univ Oviedo, Dept Stat & OR & MD, Oviedo, Spain
[2] Geisel Sch Med Dartmouth, Hanover, NH USA
[3] Vienna Univ Technol, Inst Stat & Math Methods Econ, Vienna, Austria
关键词
Area under the curve; Classification regions; Graphical animations; Multivariate marker; Receiver operating characteristic curve; MULTIPLE DIAGNOSTIC BIOMARKERS; LINEAR-COMBINATIONS; AREA; MORTALITY; RISK;
D O I
10.1007/s10182-020-00385-2
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The receiver operating characteristic (ROC) curve is a graphical method commonly used to study the capacity of continuous variables (markers) to properly classify subjects into one of two groups. The decision made is ultimately endorsed by a classification subset on the space where the marker is defined. In this paper, we study graphical representations and propose visual forms to reflect those classification rules giving rise to the construction of the ROC curve. On the one hand, we use static pictures for displaying the classification regions for univariate markers, which are specially convenient when there is not a monotone relationship between the marker and the likelihood of belonging to one group. In those cases, there are two options to improve the classification accuracy: to allow for more flexibility in the classification rules (for example considering two cutoff points instead of one) or to transform the marker by using a function whose resulting ROC curve is optimal. On the other hand, we propose to build videos for visualizing the collection of subsets when several markers are considered simultaneously. A compilation of techniques for finding a rule that maximizes the area under the ROC curve is included, with a focus on linear combinations. We present a tool for the R software which generates those graphics, and we apply it to one real dataset. The R code is provided as Supplementary Material.
引用
收藏
页码:135 / 161
页数:27
相关论文
共 50 条
  • [1] Visualizing the decision rules behind the ROC curves: understanding the classification process
    Sonia Pérez-Fernández
    Pablo Martínez-Camblor
    Peter Filzmoser
    Norberto Corral
    [J]. AStA Advances in Statistical Analysis, 2021, 105 : 135 - 161
  • [2] On optimal reject rules and ROC curves
    Santos-Pereira, CM
    Pires, AM
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (07) : 943 - 952
  • [3] ROC CURVES FOR CLASSIFICATION TREES
    RAUBERTAS, RF
    RODEWALD, LE
    HUMISTON, SG
    SZILAGYI, PG
    [J]. MEDICAL DECISION MAKING, 1994, 14 (02) : 169 - 174
  • [4] RuleMatrix: Visualizing and Understanding Classifiers with Rules
    Ming, Yao
    Qu, Huamin
    Bertini, Enrico
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (01) : 342 - 352
  • [5] THE USE OF ROC CURVES IN HISTOPATHOLOGIC DECISION-MAKING
    LANGLEY, FA
    BUCKLEY, CH
    TASKER, M
    [J]. ANALYTICAL AND QUANTITATIVE CYTOLOGY AND HISTOLOGY, 1985, 7 (03): : 167 - 173
  • [6] ROC CURVES FOR THE CLASSIFICATION ALGORTHIMS IN STATISTICA DATA MINER
    Jakoubek, Bohumil
    [J]. APLIMAT 2007 - 6TH INTERNATIONAL CONFERENCE, PT I, 2007, : 363 - 369
  • [7] Selecting features in microarray classification using ROC curves
    Mamitsuka, Hiroshi
    [J]. PATTERN RECOGNITION, 2006, 39 (12) : 2393 - 2404
  • [8] Feature selection using ROC curves on classification problems
    Serrano, Antonio J.
    Soria, Emilio
    Martin, Jose D.
    Magdalena, Rafael
    Gomez, Juan
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [9] Ordinal classification with decision rules
    Dembczynski, Krzysztof
    Kotlowski, Wojciech
    Slowinski, Roman
    [J]. MINING COMPLEX DATA, 2008, 4944 : 169 - 181
  • [10] ROC Curves, Loss Functions, and Distorted Probabilities in Binary Classification
    Le, Phuong Bich
    Nguyen, Zung Tien
    [J]. MATHEMATICS, 2022, 10 (09)