Evaluation Measures of the Classification Performance of Imbalanced Data Sets

被引:173
|
作者
Gu, Qiong [1 ,2 ]
Zhu, Li [2 ]
Cai, Zhihua [2 ]
机构
[1] Xiangfan Univ, Fac Math & Comp Sci, Xiangfan 441053, Hubei, Peoples R China
[2] China Univ Geosci, Sch Comp, Wuhan 430074, Peoples R China
关键词
Evaluation; classification performance; imbalanced data sets;
D O I
10.1007/978-3-642-04962-0_53
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discriminant Measures for Classification Performance play a critical role in guiding the design of classifiers, assessment methods and evaluation measures are at least as important as algorithm and are the first key stage to a successful data mining. We systematically summarized the evaluation measures of Imbalanced Data Sets (IDS). Several different type measures, such as commonly performance evaluation measures and visualizing classifier performance measures have been analyzed and compared. The problems of these measures towards IDS may lead to misunderstanding of classification results and even wrong strategy decision. Beside that, a series of complex numerical evaluation measures were also investigated which can also serve for evaluating classification performance of IDS.
引用
收藏
页码:461 / +
页数:2
相关论文
共 50 条
  • [1] Preliminary Evaluation of Classification Complexity Measures on Imbalanced Data
    Xing, Yan
    Cai, Hao
    Cai, Yanguang
    Hejlesen, Ole
    Toft, Egon
    PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT INFORMATION PROCESSING, 2013, 256 : 189 - 196
  • [2] Performance of evaluation metrics for classification in imbalanced data
    Huayanay, Alex de la Cruz
    Bazan, Jorge L.
    Russo, Cibele M.
    COMPUTATIONAL STATISTICS, 2024, : 1447 - 1473
  • [3] Graph Classification with Imbalanced Data Sets
    Xiao, Gang-Song
    Chen, Xiao-Yun
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 57 - 61
  • [4] Linear Approximation of F-Measure for the Performance Evaluation of Classification Algorithms on Imbalanced Data Sets
    Wong, Tzu-Tsung
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (02) : 753 - 763
  • [5] The Text Classification for Imbalanced Data Sets
    Li, Yanling
    Zhu, Yehang
    Yang, Ping
    ISISE 2008: INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING, VOL 2, 2008, : 778 - +
  • [6] Classification on Imbalanced Data Sets, Taking Advantage of Errors to Improve Performance
    Lopez-Chau, Asdrubal
    Garcia-Lamont, Farid
    Cervantes, Jair
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2015, PT III, 2015, 9227 : 72 - 78
  • [7] Classification with local clustering in imbalanced data sets
    Ji, Hua
    Zhang, Huaxiang
    ADVANCED RESEARCH ON INFORMATION SCIENCE, AUTOMATION AND MATERIAL SYSTEM, PTS 1-6, 2011, 219-220 : 151 - 155
  • [8] Examining the Performance of Classification Algorithms for Imbalanced Data Sets in Web Author Identification
    Vorobeva, Alisa A.
    2016 18TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION AND SEMINAR ON INFORMATION SECURITY AND PROTECTION OF INFORMATION TECHNOLOGY (FRUCT-ISPIT), 2016, : 385 - 390
  • [9] On the Dynamics of Classification Measures for Imbalanced and Streaming Data
    Brzezinski, Dariusz
    Stefanowski, Jerzy
    Susmaga, Robert
    Szczech, Izabela
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 2868 - 2878
  • [10] Data Complexity Measures for Imbalanced Classification Tasks
    Barella, Victor H.
    Garcia, Luis P. F.
    de Souto, Marcilio P.
    Lorena, Ana C.
    de Carvalho, Andre
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,