A New Performance Measure for Class Imbalance Learning. Application to Bioinformatics Problems

被引:26
|
作者
Batuwita, Rukshan [1 ]
Palade, Vasile [1 ]
机构
[1] Univ Oxford, Comp Lab, Oxford OX1 3QD, England
关键词
Performance Measures; Class Imbalance Learning; Bioinformatics; Model Selection; SVMs; CLASSIFICATION;
D O I
10.1109/ICMLA.2009.126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In class imbalance learning, the performance measure used for the model selection would play a vital role. It has been well-studied in the past research that the most widely used performance measure, the overall accuracy of the model, can lead to sub-optimal classification models when learning from imbalanced datasets. In order to overcome this problem, other performance measures, such as the Geometric-mean (Gm) and F-measure (Fm), have been used for imbalanced dataset learning. Training a classifier system with an imbalanced dataset (where the positive class is the minority class) would usually produce sub-optimal models having a higher Specificity (SP) and a lower Sensitivity (SE). By applying class imbalance learning methods, we would often be able to increase the SE by sacrificing some amount of SP. In some type of real world imbalanced classification problems, such as the gene finding Bioinformatics problems, it is important to improve the SE as much as possible by keeping the reduction of SP to the minimum. In this paper, we show that with respect to this type of classification problems the existing performance measures used in class imbalance learning (Gm and Fm) can still result in sub-optimal classification models. In order to circumvent these problems, we introduced a new performance measure, called Adjusted Geometric-mean (AGm). We show, both analytically and empirically on two real-world Bioinformatics datasets, that AGm can perform better than Gm and Fm metrics.
引用
收藏
页码:545 / 550
页数:6
相关论文
共 50 条
  • [21] Creating Universum for class imbalance via locality and its application in multiview subspace learning
    Yang, Xiang-Fei
    Wang, Dong-Lin
    Pan, Jia-Hang
    Li, Chun-Na
    Shao, Yuan-Hai
    [J]. INFORMATION SCIENCES, 2023, 647
  • [22] Application of deep reinforcement learning for spike sorting under multi-class imbalance
    Li, Suchen
    Tang, Zhuo
    Yang, Lifang
    Li, Mengmeng
    Shang, Zhigang
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 164
  • [23] SWSEL: Sliding Window-based Selective Ensemble Learning for class-imbalance problems
    Dai, Qi
    Liu, Jian-wei
    Yang, Jia-Peng
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [24] FT4cip: A new functional tree for classification in class imbalance problems
    Canete-Sifuentes, Leonardo
    Monroy, Raul
    Medina-Perez, Miguel Angel
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [25] Numerical solving of a new class of canonical problems and their application
    Vasilev, EN
    Solodukhov, VV
    Makkaveeva, VF
    [J]. RADIO SCIENCE, 1996, 31 (06) : 1853 - 1861
  • [26] Class imbalance revisited: a new experimental setup to assess the performance of treatment methods
    Prati, Ronaldo C.
    Batista, Gustavo E. A. P. A.
    Silva, Diego F.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 45 (01) : 247 - 270
  • [27] Class imbalance revisited: a new experimental setup to assess the performance of treatment methods
    Ronaldo C. Prati
    Gustavo E. A. P. A. Batista
    Diego F. Silva
    [J]. Knowledge and Information Systems, 2015, 45 : 247 - 270
  • [28] Class imbalance: A crucial factor affecting the performance of tea plantations mapping by machine learning
    Xiao, Yuanjun
    Huang, Jingfeng
    Weng, Wei
    Huang, Ran
    Shao, Qi
    Zhou, Chang
    Li, Shengcheng
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 129
  • [29] Quantifying the Impact of Class Imbalance Handling Techniques On Medical Image Deep Learning Performance
    Reber, B.
    Brock, K.
    [J]. MEDICAL PHYSICS, 2022, 49 (06) : E257 - E257
  • [30] Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data
    Seol, Mihye
    Kim, Taejoon
    [J]. SENSORS, 2023, 23 (03)