Probabilistic Metric to measure the imbalance in multi-class problems

被引:0
|
作者
Lopes Agostinho, Solander Patricio [1 ]
Mendes-Moreira, Joao
机构
[1] Univ Porto, LIAAD INESC TEC, R Dr Roberto Frias, P-4200465 Porto, Portugal
关键词
imbalanced data; multi-class domain; classification; probabilistic metric;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning, imbalanced data has been one of the most relevant issue that the classifiers have to deal with. The most common techniques applied in this scenario are all, somehow, based on oversampling or under sampling concepts, In the former, the number of instances of minority classes are, somehow, increased while in the latter, the number of instances in the majority classes are somehow reduced. By increasing Pre-processing, approaches as the ones described have been well succeeded in binary classification problems.However, as the larger the number of classes, less effective the pre-processing approaches are. Another related problem is that the metrics that evaluate the predictive performance of the classifiers can be not effective in the presence of imbalanced data. The metrics used to measure the predictive performance of classifiers, can be divided into three groups: threshold, ranking and Probabilistic metrics. This paper aimed to purpose a probabilistic metric with the main objective of, given the results of a classifier in a multi-class domain, verify the relation between these result and the imbalance problem. The main purpose of this work, is to build a probabilistic metric based on non-parametric approaches, to measure the effect of imbalance feature of dataset in multi-class problems. As part of the work, a comparison with the existing metrics will be implemented and analyzed, both to understand the relation between them and to choose the best of them according to each scenario.
引用
收藏
页码:151 / 162
页数:12
相关论文
共 50 条
  • [1] Imbalance accuracy metric for model selection in multi-class imbalance classification problems
    Mortaz, Ebrahim
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 210
  • [2] Spectral measure for multi-class problems
    Windeatt, T
    [J]. MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2004, 3077 : 184 - 193
  • [3] Measuring the class-imbalance extent of multi-class problems
    Ortigosa-Hernandez, Jonathan
    Inza, Inaki
    Lozano, Jose A.
    [J]. PATTERN RECOGNITION LETTERS, 2017, 98 : 32 - 38
  • [4] Solving Multi-class Imbalance Problems Using Improved Tabular GANs
    Farou, Zakarya
    Kopeikina, Liudmila
    Horvath, Tomas
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2022, 2022, 13756 : 527 - 539
  • [5] Multi-Class Probabilistic Active Learning
    Kottke, Daniel
    Krempl, Georg
    Lang, Dominik
    Teschner, Johannes
    Spiliopoulou, Myra
    [J]. ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 586 - 594
  • [6] Multi-objective evolution of artificial neural networks in multi-class medical diagnosis problems with class imbalance
    Shenfield, Alex
    Rostami, Shahin
    [J]. 2017 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2017, : 217 - 224
  • [7] LRID: A new metric of multi-class imbalance degree based on likelihood-ratio test
    Zhu, Rui
    Wang, Ziyu
    Ma, Zhanyu
    Wang, Guijin
    Xue, Jing-Hao
    [J]. PATTERN RECOGNITION LETTERS, 2018, 116 : 36 - 42
  • [8] Multi-class imbalance problem: A multi-objective solution
    He, Yi-Xiao
    Liu, Dan-Xuan
    Lyu, Shen-Huan
    Qian, Chao
    Zhou, Zhi-Hua
    [J]. INFORMATION SCIENCES, 2024, 680
  • [9] Multi-Imbalance: An open-source software for multi-class imbalance learning
    Zhang, Chongsheng
    Bi, Jingjun
    Xu, Shixin
    Ramentol, Enislay
    Fan, Gaojuan
    Qiao, Baojun
    Fujita, Hamido
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 174 : 137 - 143
  • [10] Efficient Multi-Class Probabilistic SVMs on GPUs
    Wen, Zeyi
    Shi, Jiashuai
    He, Bingsheng
    Chen, Jian
    Chen, Yawen
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (09) : 1693 - 1706