Probabilistic Metric to measure the imbalance in multi-class problems

Cited by: 0

Authors:

Lopes Agostinho, Solander Patricio [1]

Mendes-Moreira, Joao

Affiliation:

[1] Univ Porto, LIAAD INESC TEC, R Dr Roberto Frias, P-4200465 Porto, Portugal

Source:

FOURTH INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 183 | 2022 / Vol. 183

Keywords:

imbalanced data; multi-class domain; classification; probabilistic metric

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In machine learning, imbalanced data is one of the most relevant issues that classifiers have to deal with. The most common techniques applied in this scenario are all based, in some way, on oversampling or undersampling. In the former, the number of instances of the minority classes is increased, while in the latter, the number of instances of the majority classes is reduced. Pre-processing approaches such as these have been successful in binary classification problems. However, the larger the number of classes, the less effective these pre-processing approaches become. A related problem is that the metrics used to evaluate the predictive performance of classifiers may not be effective in the presence of imbalanced data. These metrics can be divided into three groups: threshold, ranking, and probabilistic metrics. This paper proposes a probabilistic metric whose main objective is, given the results of a classifier in a multi-class domain, to verify the relation between these results and the imbalance problem. The metric is built on non-parametric approaches and measures the effect of dataset imbalance in multi-class problems. As part of the work, a comparison with existing metrics will be implemented and analyzed, both to understand the relation between them and to choose the best one for each scenario.


Pages: 151-162

Page count: 12
