Probabilistic Metric to measure the imbalance in multi-class problems

被引:0
|
作者
Lopes Agostinho, Solander Patricio [1 ]
Mendes-Moreira, Joao
机构
[1] Univ Porto, LIAAD INESC TEC, R Dr Roberto Frias, P-4200465 Porto, Portugal
关键词
imbalanced data; multi-class domain; classification; probabilistic metric;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In machine learning, imbalanced data has been one of the most relevant issue that the classifiers have to deal with. The most common techniques applied in this scenario are all, somehow, based on oversampling or under sampling concepts, In the former, the number of instances of minority classes are, somehow, increased while in the latter, the number of instances in the majority classes are somehow reduced. By increasing Pre-processing, approaches as the ones described have been well succeeded in binary classification problems.However, as the larger the number of classes, less effective the pre-processing approaches are. Another related problem is that the metrics that evaluate the predictive performance of the classifiers can be not effective in the presence of imbalanced data. The metrics used to measure the predictive performance of classifiers, can be divided into three groups: threshold, ranking and Probabilistic metrics. This paper aimed to purpose a probabilistic metric with the main objective of, given the results of a classifier in a multi-class domain, verify the relation between these result and the imbalance problem. The main purpose of this work, is to build a probabilistic metric based on non-parametric approaches, to measure the effect of imbalance feature of dataset in multi-class problems. As part of the work, a comparison with the existing metrics will be implemented and analyzed, both to understand the relation between them and to choose the best of them according to each scenario.
引用
收藏
页码:151 / 162
页数:12
相关论文
共 50 条
  • [31] Clustering-Based Oversampling Algorithm for Multi-class Imbalance Learning
    Zhao, Haixia
    Wu, Jian
    [J]. JOURNAL OF CLASSIFICATION, 2024,
  • [32] Probabilistic Methods in Multi-Class Brain-Computer Interface
    Ping YangXu LeiTieJun LiuPeng Xuand DeZhong Yao The authors are with the Key Laboratory for NeuroInformation of Ministry of EducationSchool of Life Science and TechnologyUniversity of Electronic Science and Technology of ChinaChengduChina
    [J]. Journal of Electronic Science and Technology of China, 2009, 7 (01) - 16
  • [33] A probabilistic approach to feature selection for multi-class text categorization
    Wu, Ke
    Lu, Bao-Liang
    Uchiyama, Masao
    Isahara, Hitoshi
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 1310 - +
  • [34] Multi-Class Imbalance Classification Based on Data Distribution and Adaptive Weights
    Li, Shuxian
    Song, Liyan
    Wu, Xiaoyu
    Hu, Zheng
    Cheung, Yiu-ming
    Yao, Xin
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5265 - 5279
  • [35] Dynamic Ensemble Selection and Data Preprocessing for Multi-Class Imbalance Learning
    Cruz, Rafael M. O.
    Souza, Mariana de Araujo
    Sabourin, Robert
    Cavalcanti, George D. C.
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (11)
  • [36] Truly Unordered Probabilistic Rule Sets for Multi-class Classification
    Yang, Lincen
    van Leeuwen, Matthijs
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT V, 2023, 13717 : 87 - 103
  • [37] Probabilistic Decision Trees using SVM for Multi-class Classification
    Uribe, Juan Sebastian
    Mechbal, Nazih
    Rebillat, Marc
    Bouamama, Karima
    Pengov, Marco
    [J]. 2013 2ND INTERNATIONAL CONFERENCE ON CONTROL AND FAULT-TOLERANT SYSTEMS (SYSTOL), 2013, : 619 - 624
  • [38] Dynamic and Probabilistic Multi-class Prediction of Tunnel Squeezing Intensity
    Chen, Yu
    Li, Tianbin
    Zeng, Peng
    Ma, Junjie
    Patelli, Edoardo
    Edwards, Ben
    [J]. ROCK MECHANICS AND ROCK ENGINEERING, 2020, 53 (08) : 3521 - 3542
  • [40] A Streaming Ensemble Classifier with Multi-Class Imbalance Learning for Activity Recognition
    Shahi, Ahmad
    Deng, Jeremiah D.
    Woodford, Brendon J.
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 3983 - 3990