The Impact of Local Data Characteristics on Learning from Imbalanced Data

被引:0
|
作者
Stefanowski, Jerzy [1 ]
机构
[1] Poznan Univ Tech, Inst Comp Sci, PL-60965 Poznan, Poland
关键词
RULE INDUCTION; CLASSIFICATION; CLASSIFIERS; SMOTE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Problems of learning classifiers from imbalanced data are discussed. First, we look at different data difficulty factors corresponding to complex distributions of the minority class and show that they could be approximated by analysing the neighbourhood of the learning examples from the minority class. We claim that the results of this analysis could be a basis for developing new algorithms. In this paper we show such possibilities by discussing modifications of informed pre-processing method LN-SMOTE as well as by incorporating types of examples into rule induction algorithm BRACID.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [1] The Impact of Imbalanced Training Data on Local Matching Learning of Ontologies
    Laadhar, Amir
    Ghozzi, Faiza
    Megdiche, Imen
    Ravat, Franck
    Teste, Olivier
    Gargouri, Faiez
    [J]. BUSINESS INFORMATION SYSTEMS, PT I, 2019, 353 : 162 - 175
  • [2] Learning from Imbalanced Data
    He, Haibo
    Garcia, Edwardo A.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) : 1263 - 1284
  • [3] An empirical study of data intrinsic characteristics that make learning from imbalanced data difficult
    Dudjak, Mario
    Martinovic, Goran
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 182
  • [4] Metric Learning from Imbalanced Data
    Gautheron, Leo
    Habrard, Amaury
    Morvant, Emilie
    Sebban, Marc
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 923 - 930
  • [5] Local ensemble learning from imbalanced and noisy data for word sense disambiguation
    Krawczyk, Bartosz
    McInnes, Bridget T.
    [J]. PATTERN RECOGNITION, 2018, 78 : 103 - 119
  • [6] A Study on the Impact of Data Characteristics in Imbalanced Regression Tasks
    Branco, Paula
    Torgo, Luis
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 193 - 202
  • [7] Local cost sensitive learning for handling imbalanced data sets
    Karagiannopoulos, M. G.
    Anyfantis, D. S.
    Kotsiantis, S. B.
    Pintelas, P. E.
    [J]. 2007 MEDITERRANEAN CONFERENCE ON CONTROL & AUTOMATION, VOLS 1-4, 2007, : 235 - 240
  • [8] Online Continual Learning from Imbalanced Data
    Chrysakis, Aristotelis
    Moens, Marie-Francine
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [9] Online Continual Learning from Imbalanced Data
    Chrysakis, Aristotelis
    Moens, Marie-Francine
    [J]. 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [10] Learning in Imbalanced Relational Data
    Ghanem, Amal S.
    Venkatesh, Svetha
    West, Geoff
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 436 - 439