Collective of Base Classifiers for Mining Imbalanced Data

被引:0
|
作者
Jedrzejowicz, Joanna [1 ]
Jedrzejowicz, Piotr [2 ]
机构
[1] Univ Gdansk, Inst Informat, Fac Math Phys & Informat, PL-80308 Gdansk, Poland
[2] Gdynia Maritime Univ, Dept Informat Syst, PL-81225 Gdynia, Poland
关键词
Imbalanced data; Oversampling; Gene expression programming;
D O I
10.1007/978-3-031-08754-7_62
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Mining imbalanced datasets is a challenging and difficult problem. In this paper we adress it by proposing GEP-NB classifier based on the oversampling technique. It combines two learning methods - Gene Expression Programming and Naive Bayes, which cooperate to produce a final prediction. At the pre-processing stage a simple mechanism for generating synthetic minority class examples and balancing the training set is used. Next, two genes g1 and g2 are evolved using Gene Expression Programming. They differ by applying in each case a different procedure for selecting synthetic minority class examples. If the class prediction by g1 agrees with the class prediction made by g2, their decision is final. Otherwise the final predictive decision is taken by the Naive Bayes classifier. The approach is validated in an extensive computational experiment. Results produced by GEP-NB are compared with performance of several state-of-the-art classifiers. Comparisons show that GEP-NB offers a competitive performance.
引用
收藏
页码:571 / 585
页数:15
相关论文
共 50 条
  • [41] Cayley graphs as classifiers for data mining: The influence of asymmetries
    Kelarev, Andrei
    Ryan, Joe
    Yearwood, John
    DISCRETE MATHEMATICS, 2009, 309 (17) : 5360 - 5369
  • [42] DATA MINING CLASSIFIERS COMPARISON FOR SEISMIC HAZARD PREDICTION
    Sneha
    Abhari, Abdolreza
    Ding, Chen
    COMMUNICATIONS AND NETWORKING SYMPOSIUM (CNS 2018), 2018,
  • [43] Mining Event Associations using Structured Data and Classifiers
    Zhao, Jinxin
    Wang, Xinjun
    Yan, Zhongmin
    Wei, Song
    2015 12TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA), 2015, : 259 - 264
  • [44] A similarity evaluation technique for data mining with an ensemble of classifiers
    Puuronen, S
    Terziyan, V
    11TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, PROCEEDINGS, 2000, : 1155 - 1159
  • [45] Complex Neural Classifiers for Power Quality Data Mining
    Vidhya, S.
    Kamaraj, V.
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2018, 13 (04) : 1714 - 1722
  • [46] Data Mining by Symbolic Fuzzy Classifiers and Genetic Programming
    Owais, Suhail
    Kroemer, Pavel
    Platos, Jan
    Snasel, Vaclav
    Zelinka, Ivan
    NOSTRADAMUS: MODERN METHODS OF PREDICTION, MODELING AND ANALYSIS OF NONLINEAR SYSTEMS, 2013, 192 : 273 - +
  • [47] Types of minority class examples and their influence on learning classifiers from imbalanced data
    Napierala, Krystyna
    Stefanowski, Jerzy
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 46 (03) : 563 - 597
  • [48] Types of minority class examples and their influence on learning classifiers from imbalanced data
    Krystyna Napierala
    Jerzy Stefanowski
    Journal of Intelligent Information Systems, 2016, 46 : 563 - 597
  • [49] Hierarchical Combining of Classifiers in Privacy Preserving Data Mining
    Andruszkiewicz, Piotr
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, HAIS 2014, 2014, 8480 : 573 - 584
  • [50] Learning classifiers from imbalanced data based on biased minimax probability machine
    Huang, KZ
    Yang, HQ
    King, I
    Lyu, MR
    PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, 2004, : 558 - 563