Imbalanced Classification with TPG Genetic Programming: Impact of Problem Imbalance and Selection Mechanisms

被引:2
|
作者
Sourbier, Nicolas [1 ]
Bonnot, Justine [1 ]
Majorczyk, Frederic [2 ]
Gesny, Olivier [3 ]
Guyet, Thomas [4 ]
Pelcat, Maxime [1 ]
机构
[1] Univ Rennes, INSA Rennes, CNRS, IETR,UMR 6164, Rennes, France
[2] DGA MI, CIDRE, Bruz, France
[3] Silicom, 3 E Rue Paris, Cesson Sevigne, France
[4] INRIA, Ctr Lyon, Villeurbanne, France
关键词
classification; machine learning; genetic programming; imbalanced data; tangled program graphs; selection;
D O I
10.1145/3520304.3529008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research advances on Tangled Program Graphs (TPGs) have demonstrated that Genetic Programming (GP) can be used to build accurate classifiers. However, this performance has been tested on balanced classification problems while most of the real world classification problems are imbalanced, with both over-represented classes and rare classes. This paper explores the effect of imbalanced data on the performance of a TPG classifier, and proposes mitigation methods for imbalance-caused classifier performance degradation using adapted GP selection phases. The GP selection phase is characterized by a fitness function, and by a comparison operator. We show that adapting the TPG to imbalanced data significantly improves the classifier performance. The proposed adaptations on the fitness make the TPG agent capable to fit a model even with 104 less examples than the majority class whereas the revised selection phase of the GP process increases the robustness of the method for moderate imbalance ratios.
引用
收藏
页码:608 / 611
页数:4
相关论文
共 50 条
  • [1] Bayes Imbalance Impact Index: A Measure of Class Imbalanced Data Set for Classification Problem
    Lu, Yang
    Cheung, Yiu-Ming
    Tang, Yuan Yan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) : 3525 - 3539
  • [2] SVGPM: evolving SVM decision function by using genetic programming to solve imbalanced classification problem
    Muhammad Syafiq Mohd Pozi
    Nur Athirah Azhar
    Abdul Rafiez Abdul Raziff
    Lina Hazmi Ajrina
    [J]. Progress in Artificial Intelligence, 2022, 11 : 65 - 77
  • [3] SVGPM: evolving SVM decision function by using genetic programming to solve imbalanced classification problem
    Pozi, Muhammad Syafiq Mohd
    Azhar, Nur Athirah
    Raziff, Abdul Rafiez Abdul
    Ajrina, Lina Hazmi
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2022, 11 (01) : 65 - 77
  • [4] A new fitness function in genetic programming for classification of imbalanced data
    Kumar, Arvind
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024, 36 (07) : 1021 - 1033
  • [5] A Classification Model For Class Imbalance Dataset Using Genetic Programming
    Tahir, Mirza Amaad Ul Haq
    Asghar, Sohail
    Manzoor, Awais
    Noor, Muhammad Asim
    [J]. IEEE ACCESS, 2019, 7 : 71013 - 71037
  • [6] Classification of Imbalanced data sets using Multi Objective Genetic Programming
    Maheta, Hardik H.
    Dabhi, Vipul K.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2015,
  • [7] A Genetic Programming Approach to Binary Classification Problem
    Santoso L.W.
    Singh B.
    Rajest S.S.
    Regin R.
    Kadhim K.H.
    [J]. EAI Endorsed Transactions on Energy Web, 2021, 8 (31): : 1 - 8
  • [8] Selection of Augmented Data for Overcoming the Imbalance Problem in Facies Classification
    Kim, Dowan
    Byun, Joongmoo
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [9] Evolving ensembles using multi-objective genetic programming for imbalanced classification
    Zhang, Liang
    Wang, Kefan
    Xu, Luyuan
    Sheng, Wenjia
    Kang, Qi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 255
  • [10] Automatic selection of features for classification using genetic programming
    Sherrah, J
    Bogner, RE
    Bouzerdoum, A
    [J]. ANZIIS 96 - 1996 AUSTRALIAN NEW ZEALAND CONFERENCE ON INTELLIGENT INFORMATION SYSTEMS, PROCEEDINGS, 1996, : 284 - 287