Multi-objective evolution of oblique decision trees for imbalanced data binary classification

被引:13
|
作者
Chabbouh, Marwa [1 ]
Bechikh, Slim [1 ]
Hung, Chih-Cheng [2 ,3 ]
Ben Said, Lamjed [1 ]
机构
[1] Univ Tunis, SMART Lab, Comp Sci Dept, ISG Campus, Tunis, Tunisia
[2] Kennesaw State Univ, Kennesaw, GA 30144 USA
[3] Anyang Normal Univ, Anyang, Peoples R China
关键词
Imbalanced data binary classification; Oblique decision trees; Evolutionary algorithms; Multi-objective optimization; DATA-SETS; COEVOLUTIONARY ALGORITHM; STATISTICAL COMPARISONS; IMPROVING PREDICTION; GENETIC ALGORITHM; MINORITY CLASS; DISCRETIZATION; INDUCTION; SELECTION; CLASSIFIERS;
D O I
10.1016/j.swevo.2019.05.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imbalanced data classification is one of the most challenging problems in data mining. In this kind of problems, we have two types of classes: the majority class and the minority one. The former has a relatively high number of instances while the latter contains a much less number of instances. As most traditional classifiers usually assume that data is evenly distributed for all classes, they may considerably fail in recognizing instances in the minority class due to the imbalance problem. Several interesting approaches have been proposed to handle the class imbalance issue in the literature and the Oblique Decision Tree (ODT) is one of them. Nevertheless, most standard ODT construction algorithms use a greedy search process; while only very few works have addressed this induction problem using an evolutionary approach and this is done without really considering the class imbalance issue. To cope with this limitation, we propose in this paper a multi-objective evolutionary approach to find optimized ODTs for imbalanced binary classification. Our approach, called ODT-Theta-NSGA-III (ODT-based-Theta-Nondominated Sorting Genetic Algorithm-III), is motivated by its abilities: (a) to escape local optima in the ODT search space and (b) to maximize simultaneously both Precision and Recall. Thanks to these two features, ODT-Theta-NSGA-III provides competitive and better results when compared to many state-of-the-art classification algorithms on commonly used imbalanced benchmark data sets.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 50 条
  • [1] Binary and Multiclass Imbalanced Classification Using Multi-Objective Ant Programming
    Luis Olmo, Juan
    Cano, Alberto
    Raul Romero, Jose
    Ventura, Sebastian
    [J]. 2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, : 70 - 76
  • [2] Adaptive multi-objective swarm fusion for imbalanced data classification
    li, Jinyan
    Fong, Simon
    Wong, Raymond K.
    Chu, Victor W.
    [J]. INFORMATION FUSION, 2018, 39 : 1 - 24
  • [3] Multi-objective Automatic Algorithm Configuration for the Classification Problem of Imbalanced Data
    Tari, Sara
    Szczepanski, Nicolas
    Mousin, Lucien
    Jacques, Julie
    Kessaci, Marie-Eleonore
    Jourdan, Laetitia
    [J]. 2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [4] Ensembles of multi-objective decision trees
    Kocev, Dragi
    Vens, Celine
    Struyf, Jan
    Dzeroski, Saso
    [J]. MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 624 - +
  • [5] A Multi-Objective Evolutionary Approach to Imbalanced Classification Problems
    Chira, Camelia
    Lemnaru, Camelia
    [J]. 2015 IEEE 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP), 2015, : 149 - 154
  • [6] SVM ensemble training for imbalanced data classification using multi-objective optimization techniques
    Joanna Grzyb
    Michał Woźniak
    [J]. Applied Intelligence, 2023, 53 : 15424 - 15441
  • [7] SVM ensemble training for imbalanced data classification using multi-objective optimization techniques
    Grzyb, Joanna
    Wozniak, Michal
    [J]. APPLIED INTELLIGENCE, 2023, 53 (12) : 15424 - 15441
  • [8] A Hybrid CP/MOLS Approach for Multi-Objective Imbalanced Classification
    Szczepanski, Nicolas
    Audemard, Gilles
    Jourdan, Laetitia
    Lecoutre, Christophe
    Mousin, Lucien
    Veerapen, Nadarajen
    [J]. PROCEEDINGS OF THE 2021 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'21), 2021, : 723 - 731
  • [9] Multi-objective genetic programming optimization of decision trees for classifying medical data
    Mugambi, EM
    Hunter, A
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2003, 2773 : 293 - 299
  • [10] Lexicographic multi-objective evolutionary induction of decision trees
    Basgalupp, Marcia P.
    de Carvalho, Andre C. P. L. F.
    Barros, Rodrigo C.
    Ruiz, Duncan D.
    Freitas, Alex A.
    [J]. INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2009, 1 (1-2) : 105 - 117