New Fitness Functions in Genetic Programming for Classification with High-dimensional Unbalanced Data

被引:0
|
作者
Pei, Wenbin [1 ]
Xue, Bing [1 ]
Shang, Lin [2 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Jiangsu, Peoples R China
关键词
Classification; Genetic Programming; Fitness Functions; High-dimensionality; Class Imbalance; FEATURE-SELECTION;
D O I
10.1109/cec.2019.8789974
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
High-dimensionality and class imbalance represent two main challenges in classification. Recently, there is a growing number of datasets exhibiting the characteristics of the combination of the class imbalance and high-dimensionality. Genetic programming (GP) has been successfully applied to solve high-dimensional classification tasks. However, most existing GP methods may also suffer from a performance bias if the class distribution is unbalanced. Using fitness functions for cost adjustment is one of the most important methods in GP to address the class imbalance issue. This paper develops new fitness functions in GP to address the class imbalance issue in classification with high-dimensional unbalanced data. Two fitness functions are proposed to increase the performance of the traditional accuracy measures, and one fitness function is proposed to approximate Area Under Curve (AUC) with the goal to save the training time. Experiments on six high-dimensional unbalanced datasets show the better performance of the proposed fitness functions, compared to existing fitness functions.
引用
收藏
页码:2779 / 2786
页数:8
相关论文
共 50 条
  • [1] Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced Data
    Bhowan, Urvesh
    Johnston, Mark
    Zhang, Mengjie
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (02): : 406 - 421
  • [2] Fitness functions in genetic programming for classification with unbalanced data
    Patterson, Grant
    Zhang, Mengjie
    AI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4830 : 769 - 775
  • [3] Reuse of Program Trees in Genetic Programming with a New Fitness Function in High-dimensional Unbalanced Classification
    Pei, Wenbin
    Xue, Bing
    Shang, Lin
    Zhang, Mengjie
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 187 - 188
  • [4] High-Dimensional Unbalanced Binary Classification by Genetic Programming with Multi-Criterion Fitness Evaluation and Selection
    Pei, Wenbin
    Xue, Bing
    Shang, Lin
    Zhang, Mengjie
    EVOLUTIONARY COMPUTATION, 2022, 30 (01) : 99 - 129
  • [5] Genetic Programming for Borderline Instance Detection in High-dimensional Unbalanced Classification
    Pei, Wenbin
    Xue, Bing
    Shang, Lin
    Zhang, Mengjie
    PROCEEDINGS OF THE 2021 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'21), 2021, : 349 - 357
  • [6] Improving Fitness Functions in Genetic Programming for Classification on Unbalanced Credit Card Data
    Cao, Van Loi
    Le-Khac, Nhien-An
    O'Neill, Michael
    Nicolau, Miguel
    McDermott, James
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2016, PT I, 2016, 9597 : 35 - 45
  • [7] A Threshold-free Classification Mechanism in Genetic Programming for High-dimensional Unbalanced Classification
    Pei, Wenbin
    Xue, Bing
    Shang, Lin
    Zhang, Mengjie
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,
  • [8] A Cost-sensitive Genetic Programming Approach for High-dimensional Unbalanced Classification
    Pei, Wenbin
    Xue, Bing
    Zhang, Mengjie
    Shang, Lin
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 1770 - 1777
  • [9] Unbalanced breast cancer data classification using novel fitness functions in genetic programming
    Devarriya, Divyaansh
    Gulati, Cairo
    Mansharamani, Vidhi
    Sakalle, Aditi
    Bhardwaj, Arpit
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
  • [10] Genetic programming for high-dimensional imbalanced classification with a new fitness function and program reuse mechanism
    Wenbin Pei
    Bing Xue
    Lin Shang
    Mengjie Zhang
    Soft Computing, 2020, 24 : 18021 - 18038