New Fitness Functions in Genetic Programming for Classification with High-dimensional Unbalanced Data

被引:0
|
作者
Pei, Wenbin [1 ]
Xue, Bing [1 ]
Shang, Lin [2 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Jiangsu, Peoples R China
关键词
Classification; Genetic Programming; Fitness Functions; High-dimensionality; Class Imbalance; FEATURE-SELECTION;
D O I
10.1109/cec.2019.8789974
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
High-dimensionality and class imbalance represent two main challenges in classification. Recently, there is a growing number of datasets exhibiting the characteristics of the combination of the class imbalance and high-dimensionality. Genetic programming (GP) has been successfully applied to solve high-dimensional classification tasks. However, most existing GP methods may also suffer from a performance bias if the class distribution is unbalanced. Using fitness functions for cost adjustment is one of the most important methods in GP to address the class imbalance issue. This paper develops new fitness functions in GP to address the class imbalance issue in classification with high-dimensional unbalanced data. Two fitness functions are proposed to increase the performance of the traditional accuracy measures, and one fitness function is proposed to approximate Area Under Curve (AUC) with the goal to save the training time. Experiments on six high-dimensional unbalanced datasets show the better performance of the proposed fitness functions, compared to existing fitness functions.
引用
收藏
页码:2779 / 2786
页数:8
相关论文
共 50 条
  • [41] A training algorithm for classification of high-dimensional data
    Vieira, A
    Barradas, N
    NEUROCOMPUTING, 2003, 50 : 461 - 472
  • [42] Ensemble Method for Classification of High-Dimensional Data
    Piao, Yongjun
    Park, Hyun Woo
    Jin, Cheng Hao
    Ryu, Keun Ho
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 245 - +
  • [43] Neural networks trained with high-dimensional functions approximation data in high-dimensional space
    Zheng, Jian
    Wang, Jianfeng
    Chen, Yanping
    Chen, Shuping
    Chen, Jingjin
    Zhong, Wenlong
    Wu, Wenling
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (02) : 3739 - 3750
  • [44] Neural networks trained with high-dimensional functions approximation data in high-dimensional space
    Zheng, Jian
    Wang, Jianfeng
    Chen, Yanping
    Chen, Shuping
    Chen, Jingjin
    Zhong, Wenlong
    Wu, Wenling
    Journal of Intelligent and Fuzzy Systems, 2021, 41 (02): : 3739 - 3750
  • [45] Evolving Ensembles in Multi-objective Genetic Programming for Classification with Unbalanced Data
    Bhowan, Urvesh
    Johnston, Mark
    Zhang, Mengjie
    GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 1331 - 1338
  • [46] Data-dependent kernels for high-dimensional data classification
    Wang, JD
    Kwok, JT
    Shen, HC
    Quan, L
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 102 - 107
  • [47] A New Ensemble Method with Feature Space Partitioning for High-Dimensional Data Classification
    Piao, Yongjun
    Piao, Minghao
    Jin, Cheng Hao
    Shon, Ho Sun
    Chung, Ji-Moon
    Hwang, Buhyun
    Ryu, Keun Ho
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [48] Genetic Programming for Imputation Predictor Selection and Ranking in Symbolic Regression with High-Dimensional Incomplete Data
    Al-Helali, Baligh
    Chen, Qi
    Xue, Bing
    Zhang, Mengjie
    AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 523 - 535
  • [49] Genetic-Programming-Based Architecture of Fuzzy Modeling: Towards Coping With High-Dimensional Data
    Safari Mamaghani, Ali
    Pedrycz, Witold
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (09) : 2774 - 2784
  • [50] Exploration of high-dimensional data manifolds for object classification
    Shah, N
    Waagen, D
    Ordaz, M
    Cassabaum, M
    Coit, A
    AUTOMATIC TARGET RECOGNITON XV, 2005, 5807 : 400 - 408