Genetic Programming Based on Granular Computing for Classification with High-Dimensional Data

Cited by: 2
Authors
Pei, Wenbin [1]
Xue, Bing [1]
Shang, Lin [2]
Zhang, Mengjie [1]
Affiliations
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
Keywords
High-dimensional data; Genetic programming; Granular computing; Classification; Neural networks
DOI
10.1007/978-3-030-03991-2_58
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Classification becomes more challenging under the curse of dimensionality, and a growing number of datasets now have thousands of features. Many classification algorithms need feature selection to avoid the curse of dimensionality. Genetic programming (GP) has shown success in classification tasks and does not require a separate feature selection step, because it has a built-in capability to select informative features automatically. However, GP-based methods often need intensive computation to achieve good classification accuracy. Drawing on perspectives from granular computing (GrC), this paper proposes a new approach that links features hierarchically for GP-based classification. Experiments on seven high-dimensional datasets show that the proposed algorithm reduces training time and improves classification accuracy compared with baseline methods.
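The abstract's point about GP's built-in feature selection can be illustrated with a small sketch. This is not the paper's algorithm: the tuple-based tree encoding, the function names, and the sign-based decision rule are all assumptions made for illustration, and the GrC-based hierarchical linking of features is not sketched because the abstract does not specify it. The sketch only shows that a GP tree reads the features at its leaves, so a 1000-feature sample is classified using a handful of them.

```python
import operator

# Hypothetical encoding of a GP individual (an assumption, not from the paper):
#   ("x", i)           -> the value of feature i
#   (op, left, right)  -> a binary arithmetic node
#   a float or int     -> a constant
OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}


def evaluate(tree, sample):
    """Recursively evaluate an expression tree on one sample."""
    if isinstance(tree, (int, float)):
        return float(tree)
    if tree[0] == "x":
        return float(sample[tree[1]])
    op, left, right = tree
    return OPS[op](evaluate(left, sample), evaluate(right, sample))


def used_features(tree):
    """Collect the feature indices that appear at the leaves of the tree."""
    if isinstance(tree, (int, float)):
        return set()
    if tree[0] == "x":
        return {tree[1]}
    _, left, right = tree
    return used_features(left) | used_features(right)


def classify(tree, sample):
    """Assumed binary decision rule: class 1 if the tree output is positive."""
    return 1 if evaluate(tree, sample) > 0 else 0


# A hand-written tree standing in for an evolved individual: x3 * x120 - (x7 + 0.5)
tree = ("-", ("*", ("x", 3), ("x", 120)), ("+", ("x", 7), 0.5))

# One sample with 1000 features; only three of them matter to this tree.
sample = [0.0] * 1000
sample[3], sample[120], sample[7] = 2.0, 1.5, 1.0

selected = sorted(used_features(tree))
label = classify(tree, sample)
print(selected, label)
```

In a real GP run the tree would be evolved rather than written by hand, and the set returned by `used_features` would be the features that the evolved classifier implicitly selected.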
Pages: 643-655
Number of pages: 13