Genetic Programming Based on Granular Computing for Classification with High-Dimensional Data

被引:2
|
作者
Pei, Wenbin [1 ]
Xue, Bing [1 ]
Shang, Lin [2 ]
Zhang, Mengjie [1 ]
机构
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
关键词
High-dimensional data; Genetic programming; Granular computing; Classification; NEURAL-NETWORKS;
D O I
10.1007/978-3-030-03991-2_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification tasks become more challenging when having the curse of dimensionality issue. Recently, there has been an increasing number of datasets with thousands of features. Some classification algorithms often need feature selection to avoid the curse of dimensionality. Genetic programming (GP) has shown success in classification tasks. GP does not require to do feature selection because of its built-in capability to automatically select informative features. However, GP-based methods are often computationally intensive to achieve a good classification accuracy. Based on perspectives from granular computing (GrC), this paper proposes a new approach to linking features hierarchically for GP-based classification. Experiments on seven high-dimensional datasets show the effectiveness of the proposed algorithm in terms of saving training time and enhancing the classification accuracy, compared to baseline methods.
引用
收藏
页码:643 / 655
页数:13
相关论文
共 50 条
  • [41] A depth-based nearest neighbor algorithm for high-dimensional data classification
    Harikumar S.
    Aravindakshan Savithri A.
    Kaimal R.
    Turkish Journal of Electrical Engineering and Computer Sciences, 2019, 27 (06): : 4082 - 4101
  • [42] Interaction-based feature selection and classification for high-dimensional biological data
    Wang, Haitian
    Lo, Shaw-Hwa
    Zheng, Tian
    Hu, Inchi
    BIOINFORMATICS, 2012, 28 (21) : 2834 - 2842
  • [43] Weighted linear programming discriminant analysis for high-dimensional binary classification
    Wu, Yufei
    Yu, Guan
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (05) : 437 - 450
  • [44] Visualization Study of High-dimensional Data Classification Based on PCA-SVM
    Zhao Zhongwen
    Guo Huanghuang
    2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC), 2017, : 346 - 349
  • [45] Defining and Evaluating Classification Algorithm for High-Dimensional Data Based on Latent Topics
    Luo, Le
    Li, Li
    PLOS ONE, 2014, 9 (01):
  • [46] An Efficient Extraction-based Bagging Ensemble for High-dimensional data classification
    Huang, Hsiao-Yun
    Li, Yen-Chieh
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1557 - 1560
  • [47] A depth-based nearest neighbor algorithm for high-dimensional data classification
    Harikumar, Sandhya
    Aravindakshan Savithri, Akhil
    Kaimal, Ramachandra
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (06) : 4082 - 4101
  • [48] Computing with High-Dimensional Vectors
    Kanerva, Pentti
    IEEE DESIGN & TEST, 2019, 36 (03) : 7 - 14
  • [49] Classifier Ensemble Based on Multiview Optimization for High-Dimensional Imbalanced Data Classification
    Xu, Yuhong
    Yu, Zhiwen
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 870 - 883
  • [50] A classification method for high-dimensional imbalanced multi-classification data
    Li, Mengmeng
    Zheng, Qibin
    Liu, Yi
    Li, Gengsong
    Qin, Wei
    Ren, Xiaoguang
    ELECTRONICS LETTERS, 2023, 59 (20)