Genetic Programming Based on Granular Computing for Classification with High-Dimensional Data

Cited by: 2
Authors
Pei, Wenbin [1]
Xue, Bing [1]
Shang, Lin [2]
Zhang, Mengjie [1]
Affiliations
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
[2] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
Keywords
High-dimensional data; Genetic programming; Granular computing; Classification; Neural networks
DOI
10.1007/978-3-030-03991-2_58
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Classification becomes more challenging under the curse of dimensionality. Datasets with thousands of features have become increasingly common, and many classification algorithms need feature selection to avoid the curse of dimensionality. Genetic programming (GP) has shown success in classification tasks and does not require a separate feature selection step, because it has a built-in capability to select informative features automatically. However, GP-based methods are often computationally intensive when they must achieve good classification accuracy. Drawing on perspectives from granular computing (GrC), this paper proposes a new approach that links features hierarchically for GP-based classification. Experiments on seven high-dimensional datasets show that the proposed algorithm reduces training time and improves classification accuracy compared with baseline methods.
Pages: 643-655
Number of pages: 13