Optimizing Sequential Forward Selection on Classification Using Genetic Algorithm

被引:0
|
作者
Chotchantarakun K. [1 ]
机构
[1] Department of Information Studies, Faculty of Humanities and Social Science, Burapha University, 169 Longhaad Bangsaen Rd, Saensuk, Mueang, Chonburi
来源
Informatica (Slovenia) | 2023年 / 47卷 / 09期
关键词
classification accuracy; data mining; genetic algorithm; optimization; sequential feature selection;
D O I
10.31449/inf.v47i9.4964
中图分类号
学科分类号
摘要
Regarding the digital transformation of modern technologies, the amount of data increases significantly resulting in novel knowledge discovery techniques in Data Analytic and Data Mining. These data usually consist of noises or non-informative features that affect the analysis results. The features-eliminating approaches have been studied extensively in the past few decades name feature selection. It is a significant preprocessing step of the mining process, which selects only the informative features from the original feature set. These selected features improve the learning model efficiency. This study proposes a forward sequential feature selection method called Forward Selection with Genetic Algorithm (FS-GA). FS-GA consists of three major steps. First, it creates the preliminarily selected subsets. Second, it provides an improvement on the previous subsets. Third, it optimizes the selected subset using the genetic algorithm. Hence, it maximizes the classification accuracy during the feature addition. We performed experiments based on ten standard UCI datasets using three popular classification models including the Decision Tree, Naive Bayes, and K-Nearest Neighbour classifiers. The results are compared with the state-of-the-art methods. FS-GA has shown the best results against the other sequential forward selection methods for all the tested datasets with O(n2) time complexity. © 2023 Slovene Society Informatika. All rights reserved.
引用
收藏
页码:81 / 90
页数:9
相关论文
共 50 条
  • [1] Emotional speech classification using Gaussian mixture models and the sequential floating forward selection algorithm
    Ververidis, D
    Kotropoulos, C
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 1501 - 1504
  • [2] A Framework for Optimizing Malware Classification by Using Genetic Algorithm
    Yusoff, Mohd Najwadi
    Jantan, Aman
    [J]. SOFTWARE ENGINEERING AND COMPUTER SYSTEMS, PT 2, 2011, 180 : 58 - 72
  • [3] Round-Robin sequential forward selection algorithm for prostate cancer classification and diagnosis using multispectral imagery
    Sabrina Bouatmane
    Mohamed Ali Roula
    Ahmed Bouridane
    Somaya Al-Maadeed
    [J]. Machine Vision and Applications, 2011, 22 : 865 - 878
  • [4] Round-Robin sequential forward selection algorithm for prostate cancer classification and diagnosis using multispectral imagery
    Bouatmane, Sabrina
    Roula, Mohammed Ali
    Bouridane, Ahmed
    Al-Maadeed, Somaya
    [J]. MACHINE VISION AND APPLICATIONS, 2011, 22 (05) : 865 - 878
  • [5] Trajectory Classification Using Feature Selection by Genetic Algorithm
    Saini, Rajkumar
    Kumar, Pradeep
    Roy, Partha Pratim
    Pal, Umapada
    [J]. PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2018, VOL 2, 2020, 1024 : 377 - 388
  • [6] Feature Selection Using Sequential Forward Selection and classification applying Artificial Metaplasticity Neural Network
    Marcano-Cedeno, A.
    Quintanilla-Dominguez, J.
    Cortina-Januchs, M. G.
    Andina, D.
    [J]. IECON 2010 - 36TH ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2010,
  • [7] Optimizing feed-forward neural networks using cascaded genetic algorithm
    Zhou, LX
    Li, M
    Yang, XQ
    [J]. THIRD INTERNATIONAL SYMPOSIUM ON MULTISPECTRAL IMAGE PROCESSING AND PATTERN RECOGNITION, PTS 1 AND 2, 2003, 5286 : 183 - 186
  • [8] Feature Selection Classification of Skin Cancer using Genetic Algorithm
    Srividya, T. D.
    Arulmozhi, V.
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES 2018), 2018, : 412 - 417
  • [9] Variable selection for financial distress classification using a genetic algorithm
    Galvao, RKH
    Becerra, VM
    Abou-Seada, M
    [J]. CEC'02: PROCEEDINGS OF THE 2002 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2002, : 2000 - 2005
  • [10] Classification of Parkinson Disease with Feature Selection using Genetic Algorithm
    Iftikhar, Mahnoor
    Ali, Nisar
    Ali, Raja Hashim
    Bais, Abdul
    [J]. 2023 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE, 2023,