LAGOA: Learning automata based grasshopper optimization algorithm for feature selection in disease datasets

被引:14
|
作者
Dey, Chiradeep [1 ]
Bose, Rajarshi [1 ]
Ghosh, Kushal Kanti [2 ]
Malakar, Samir [3 ]
Sarkar, Ram [2 ]
机构
[1] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata, India
[2] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
[3] Asutosh Coll, Dept Comp Sci, Kolkata, India
关键词
Grasshopper optimization algorithm; Learning automata; Two-phase mutation; Biomedical data; Feature selection; Cancer data; SEARCH ALGORITHM; CLASSIFICATION; SYSTEM;
D O I
10.1007/s12652-021-03155-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In predictive modelling it is important to use any feature selection methods as irrelevant features when used with powerful classifiers can lead to over-fitting and thus create models which fail to perform as good as when these features are not used. Particularly it is important in case of disease datasets where various features or attributes are available through the patients' medical records and many features in these datasets may not be relevant to the diagnosis of some specific disease. Wrong models in this case can be disastrous and lead to wrong diagnosis, and maybe in extreme cases lead to loss of life. To this end, we have used a wrapper based feature selection model for the said purpose. In recent years, Grasshopper Optimization Algorithm (GOA) has proved its superiority over other optimization algorithms in different research areas. In this paper, we propose an improved version of GOA, called (LAGOA), which uses Learning Automata (LA) for adjusting the parameters of GOA in an adaptive way, and two-phase mutation for enhancing exploitation capability of the algorithm. LA is used for adjusting the parameter values of each grasshopper in the population individually. In two-phase mutation the first phase reduces the number of selected features while preserving high classification accuracy, while the second phase adds relevant features which increase the classification accuracy. Proposed method has been applied to Breast Cancer (Wisconsin), Breast Cancer (Diagnosis), Statlog (Heart), Lung Cancer, SpectF Heart and Hepatitis datasets taken from UCI Machine Learning Repository. Experimental results confirm its superiority over state-of-the-art methods considered here for comparison.
引用
收藏
页码:3175 / 3194
页数:20
相关论文
共 50 条
  • [41] Feature Selection Approach Based on Whale Optimization Algorithm
    Sharawi, Marwa
    Zawbaa, Hossam M.
    Emary, E.
    Zawbaa, Hossam M.
    [J]. 2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2017, : 163 - 168
  • [42] Binary grasshopper optimisation algorithm approaches for feature selection problems
    Mafarja, Majdi
    Aljarah, Ibrahim
    Faris, Hossam
    Hammouri, Abdelaziz I.
    Al-Zoubi, Ala' M.
    Mirjalili, Seyedali
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 117 : 267 - 286
  • [43] Improved grasshopper optimization algorithm using opposition-based learning
    Ewees, Ahmed A.
    Abd Elaziz, Mohamed
    Houssein, Essam H.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 112 : 156 - 172
  • [44] Feature Selection for High-Dimensional and Imbalanced Biomedical Data Based on Robust Correlation Based Redundancy and Binary Grasshopper Optimization Algorithm
    Abdulrauf Sharifai, Garba
    Zainol, Zurinahni
    [J]. GENES, 2020, 11 (07)
  • [45] Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR
    MotieGhader, Habib
    Gharaghani, Sajjad
    Masoudi-Sobhanzadeh, Yosef
    Masoudi-Nejad, Ali
    [J]. IRANIAN JOURNAL OF PHARMACEUTICAL RESEARCH, 2017, 16 (02): : 533 - 553
  • [46] Ant Colony Algorithm for Feature Selection on Microarray Datasets
    Fahrudin, Tresna Maulana
    Syarif, Iwan
    Barakbah, Ali Ridho
    [J]. 2016 INTERNATIONAL ELECTRONICS SYMPOSIUM (IES), 2016, : 351 - 356
  • [47] A biobjective feature selection algorithm for large omics datasets
    Cavique, Luis
    Mendes, Armando B.
    Martiniano, Hugo F. M. C.
    Correia, Luis
    [J]. EXPERT SYSTEMS, 2018, 35 (04)
  • [48] A Novel Hybrid Algorithm for Feature Selection Based on Whale Optimization Algorithm
    Zheng, Yuefeng
    Li, Ying
    Wang, Gang
    Chen, Yupeng
    Xu, Qian
    Fan, Jiahao
    Cui, Xueting
    [J]. IEEE ACCESS, 2019, 7 : 14908 - 14923
  • [49] Datasets Meta-Feature Description for Recommending Feature Selection Algorithm
    Filchenkov, Andrey
    Pendryak, Arseniy
    [J]. 2015 ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE AND INFORMATION EXTRACTION, SOCIAL MEDIA AND WEB SEARCH FRUCT CONFERENCE (AINL-ISMW FRUCT), 2015, : 11 - 18
  • [50] An Improved Gannet Optimization Algorithm Based on Opposition-Based Schemes for Feature Selection Problems in High-Dimensional Datasets
    Avinash N.
    Sinha S.K.
    Shivamurthaiah M.
    [J]. SN Computer Science, 5 (1)