A Novel Feature Selection Method for Classification of Medical Data Using Filters, Wrappers, and Embedded Approaches

被引:8
|
作者
Bashir, Saba [1 ]
Khattak, Irfan Ullah [2 ]
Khan, Aihab [2 ]
Khan, Farhan Hassan [3 ]
Gani, Abdullah [4 ]
Shiraz, Muhammad [1 ]
机构
[1] Fed Urdu Univ Arts Sci & Technol, Dept Comp Sci, Islamabad, Pakistan
[2] Iqra Univ, Dept Comp & Technol, Islamabad, Pakistan
[3] Natl Univ Sci & Technol NUST, Coll E & ME, Dept Comp Engn, Knowledge & Data Sci Res Ctr KDRC, Islamabad, Pakistan
[4] Univ Malaysia Sabah, Fac Comp & Informat, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
关键词
HEART-FAILURE EPIDEMIC; DIAGNOSIS; DISEASE; MODELS; AGE;
D O I
10.1155/2022/8190814
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Feature selection is the process of identifying the most relevant features from the given data having a large feature space. Microarray datasets are comprised of high-quality features and very few samples of data. Feature selection is performed on such datasets to identify the optimal feature subset. The major goal of feature selection is to improve the accuracy by identifying a minimal feature subset. For this purpose, the proposed research focused on analyzing and identifying effective feature selection algorithms. A novel framework is proposed which utilizes different feature selection methods from filters, wrappers, and embedded algorithms. Furthermore, classification is then performed on selected features to classify the data using a support vector machine (SVM) classifier. Two publically available benchmark datasets are used, i.e., the Microarray dataset and the Cleveland Heart Disease dataset, for experimentation and analysis, and they are archived from the UCI data repository. The performance of SVM is analyzed using accuracy, sensitivity, specificity, and f-measure. The accuracy of 94.45% and 91% is achieved on each dataset, respectively.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] A novel feature selection approach:: Combining feature wrappers and filters
    Uncu, Oezge
    Tuerksen, I. B.
    INFORMATION SCIENCES, 2007, 177 (02) : 449 - 466
  • [2] An Embedded Feature Selection Method for Imbalanced Data Classification
    Liu, Haoyue
    Zhou, MengChu
    Liu, Qing
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (03) : 703 - 715
  • [3] An Embedded Feature Selection Method for Imbalanced Data Classification
    Haoyue Liu
    MengChu Zhou
    Qing Liu
    IEEE/CAAJournalofAutomaticaSinica, 2019, 6 (03) : 703 - 715
  • [4] Hybrid feature selection by combining filters and wrappers
    Hsu, Hui-Huang
    Hsieh, Cheng-Wei
    Lu, Ming-Da
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) : 8144 - 8150
  • [5] Novel and efficient method on feature selection and data classification
    Chen, Tieming
    Ma, Jixia
    Huang, Samuel H.
    Cai, Jiamei
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2012, 49 (04): : 735 - 745
  • [6] A Feature Selection Method Combining Filter and Wrapper Approaches for Medical Dataset Classification
    Yavuz, Gurcan
    Moghanjoughi, Mohammad Karrari
    Dumlu, Hatem
    Cakir, Halil Ibrahim
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2025,
  • [7] Ensembles of wrappers for automated feature selection in fish age classification
    Bermejo, Sergio
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2017, 134 : 27 - 32
  • [8] A novel optimal feature selection technique for medical data classification using ANOVA based whale optimization
    Moorthy, Usha
    Gandhi, Usha Devi
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 12 (03) : 3527 - 3538
  • [9] A novel feature selection method for image classification
    Singh, D. Asir Antony Gnana
    Balamurugan, S. Appavu Alias
    Leavline, E. Jebamalar
    OPTOELECTRONICS AND ADVANCED MATERIALS-RAPID COMMUNICATIONS, 2015, 9 (11-12): : 1362 - 1368
  • [10] A Classification Method for Incomplete Mixed Data Using Imputation and Feature Selection
    Li, Gengsong
    Zheng, Qibin
    Liu, Yi
    Li, Xiang
    Qin, Wei
    Diao, Xingchun
    APPLIED SCIENCES-BASEL, 2024, 14 (14):