Bio inspired Ensemble Feature Selection (BEFS) Model with Machine Learning and Data Mining Algorithms for Disease Risk Prediction

被引:3
|
作者
Pasha, Syed Javeed [1 ]
Mohamed, E. Syed [2 ]
机构
[1] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Comp Applicat, Chennai, Tamil Nadu, India
[2] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Comp Sci, Chennai, Tamil Nadu, India
关键词
Bio inspired ensemble feature selection (BEFS) model; machine learning; data mining; feature selection; health care; disease risk prediction; breast cancer risk prediction; genetic algorithm; random forest; logistic regression; BREAST-CANCER; DIAGNOSIS;
D O I
10.1109/iccubea47591.2019.9129304
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Use of machine learning (ML) and data mining (DM) algorithms has surfaced more often in the recent years for disease risk prediction problems in the healthcare communities. Several traditional feature selection models are combined with the DM and ML algorithms to improve accuracy of the disease risk prediction. In this study, a new Bio-inspired Ensemble Feature Selection (BEFS) model is introduced which is applied with the DM and ML algorithms. In the BEFS model, the most relevant and highly contributing features in the prediction are determined with a bio-inspired algorithm i.e., genetic algorithm, and an ensemble algorithm i.e., random forest algorithm. These important features obtained from the proposed model are then combined in various combinations and applied with the DM and ML algorithms, here logistic regression (LR) and random forest (RF), and the results obtained are promising. The experiment is executed using the famous ML language R. To accomplish this objective, the Breast Cancer Wisconsin (Diagnostic) dataset of UCI (University of California, Irvine) ML repository is utilized. In the experimental outcomes, the highest accuracy attained with the BEFS model is 96.49%, the AUC (Area Under Curve) achieved is 96%, and the sensitivity is 98.11%. These results, which greatly improve the disease risk prediction, are higher than several other existing works, while utilizing only six most relevant features out of the thirty two features of the dataset.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] A multidomain bio-inspired feature extraction and selection model for diabetic retinopathy severity classification: an ensemble learning approach
    Posham Uppamma
    Sweta Bhattacharya
    Scientific Reports, 13
  • [22] A multidomain bio-inspired feature extraction and selection model for diabetic retinopathy severity classification: an ensemble learning approach
    Uppamma, Posham
    Bhattacharya, Sweta
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [23] An Ensemble Machine Learning and Data Mining Approach to Enhance Stroke Prediction
    Wijaya, Richard
    Saeed, Faisal
    Samimi, Parnia
    Albarrak, Abdullah M.
    Qasem, Sultan Noman
    BIOENGINEERING-BASEL, 2024, 11 (07):
  • [24] A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction
    Pudjihartono, Nicholas
    Fadason, Tayaza
    Kempa-Liehr, Andreas W.
    O'Sullivan, Justin M.
    FRONTIERS IN BIOINFORMATICS, 2022, 2
  • [25] Sarcopenia feature selection and risk prediction using machine learning
    Yoo, Jun-Il
    Park, Chan-Ho
    Kim, Hyeonmok
    JOURNAL OF BONE AND MINERAL RESEARCH, 2019, 34 : 145 - 145
  • [26] Heart Disease Prediction Model Using Feature Selection and Ensemble Deep Learning with Optimized Weight
    Al-Mahdi, Iman S.
    Darwish, Saad M.
    Madbouly, Magda M.
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2025,
  • [27] A diabetes prediction model based on Boruta feature selection and ensemble learning
    Zhou, Hongfang
    Xin, Yinbo
    Li, Suli
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [28] Prediction of Cardiovascular Disease by Feature Selection and Machine Learning Techniques
    Ranade, Aditya
    Pise, Nitin
    ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 2, AITA 2023, 2024, 844 : 457 - 472
  • [29] A diabetes prediction model based on Boruta feature selection and ensemble learning
    Hongfang Zhou
    Yinbo Xin
    Suli Li
    BMC Bioinformatics, 24
  • [30] Data Driven Feature Selection for Machine Learning Algorithms in Computer Vision
    Zhang, Fan
    Li, Wei
    Zhang, Yifan
    Feng, Zhiyong
    IEEE INTERNET OF THINGS JOURNAL, 2018, 5 (06): : 4262 - 4272