Bio inspired Ensemble Feature Selection (BEFS) Model with Machine Learning and Data Mining Algorithms for Disease Risk Prediction

Cited by: 3
Authors
Pasha, Syed Javeed [1]
Mohamed, E. Syed [2]
Affiliations
[1] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Comp Applicat, Chennai, Tamil Nadu, India
[2] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Comp Sci, Chennai, Tamil Nadu, India
Keywords
Bio-inspired ensemble feature selection (BEFS) model; machine learning; data mining; feature selection; health care; disease risk prediction; breast cancer risk prediction; genetic algorithm; random forest; logistic regression; BREAST-CANCER; DIAGNOSIS
DOI
10.1109/iccubea47591.2019.9129304
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The use of machine learning (ML) and data mining (DM) algorithms for disease risk prediction has become increasingly common in healthcare in recent years. Traditional feature selection models are often combined with DM and ML algorithms to improve prediction accuracy. This study introduces a new Bio-inspired Ensemble Feature Selection (BEFS) model for use with DM and ML algorithms. In the BEFS model, the most relevant and most strongly contributing features are identified with a bio-inspired algorithm, the genetic algorithm, and an ensemble algorithm, random forest. The important features obtained from the model are then combined in various combinations and used with two DM and ML algorithms, logistic regression (LR) and random forest (RF), with promising results. The experiments are carried out in R on the Breast Cancer Wisconsin (Diagnostic) dataset from the UCI (University of California, Irvine) ML repository. The highest accuracy attained with the BEFS model is 96.49%, the AUC (area under the curve) is 96%, and the sensitivity is 98.11%. These results are higher than those of several existing works, while using only the six most relevant of the dataset's thirty-two features.
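The genetic-algorithm half of the feature selection described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it is written in Python, not R, uses only the standard library, runs on synthetic data, not the Wisconsin dataset, and scores each candidate subset with a nearest-centroid classifier in place of LR or RF. The function names, the sparsity penalty, and all parameter values are assumptions made for illustration, and the random forest importance step of BEFS is omitted.

```python
import random


def make_data(n_samples=200, n_features=10, informative=(0, 3, 7), seed=0):
    """Synthetic binary data: only the `informative` columns depend on the label."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n_samples):
        label = rng.randint(0, 1)
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        for j in informative:
            row[j] += 2.0 * label  # shift informative features by class
        X.append(row)
        y.append(label)
    return X, y


def centroid_accuracy(X_train, y_train, X_test, y_test, mask):
    """Accuracy of a nearest-centroid classifier restricted to the masked features."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    centroids = {}
    for c in (0, 1):
        rows = [x for x, t in zip(X_train, y_train) if t == c]
        centroids[c] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    correct = 0
    for x, t in zip(X_test, y_test):
        dists = {c: sum((x[j] - m) ** 2 for j, m in zip(cols, cen))
                 for c, cen in centroids.items()}
        correct += min(dists, key=dists.get) == t
    return correct / len(y_test)


def genetic_select(X, y, generations=20, pop_size=20,
                   mutation_rate=0.1, penalty=0.01, seed=1):
    """Search over bit masks (1 = keep feature) with a simple genetic algorithm."""
    rng = random.Random(seed)
    n = len(X[0])
    split = int(0.75 * len(X))
    X_tr, y_tr, X_te, y_te = X[:split], y[:split], X[split:], y[split:]

    def fitness(mask):
        # Hold-out accuracy minus a small (assumed) penalty per selected feature.
        return centroid_accuracy(X_tr, y_tr, X_te, y_te, mask) - penalty * sum(mask)

    population = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]  # truncation selection, keeps the elite
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)  # single-point crossover
            child = a[:cut] + b[cut:]
            child = [1 - bit if rng.random() < mutation_rate else bit for bit in child]
            children.append(child)
        population = parents + children
    best = max(population, key=fitness)
    return best, fitness(best)


if __name__ == "__main__":
    X, y = make_data()
    mask, score = genetic_select(X, y)
    print("selected feature indices:", [j for j, bit in enumerate(mask) if bit])
    print("penalised hold-out accuracy:", round(score, 3))
```

Because the same hold-out split both guides the search and reports the score, the printed accuracy is optimistic; a separate test split would be needed for an honest estimate.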
Pages: 6