OmicPredict: a framework for omics data prediction using ANOVA-Firefly algorithm for feature selection

被引:1
|
作者
Kaur, Parampreet [1 ]
Singh, Ashima [1 ]
Chana, Inderveer [1 ]
机构
[1] Thapar Inst Engn & Technol, Comp Sci & Engn Dept, Patiala, India
关键词
Omics data; deep neural network (DNN); breast cancer; Alzheimer's disease; COVID-19; BREAST-CANCER; CLINICAL-SIGNIFICANCE; TELOMERASE; EXPRESSION; HER2;
D O I
10.1080/10255842.2023.2268236
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High-throughput technologies and machine learning (ML), when applied to a huge pool of medical data such as omics data, result in efficient analysis. Recent research aims to apply and develop ML models to predict a disease well in time using available omics datasets. The present work proposed a framework, 'OmicPredict', deploying a hybrid feature selection method and deep neural network (DNN) model to predict multiple diseases using omics data. The hybrid feature selection method is developed using the Analysis of Variance (ANOVA) technique and firefly algorithm. The OmicPredict framework is applied to three case studies, Alzheimer's disease, Breast cancer, and Coronavirus disease 2019 (COVID-19). In the case study of Alzheimer's disease, the framework predicts patients using GSE33000 and GSE44770 dataset. In the case study of Breast cancer, the framework predicts human epidermal growth factor receptor 2 (HER2) subtype status using Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) dataset. In the case study of COVID-19, the framework performs patients' classification using GSE157103 dataset. The experimental results show that DNN model achieved an Area Under Curve (AUC) score of 0.949 for the Alzheimer's (GSE33000 and GSE44770) dataset. Furthermore, it achieved an AUC score of 0.987 and 0.989 for breast cancer (METABRIC) and COVID-19 (GSE157103) datasets, respectively, outperforming Random Forest, Naive Bayes models, and the existing research.
引用
收藏
页码:1970 / 1983
页数:14
相关论文
共 50 条
  • [31] Feature Selection and Classification of Big Data Using MapReduce Framework
    Devi, D. Renuka
    Sasikala, S.
    INTELLIGENT COMPUTING, INFORMATION AND CONTROL SYSTEMS, ICICCS 2019, 2020, 1039 : 666 - 673
  • [32] Improving firefly algorithm-based logistic regression for feature selection
    Kahya, Mohammed Abdulrazaq
    Altamir, Suhaib Abduljabbar
    Algamal, Zakariya Yahya
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2019, 22 (08) : 1577 - 1581
  • [33] Hybrid firefly particle swarm optimisation algorithm for feature selection problems
    Ragab, Mahmoud
    EXPERT SYSTEMS, 2024, 41 (07)
  • [34] A return-cost-based binary firefly algorithm for feature selection
    Zhang, Yong
    Song, Xian-fang
    Gong, Dun-wei
    INFORMATION SCIENCES, 2017, 418 : 561 - 574
  • [35] Software fault prediction using firefly algorithm
    Arora, Ishani
    Saha, Anju
    INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2018, 6 (3-4) : 356 - 377
  • [36] NMF-guided feature selection and genetic algorithm-driven framework for tumor mutational burden classification in bladder cancer using multi-omics data
    Al-Ghafer, Ibrahim Abed
    Alafeshat, Noor
    Alshomali, Lujain
    Alanee, Shaheen
    Qattous, Hazem
    Azzeh, Mohammad
    Alkhateeb, Abedalrhman
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2024, 13 (01):
  • [37] Wrapper-filter feature selection algorithm using a memetic framework
    Zhu, Zexuan
    Ong, Yew-Soon
    Dash, Manoranjan
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (01): : 70 - 76
  • [38] Feature Selection and Classification of Microarray Data for Cancer Prediction Using MapReduce Implementation of Random Forest Algorithm
    Dhanalakshmi, R.
    Khaire, Utkarsh M.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2019, 78 (03): : 158 - 161
  • [39] PepAls: Performance Prediction and Algorithm Selection Framework for Data Mining Applications
    You, Mingyu
    Xu, Xuanhui
    Wang, Zheng
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4648 - 4657
  • [40] Feature Selection using Gravitational Search Algorithm for Biomedical Data
    Nagpal, Sushama
    Arora, Sanchit
    Dey, Sangeeta
    Shreya
    7TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2017), 2017, 115 : 258 - 265