OmicPredict: a framework for omics data prediction using ANOVA-Firefly algorithm for feature selection

Cited by: 1
Authors
Kaur, Parampreet [1]
Singh, Ashima [1]
Chana, Inderveer [1]
Affiliations
[1] Thapar Inst Engn & Technol, Comp Sci & Engn Dept, Patiala, India
Keywords
Omics data; deep neural network (DNN); breast cancer; Alzheimer's disease; COVID-19; BREAST-CANCER; CLINICAL-SIGNIFICANCE; TELOMERASE; EXPRESSION; HER2
DOI
10.1080/10255842.2023.2268236
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Applying high-throughput technologies and machine learning (ML) to large pools of medical data, such as omics data, enables efficient analysis. Recent research aims to develop and apply ML models that predict disease in a timely manner from available omics datasets. The present work proposes a framework, 'OmicPredict', that deploys a hybrid feature selection method and a deep neural network (DNN) model to predict multiple diseases from omics data. The hybrid feature selection method is built from the Analysis of Variance (ANOVA) technique and the firefly algorithm. The OmicPredict framework is applied to three case studies: Alzheimer's disease, breast cancer, and Coronavirus disease 2019 (COVID-19). In the Alzheimer's disease case study, the framework predicts Alzheimer's disease patients using the GSE33000 and GSE44770 datasets. In the breast cancer case study, the framework predicts human epidermal growth factor receptor 2 (HER2) subtype status using the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) dataset. In the COVID-19 case study, the framework classifies patients using the GSE157103 dataset. The experimental results show that the DNN model achieved an Area Under Curve (AUC) score of 0.949 on the Alzheimer's (GSE33000 and GSE44770) datasets. Furthermore, it achieved AUC scores of 0.987 and 0.989 on the breast cancer (METABRIC) and COVID-19 (GSE157103) datasets, respectively, outperforming Random Forest and Naive Bayes models as well as existing research.
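
The abstract does not specify how the ANOVA and firefly stages are combined, so the following is only a minimal sketch of one common arrangement: an ANOVA F-test filter that keeps the top-ranked features, followed by a firefly search over subsets of the survivors. Every name and parameter here (`anova_top_k`, `firefly_select`, `hybrid_select`, `beta0`, `gamma`, `alpha`, `penalty`) is hypothetical, and the nearest-centroid fitness is a lightweight stand-in for the DNN used in the paper, not the authors' method.

```python
import math
import random

def anova_f(values, labels):
    """One-way ANOVA F-statistic of one feature (assumes >= 2 classes)."""
    groups = {}
    for v, lab in zip(values, labels):
        groups.setdefault(lab, []).append(v)
    n, k = len(values), len(groups)
    grand = sum(values) / n
    means = {c: sum(g) / len(g) for c, g in groups.items()}
    ss_between = sum(len(g) * (means[c] - grand) ** 2 for c, g in groups.items())
    ss_within = sum((v - means[c]) ** 2 for c, g in groups.items() for v in g)
    if ss_within == 0:
        return float("inf") if ss_between > 0 else 0.0
    return (ss_between / (k - 1)) / (ss_within / (n - k))

def anova_top_k(X, y, k):
    """Filter stage: indices of the k features with the largest F-statistic."""
    scores = [anova_f([row[j] for row in X], y) for j in range(len(X[0]))]
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)[:k]

def fitness(mask, X, y, penalty=0.01):
    """Stand-in fitness: nearest-centroid training accuracy minus a size penalty."""
    cols = [j for j, m in enumerate(mask) if m]
    if not cols:
        return 0.0
    cents = {}
    for c in set(y):
        rows = [r for r, lab in zip(X, y) if lab == c]
        cents[c] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    correct = 0
    for r, lab in zip(X, y):
        pred = min(cents, key=lambda c: sum((r[j] - m) ** 2
                                            for j, m in zip(cols, cents[c])))
        correct += pred == lab
    return correct / len(y) - penalty * len(cols)

def firefly_select(X, y, n_fireflies=8, n_iter=15,
                   beta0=1.0, gamma=1.0, alpha=0.2, seed=0):
    """Wrapper stage: firefly search; a position above 0.5 means 'feature kept'."""
    rng = random.Random(seed)
    d = len(X[0])

    def to_mask(p):
        return [v > 0.5 for v in p]

    pos = [[rng.random() for _ in range(d)] for _ in range(n_fireflies)]
    light = [fitness(to_mask(p), X, y) for p in pos]
    for _ in range(n_iter):
        for i in range(n_fireflies):
            for j in range(n_fireflies):
                if light[j] > light[i]:  # move i toward the brighter firefly j
                    r2 = sum((a - b) ** 2 for a, b in zip(pos[i], pos[j]))
                    beta = beta0 * math.exp(-gamma * r2)  # attractiveness decays with distance
                    pos[i] = [min(1.0, max(0.0, a + beta * (b - a)
                                           + alpha * (rng.random() - 0.5)))
                              for a, b in zip(pos[i], pos[j])]
                    light[i] = fitness(to_mask(pos[i]), X, y)
    best = max(range(n_fireflies), key=lambda i: light[i])
    return to_mask(pos[best]), light[best]

def hybrid_select(X, y, k, **firefly_kwargs):
    """ANOVA filter followed by firefly search over the surviving features."""
    keep = anova_top_k(X, y, k)
    X_reduced = [[row[j] for j in keep] for row in X]
    mask, score = firefly_select(X_reduced, y, **firefly_kwargs)
    return [j for j, m in zip(keep, mask) if m], score
```

In this arrangement the cheap filter shrinks the search space before the more expensive wrapper search runs, which matters for omics data where features far outnumber samples. The indices returned by `hybrid_select` refer to columns of the original matrix and would be used to subset the data before training a classifier.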
Pages: 1970-1983
Page count: 14