Disease Prediction using Optimal Feature Selection from Epigenetic Data

被引:0
|
作者
Siyad, Mohammed B. [1 ]
Visakh, R. [1 ]
Nazeer, K. A. Abdul [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Calicut 673601, Kerala, India
关键词
Epigenetics; Genetic Algorithm; Fusion; Optimal Feature Selection; Prediction; GENE; KEGG;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Epigenetic changes are identified to correlate with gene expression changes among numerous diseases including cancer. Although integrated data analysis tools are available, a quantitative model that precisely predicts the diseases from epigenetic information is still in its infancy. In this paper, a data mining approach is presented for predicting a disease using the most relevant genes (treated as features) associated with it. A few existing feature selection methods have been applied independently and their results combined to form a fused feature set. For selecting the optimal set of genes from them, a genetic algorithm based evolutionary approach is proposed. They have been applied to a classifier for recognizing the disease from a set of reference samples. A key observation from our analysis is that the optimization using genetic algorithm gives better prediction accuracy than the existing methods. Hence the proposed optimal fusion based feature selection can be effectively applied for better disease diagnosis and prognosis.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Feature selection and risk prediction for patients with coronary artery disease using data mining
    Md Idris, Nashreen
    Chiam, Yin Kia
    Varathan, Kasturi Dewi
    Wan Ahmad, Wan Azman
    Chee, Kok Han
    Liew, Yih Miin
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (12) : 3123 - 3140
  • [2] Feature selection and risk prediction for patients with coronary artery disease using data mining
    Nashreen Md Idris
    Yin Kia Chiam
    Kasturi Dewi Varathan
    Wan Azman Wan Ahmad
    Kok Han Chee
    Yih Miin Liew
    [J]. Medical & Biological Engineering & Computing, 2020, 58 : 3123 - 3140
  • [3] Optimal Feature Selection using Fuzzy Combination of Feature Subset for Transcriptome Data
    Singh, Vikas
    Vardhan, Harsh
    Verma, Nishchal K.
    Cui, Yan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [4] Modelling of IDBN with LSNN based optimal feature selection for the prediction of CKD using real time data
    Pradeepa, P.
    Jeyakumar, M. K.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) : 6309 - 6344
  • [5] Modelling of IDBN with LSNN based optimal feature selection for the prediction of CKD using real time data
    P. Pradeepa
    M. K. Jeyakumar
    [J]. Multimedia Tools and Applications, 2023, 82 : 6309 - 6344
  • [6] Optimal feature selection using a modified differential evolution algorithm and its effectiveness for prediction of heart disease
    Vivekanandan, T.
    Iyengar, N. Ch Sriman Narayana
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2017, 90 : 125 - 136
  • [7] Correction to: Feature selection and risk prediction for patients with coronary artery disease using data mining
    Nashreen Md Idris
    Yin Kia Chiam
    Kasturi Dewi Varathan
    Wan Azman Wan Ahmad
    Kok Han Chee
    Yih Miin Liew
    [J]. Medical & Biological Engineering & Computing, 2022, 60 : 887 - 887
  • [8] Enhanced prediction of misalignment conditions from spectral data using feature selection and filtering
    Cho, Hyun-Woo
    Jeong, Myong K.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2008, 35 (1-2) : 451 - 458
  • [9] An optimal criterion feature selection method for prediction and effective analysis of heart disease
    S. Prakash
    K. Sangeetha
    N. Ramkumar
    [J]. Cluster Computing, 2019, 22 : 11957 - 11963
  • [10] An optimal criterion feature selection method for prediction and effective analysis of heart disease
    Prakash, S.
    Sangeetha, K.
    Ramkumar, N.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 5): : 11957 - 11963