Prioritization of risk genes for Alzheimer's disease: an analysis framework using spatial and temporal gene expression data in the human brain based on support vector machine

被引:4
|
作者
Wang, Shiyu [1 ]
Fang, Xixian [1 ]
Wen, Xiang [2 ]
Yang, Congying [1 ]
Yang, Ying [1 ]
Zhang, Tianxiao [1 ,3 ]
机构
[1] Xi An Jiao Tong Univ, Sch Publ Hlth, Dept Epidemiol & Biostat, Hlth Sci Ctr, Xian, Peoples R China
[2] Univ Chinese Acad Sci, Hangzhou Inst Adv Study, Beijing, Peoples R China
[3] Shaanxi Reg Ctr, Natl Antidrug Lab, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
Alzheimer's disease; risk gene prioritization; gene expression patterns; machine learning; genome-wide association analyses; REGULATORS;
D O I
10.3389/fgene.2023.1190863
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Alzheimer's disease (AD) is a complex disorder, and its risk is influenced by multiple genetic and environmental factors. In this study, an AD risk gene prediction framework based on spatial and temporal features of gene expression data (STGE) was proposed.Methods: We proposed an AD risk gene prediction framework based on spatial and temporal features of gene expression data. The gene expression data of providers of different tissues and ages were used as model features. Human genes were classified as AD risk or non-risk sets based on information extracted from relevant databases. Support vector machine (SVM) models were constructed to capture the expression patterns of genes believed to contribute to the risk of AD.Results: The recursive feature elimination (RFE) method was utilized for feature selection. Data for 64 tissue-age features were obtained before feature selection, and this number was reduced to 19 after RFE was performed. The SVM models were built and evaluated using 19 selected and full features. The area under curve (AUC) values for the SVM model based on 19 selected features (0.740 [0.690-0.790]) and full feature sets (0.730 [0.678-0.769]) were very similar. Fifteen genes predicted to be risk genes for AD with a probability greater than 90% were obtained.Conclusion: The newly proposed framework performed comparably to previous prediction methods based on protein-protein interaction (PPI) network properties. A list of 15 candidate genes for AD risk was also generated to provide data support for further studies on the genetic etiology of AD.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Support vector machine model of developmental brain gene expression data for prioritization of Autism risk gene candidates
    Cogill, S.
    Wang, L.
    BIOINFORMATICS, 2016, 32 (23) : 3611 - 3618
  • [2] Machine Learning Framework for the Prediction of Alzheimer's Disease Using Gene Expression Data Based on Efficient Gene Selection
    El-Gawady, Aliaa
    Makhlouf, Mohamed A.
    Tawfik, BenBella S.
    Nassar, Hamed
    SYMMETRY-BASEL, 2022, 14 (03):
  • [3] Detection of Novel Biomarker Genes of Alzheimer's Disease Using Gene Expression Data
    Perera, Shehan
    Hewage, Kaveesha
    Gunarathne, Chamara
    Navarathna, Rajitha
    Herath, Damayanthi
    Ragel, Roshan G.
    MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2020, : 656 - 661
  • [4] Volumetric Histogram-Based Alzheimer's Disease Detection Using Support Vector Machine
    Elshatoury, Heba
    Avots, Egils
    Anbarjafari, Gholamreza
    JOURNAL OF ALZHEIMERS DISEASE, 2019, 72 (02) : 515 - 524
  • [5] Gene Expression Data Classification using Support Vector Machine and Mutual Information-based Gene Selection
    Vanitha, Devi Arockia C.
    Devaraj, D.
    Venkatesulu, M.
    GRAPH ALGORITHMS, HIGH PERFORMANCE IMPLEMENTATIONS AND ITS APPLICATIONS (ICGHIA 2014), 2015, 47 : 13 - 21
  • [6] Analysis of SPECT brain images for the diagnosis of Alzheimer's disease using moments and support vector machines
    Salas-Gonzalez, Diego
    Gorriz, Juan M.
    Ramirez, Javier
    Lopez, Miriam
    Illan, Ignacio A.
    Segovia, Fermin
    Puntonet, Carlos G.
    Gomez-Rio, Manuel
    NEUROSCIENCE LETTERS, 2009, 461 (01) : 60 - 64
  • [7] Research on Voxel-Based Features Detection and Analysis of Alzheimer's Disease Using Random Survey Support Vector Machine
    Meng, Xianglian
    Wu, Yue
    Liu, Wenjie
    Wang, Ying
    Xu, Zhe
    Jiao, Zhuqing
    FRONTIERS IN NEUROINFORMATICS, 2022, 16
  • [8] Support vector machine-based classification of Alzheimer’s disease from whole-brain anatomical MRI
    Benoît Magnin
    Lilia Mesrob
    Serge Kinkingnéhun
    Mélanie Pélégrini-Issac
    Olivier Colliot
    Marie Sarazin
    Bruno Dubois
    Stéphane Lehéricy
    Habib Benali
    Neuroradiology, 2009, 51 : 73 - 83
  • [9] Support vector machine-based classification of Alzheimer's disease from whole-brain anatomical MRI
    Magnin, Benoit
    Mesrob, Lilia
    Kinkingnehun, Serge
    Pelegrini-Issac, Melanie
    Colliot, Olivier
    Sarazin, Marie
    Dubois, Bruno
    Lehericy, Stephane
    Benali, Habib
    NEURORADIOLOGY, 2009, 51 (02) : 73 - 83
  • [10] Using Rule-Based Machine Learning for Candidate Disease Gene Prioritization and Sample Classification of Cancer Gene Expression Data
    Glaab, Enrico
    Bacardit, Jaume
    Garibaldi, Jonathan M.
    Krasnogor, Natalio
    PLOS ONE, 2012, 7 (07):