Machine learning approaches to genome-wide association studies

被引:12
|
作者
Enoma, David O. [1 ,2 ]
Bishung, Janet [1 ]
Abiodun, Theresa [1 ]
Ogunlana, Olubanke [2 ,3 ]
Osamor, Victor Chukwudi [1 ,2 ]
机构
[1] Covenant Univ, Dept Comp & Informat Sci, Ota, Nigeria
[2] Covenant Univ, Covenant Appl Informat & Commun African Ctr Excel, Ota, Nigeria
[3] Covenant Univ, Dept Biochem, Ota, Nigeria
关键词
Genome-wide association studies; Machine learning; Statistical learning; Single neuclotide polymorphism; Risk prediction; Epistasis;
D O I
10.1016/j.jksus.2022.101847
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genome-wide Association Studies (GWAS) are conducted to identify single nucleotide polymorphisms (variants) associated with a phenotype within a specific population. These variants associated with diseases have a complex molecular aetiology with which they cause the disease phenotype. The genotyping data generated from subjects of study is of high dimensionality, which is a challenge. The problem is that the dataset has a large number of features and a relatively smaller sample size. However, statistical testing is the standard approach being applied to identify these variants that influence the phenotype of interest. The wide applications and abilities of Machine Learning (ML) algorithms promise to understand the effects of these variants better. The aim of this work is to discuss the applications and future trends of ML algorithms in GWAS towards understanding the effects of population genetic variant. It was discovered that algorithms such as classification, regression, ensemble, and neural networks have been applied to GWAS for which this work has further discussed comprehensively including their application areas. The ML algorithms have been applied to the identification of significant single nucleotide polymorphisms (SNP), disease risk assessment & prediction, detection of epistatic non-linear interaction, and integrated with other omics sets. This comprehensive review has highlighted these areas of application and sheds light on the promise of innovating machine learning algorithms into the computational and statistical pipeline of genome-wide association studies. This will be beneficial for better understanding of how variants are affected by disease biology and how the same variants can influence risk by developing a particular phenotype for favourable natural selection. (C) 2022 The Author(s). Published by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Machine Learning in Genome-Wide Association Studies
    Szymczak, Silke
    Biernacka, Joanna M.
    Cordell, Heather J.
    Gonzalez-Recio, Oscar
    Koenig, Inke R.
    Zhang, Heping
    Sun, Yan V.
    GENETIC EPIDEMIOLOGY, 2009, 33 : S51 - S57
  • [2] Editorial: Machine Learning in Genome-Wide Association Studies
    Hu, Ting
    Darabos, Christian
    Urbanowicz, Ryan
    FRONTIERS IN GENETICS, 2020, 11
  • [3] Leveraging machine learning to advance genome-wide association studies
    Dagasso, Gabrielle
    Yan, Yan
    Wang, Lipu
    Li, Longhai
    Kutcher, Randy
    Zhang, Wentao
    Jin, Lingling
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2021, 25 (1-2) : 17 - 36
  • [4] Machine Learning to Advance Human Genome-Wide Association Studies
    Sigala, Rafaella E.
    Lagou, Vasiliki
    Shmeliov, Aleksey
    Atito, Sara
    Kouchaki, Samaneh
    Awais, Muhammad
    Prokopenko, Inga
    Mahdi, Adam
    Demirkan, Ayse
    GENES, 2024, 15 (01)
  • [5] Network approaches to Genome-Wide Association studies
    Remondini, D.
    Verondini, E.
    Lescai, F.
    Zironi, I.
    Castellani, G.
    NUOVO CIMENTO DELLA SOCIETA ITALIANA DI FISICA C-COLLOQUIA ON PHYSICS, 2009, 32 (02): : 19 - 22
  • [6] Statistical approaches for genome-wide association studies
    Balding, D.
    EJC SUPPLEMENTS, 2008, 6 (09): : 187 - 187
  • [7] Current approaches of genome-wide association studies
    Jianfeng Xu Wake Forest University School of Medicine Winston-Salem
    Journal of Nanjing Medical University, 2008, (02) : 95 - 95
  • [8] Genetic interactions in lung cancer using machine-learning approaches in genome-wide association studies
    Byun, Jinyoung
    Han, Younghun
    Amos, Christopher I.
    CANCER RESEARCH, 2019, 79 (13)
  • [9] Genetic Architecture of Lung Cancer Using Machine-Learning Approaches in Genome-Wide Association Studies
    Byun, J.
    Han, Y.
    Edelson, J.
    Ostrom, Q.
    Amos, C.
    JOURNAL OF THORACIC ONCOLOGY, 2019, 14 (10) : S516 - S517
  • [10] Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches
    Sassi, Mohamed
    Bronsard, Julie
    Pascreau, Gaetan
    Emily, Mathieu
    Donnio, Pierre-Yves
    Revest, Matthieu
    Felden, Brice
    Wirth, Thierry
    Augagneur, Yoann
    MSYSTEMS, 2022, 7 (04)