Machine Learning to Advance Human Genome-Wide Association Studies

被引:2
|
作者
Sigala, Rafaella E. [1 ]
Lagou, Vasiliki [1 ]
Shmeliov, Aleksey [1 ]
Atito, Sara [2 ,3 ]
Kouchaki, Samaneh [2 ,3 ]
Awais, Muhammad [2 ,3 ]
Prokopenko, Inga [1 ,2 ]
Mahdi, Adam [4 ]
Demirkan, Ayse [1 ,2 ]
机构
[1] Dept Clin & Expt Med, Sect Stat Multiom, Guildford GU2 7XH, Surrey, England
[2] Univ Surrey, Surrey Inst People Centred Artificial Intelligence, Guildford GU2 7XH, Surrey, England
[3] Univ Surrey, Ctr Vis Speech Signal Proc, Guildford GU2 7XH, Surrey, England
[4] Univ Oxford, Oxford Internet Inst, Oxford OX1 3JS, Oxon, England
关键词
genome-wide association; human genetics; machine learning; RISK PREDICTION; GENE; DISEASE; GWAS; PRIORITIZATION; SCHIZOPHRENIA; DISCOVERY; VARIANTS; OBESITY; FTO;
D O I
10.3390/genes15010034
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Machine learning, including deep learning, reinforcement learning, and generative artificial intelligence are revolutionising every area of our lives when data are made available. With the help of these methods, we can decipher information from larger datasets while addressing the complex nature of biological systems in a more efficient way. Although machine learning methods have been introduced to human genetic epidemiological research as early as 2004, those were never used to their full capacity. In this review, we outline some of the main applications of machine learning to assigning human genetic loci to health outcomes. We summarise widely used methods and discuss their advantages and challenges. We also identify several tools, such as Combi, GenNet, and GMSTool, specifically designed to integrate these methods for hypothesis-free analysis of genetic variation data. We elaborate on the additional value and limitations of these tools from a geneticist's perspective. Finally, we discuss the fast-moving field of foundation models and large multi-modal omics biobank initiatives.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Genome-Wide Association Studies and Diet
    Ferguson, Lynnette R.
    JOURNAL OF NUTRIGENETICS AND NUTRIGENOMICS, 2010, 3 (4-6) : 144 - 150
  • [42] The road to genome-wide association studies
    Kruglyak, Leonid
    NATURE REVIEWS GENETICS, 2008, 9 (04) : 314 - 318
  • [43] Genome-wide association studies in ADHD
    Barbara Franke
    Benjamin M. Neale
    Stephen V. Faraone
    Human Genetics, 2009, 126 : 13 - 50
  • [44] Genome-Wide Association Studies of Cancer
    Stadler, Zsofia K.
    Thom, Peter
    Robson, Mark E.
    Weitzel, Jeffrey N.
    Kauff, Noah D.
    Hurley, Karen E.
    Devlin, Vincent
    Gold, Bert
    Klein, Robert J.
    Offit, Kenneth
    JOURNAL OF CLINICAL ONCOLOGY, 2010, 28 (27) : 4255 - 4267
  • [45] Genome-wide association studies in pharmacogenomics
    Daly, Ann K.
    NATURE REVIEWS GENETICS, 2010, 11 (04) : 241 - 246
  • [46] Genome-Wide Association Studies in Atherosclerosis
    S. Sivapalaratnam
    M. M. Motazacker
    S. Maiwald
    G. K. Hovingh
    J. J. P. Kastelein
    M. Levi
    M. D. Trip
    G. M. Dallinga-Thie
    Current Atherosclerosis Reports, 2011, 13 : 225 - 232
  • [47] The road to genome-wide association studies
    Leonid Kruglyak
    Nature Reviews Genetics, 2008, 9 : 314 - 318
  • [48] Genome-wide association studies in atherothrombosis
    Lotta, Luca Andrea
    EUROPEAN JOURNAL OF INTERNAL MEDICINE, 2010, 21 (02) : 74 - 78
  • [49] Problems with genome-wide association studies
    Williams, Scott M.
    Canter, Jeffrey A.
    Crawford, Dana C.
    Moore, Jason H.
    Ritchie, Marylyn D.
    Haines, Jonathan L.
    SCIENCE, 2007, 316 (5833) : 1841 - 1842
  • [50] Genome-Wide Association Studies in Glioma
    Kinnersley, Ben
    Houlston, Richard S.
    Bondy, Melissa L.
    CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2018, 27 (04) : 418 - 428