A resource-efficient tool for mixed model association analysis of large-scale data

Authors
Longda Jiang
Zhili Zheng
Ting Qi
Kathryn E. Kemper
Naomi R. Wray
Peter M. Visscher
Jian Yang
Affiliations
[1] The University of Queensland, Institute for Molecular Bioscience
[2] Wenzhou Medical University, Institute for Advanced Research
[3] The University of Queensland, Queensland Brain Institute
Source
Nature Genetics | 2019 / Vol. 51
Abstract
The genome-wide association study (GWAS) has been widely used as an experimental design to detect associations between genetic variants and a phenotype. Two major confounding factors, population stratification and relatedness, could potentially lead to inflated GWAS test statistics and hence to spurious associations. Mixed linear model (MLM)-based approaches can be used to account for sample structure. However, genome-wide association (GWA) analyses in biobank samples such as the UK Biobank (UKB) often exceed the capability of most existing MLM-based tools especially if the number of traits is large. Here, we develop an MLM-based tool (fastGWA) that controls for population stratification by principal components and for relatedness by a sparse genetic relationship matrix for GWA analyses of biobank-scale data. We demonstrate by extensive simulations that fastGWA is reliable, robust and highly resource-efficient. We then apply fastGWA to 2,173 traits on array-genotyped and imputed samples from 456,422 individuals and to 2,048 traits on whole-exome-sequenced samples from 46,191 individuals in the UKB.
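The abstract describes the approach only at a high level: principal components enter as covariates, and relatedness is modelled through a sparse genetic relationship matrix (GRM). The following is a minimal illustrative sketch of a generic mixed-model single-variant test under those assumptions, not the fastGWA implementation. The function name `gls_snp_test` and its parameters are hypothetical, and the variance components `var_g` and `var_e` are assumed to be already known, whereas a real tool would estimate them from the data.

```python
import numpy as np
from scipy import sparse
from scipy.sparse.linalg import splu
from scipy.stats import chi2


def gls_snp_test(y, snp, covars, grm_sparse, var_g, var_e):
    """Generalized least squares test of one SNP under V = var_g*GRM + var_e*I.

    Hypothetical helper for illustration only. `covars` would typically hold
    principal components (population stratification); `grm_sparse` is a sparse
    relationship matrix (relatedness). Variance components are assumed known.
    """
    n = y.shape[0]
    # Phenotypic covariance matrix, kept sparse so the factorization stays cheap.
    V = (var_g * grm_sparse + var_e * sparse.identity(n, format="csc")).tocsc()
    lu = splu(V)

    # Design matrix: intercept, covariates (e.g. PCs), then the SNP last.
    W = np.column_stack([np.ones(n), covars, snp])
    Vinv_W = lu.solve(W)
    Vinv_y = lu.solve(y)

    # GLS estimate and its covariance: (W' V^-1 W)^-1 W' V^-1 y.
    cov_beta = np.linalg.inv(W.T @ Vinv_W)
    beta = cov_beta @ (W.T @ Vinv_y)

    beta_snp = beta[-1]
    se_snp = np.sqrt(cov_beta[-1, -1])
    stat = (beta_snp / se_snp) ** 2  # 1-df chi-square statistic
    return beta_snp, se_snp, chi2.sf(stat, df=1)
```

Because the GRM is sparse (most pairs of individuals are treated as unrelated), the factorization of `V` is far cheaper than with a dense matrix, which is the intuition behind the resource efficiency claimed in the abstract.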
Pages: 1749-1755
Page count: 6