Iterative sure independence screening EM-Bayesian LASSO algorithm for multi-locus genome-wide association studies

被引:177
|
作者
Tamba, Cox Lwaka [1 ,2 ]
Ni, Yuan-Li [1 ]
Zhang, Yuan-Ming [1 ,3 ]
机构
[1] Nanjing Agr Univ, State Key Lab Crop Genet & Germplasm Enhancement, Nanjing, Jiangsu, Peoples R China
[2] Egerton Univ, Dept Math, Egerton, Kenya
[3] Huazhong Agr Univ, Stat Genom Lab, Coll Plant Sci & Technol, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
PENALIZED LOGISTIC-REGRESSION; VARIABLE SELECTION; ORACLE PROPERTIES;
D O I
10.1371/journal.pcbi.1005357
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Genome-wide association study (GWAS) entails examining a large number of single nucleotide polymorphisms (SNPs) in a limited sample with hundreds of individuals, implying a variable selection problem in the high dimensional dataset. Although many single-locus GWAS approaches under polygenic background and population structure controls have been widely used, some significant loci fail to be detected. In this study, we used an iterative modified-sure independence screening (ISIS) approach in reducing the number of SNPs to a moderate size. Expectation-Maximization (EM)-Bayesian least absolute shrinkage and selection operator (BLASSO) was used to estimate all the selected SNP effects for true quantitative trait nucleotide (QTN) detection. This method is referred to as ISIS EM-BLASSO algorithm. Monte Carlo simulation studies validated the new method, which has the highest empirical power in QTN detection and the highest accuracy in QTN effect estimation, and it is the fastest, as compared with efficient mixed-model association (EMMA), smoothly clipped absolute deviation (SCAD), fixed and random model circulating probability unification (FarmCPU), and multi-locus random-SNP-effect mixed linear model (mrMLM). To further demonstrate the new method, six flowering time traits in Arabidopsis thaliana were re-analyzed by four methods (New method, EMMA, FarmCPU, and mrMLM). As a result, the new method identified most previously reported genes. Therefore, the new method is a good alternative for multi-locus GWAS.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A fast algorithm for Bayesian multi-locus model in genome-wide association studies
    Duan, Weiwei
    Zhao, Yang
    Wei, Yongyue
    Yang, Sheng
    Bai, Jianling
    Shen, Sipeng
    Du, Mulong
    Huang, Lihong
    Hu, Zhibin
    Chen, Feng
    [J]. MOLECULAR GENETICS AND GENOMICS, 2017, 292 (04) : 923 - 934
  • [2] A fast algorithm for Bayesian multi-locus model in genome-wide association studies
    Weiwei Duan
    Yang Zhao
    Yongyue Wei
    Sheng Yang
    Jianling Bai
    Sipeng Shen
    Mulong Du
    Lihong Huang
    Zhibin Hu
    Feng Chen
    [J]. Molecular Genetics and Genomics, 2017, 292 : 923 - 934
  • [3] A Two-Stage Mutual Information Based Bayesian Lasso Algorithm for Multi-Locus Genome-Wide Association Studies
    Guo, Hongping
    Yu, Zuguo
    An, Jiyuan
    Han, Guosheng
    Ma, Yuanlin
    Tang, Runbin
    [J]. ENTROPY, 2020, 22 (03)
  • [4] The Bayesian lasso for genome-wide association studies
    Li, Jiahan
    Das, Kiranmoy
    Fu, Guifang
    Li, Runze
    Wu, Rongling
    [J]. BIOINFORMATICS, 2011, 27 (04) : 516 - 523
  • [5] Multi-locus Test and Correction for Confounding Effects in Genome-Wide Association Studies
    Chen, Donglai
    Liu, Chuanhai
    Xie, Jun
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2016, 12 (02):
  • [6] A Fast Multi-Locus Ridge Regression Algorithm for High-Dimensional Genome-Wide Association Studies
    Zhang, Jin
    Chen, Min
    Wen, Yangjun
    Zhang, Yin
    Lu, Yunan
    Wang, Shengmeng
    Chen, Juncong
    [J]. FRONTIERS IN GENETICS, 2021, 12
  • [7] Multi-Locus Genome-Wide Association Studies Reveal Fruit Quality Hotspots in Peach Genome
    Linge, Cassia da Silva
    Cai, Lichun
    Fu, Wanfang
    Clark, John
    Worthington, Margaret
    Rawandoozi, Zena
    Byrne, David H.
    Gasic, Ksenija
    [J]. FRONTIERS IN PLANT SCIENCE, 2021, 12
  • [8] Fast and efficient correction for population stratification in multi-locus genome-wide association studies
    Liu, Rui
    Yuan, Min
    Xu, Xu Steven
    Yang, Yaning
    [J]. GENETICA, 2021, 149 (5-6) : 313 - 325
  • [9] Fast and efficient correction for population stratification in multi-locus genome-wide association studies
    Rui Liu
    Min Yuan
    Xu Steven Xu
    Yaning Yang
    [J]. Genetica, 2021, 149 : 313 - 325
  • [10] Multi-Locus Genome-Wide Association Studies for 14 Main Agronomic Traits in Barley
    Hu, Xin
    Zuo, Jianfang
    Wang, Jibin
    Liu, Lipan
    Sun, Genlou
    Li, Chengdao
    Ren, Xifeng
    Sun, Dongfa
    [J]. FRONTIERS IN PLANT SCIENCE, 2018, 9