Accounting for linkage disequilibrium in genome-wide association studies: A penalized regression method

被引:0
|
作者
Liu, Jin [1 ]
Wang, Kai [2 ]
Ma, Shuangge [1 ]
Huang, Jian [3 ]
机构
[1] Yale Univ, Sch Publ Hlth, New Haven, CT 06520 USA
[2] Univ Iowa, Dept Biostat, Iowa City, IA 52242 USA
[3] Univ Iowa, Dept Biostat, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
基金
美国国家科学基金会;
关键词
Genetic association; Feature selection; Linkage disequilibrium; Penalized regression; Single nucleotide polymorphism; VARIABLE SELECTION; LASSO; REGULARIZATION; SPARSITY;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Penalized regression methods are becoming increasingly popular in genome-wide association studies (GWAS) for identifying genetic markers associated with disease. However, standard penalized methods such as LASSO do not take into account the possible linkage disequilibrium between adjacent markers. We propose a novel penalized approach for GWAS using a dense set of single nucleotide polymorphisms (SNPs). The proposed method uses the minimax concave penalty (MCP) for marker selection and incorporates linkage disequilibrium (LD) information by penalizing the difference of the genetic effects at adjacent SNPs with high correlation. A coordinate descent algorithm is derived to implement the proposed method. This algorithm is efficient in dealing with a large number of SNPs. A multi-split method is used to calculate the p-values of the selected SNPs for assessing their significance. We refer to the proposed penalty function as the smoothed MCP and the proposed approach as the SMCP method. Performance of the proposed SMCP method and its comparison with LASSO and MCP approaches are evaluated through simulation studies, which demonstrate that the proposed method is more accurate in selecting associated SNPs. Its applicability to real data is illustrated using heterogeneous stock mice data and a rheumatoid arthritis.
引用
收藏
页码:99 / 115
页数:17
相关论文
共 50 条
  • [1] Penalized Regression and Risk Prediction in Genome-Wide Association Studies
    Austin, Erin
    Pan, Wei
    Shen, Xiaotong
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2013, 6 (04) : 315 - 328
  • [2] Genome-wide association studies using a penalized moving-window regression
    Bao, Minli
    Wang, Kai
    [J]. BIOINFORMATICS, 2017, 33 (24) : 3887 - 3894
  • [3] Regularized regression method for genome-wide association studies
    Jin Liu
    Kai Wang
    Shuangge Ma
    Jian Huang
    [J]. BMC Proceedings, 5 (Suppl 9)
  • [4] CHROMSCAN:: genome-wide association using a linkage disequilibrium map
    Collins, Andrew
    Lau, Winston
    [J]. JOURNAL OF HUMAN GENETICS, 2008, 53 (02) : 121 - 126
  • [5] CHROMSCAN: genome-wide association using a linkage disequilibrium map
    Andrew Collins
    Winston Lau
    [J]. Journal of Human Genetics, 2008, 53 : 121 - 126
  • [6] Genome-wide association analysis by lasso penalized logistic regression
    Wu, Tong Tong
    Chen, Yi Fang
    Hastie, Trevor
    Sobel, Eric
    Lange, Kenneth
    [J]. BIOINFORMATICS, 2009, 25 (06) : 714 - 721
  • [7] PENALIZED REGRESSION FOR GENOME-WIDE ASSOCIATION SCREENING OF SEQUENCE DATA
    Zhou, H.
    Alexander, D. H.
    Sehl, M. E.
    Sinsheimer, J. S.
    Sobel, E. M.
    Lange, K.
    [J]. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2011, 2011, : 106 - 117
  • [8] Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies
    Susan Service
    Joseph DeYoung
    Maria Karayiorgou
    J Louw Roos
    Herman Pretorious
    Gabriel Bedoya
    Jorge Ospina
    Andres Ruiz-Linares
    António Macedo
    Joana Almeida Palha
    Peter Heutink
    Yurii Aulchenko
    Ben Oostra
    Cornelia van Duijn
    Marjo-Riitta Jarvelin
    Teppo Varilo
    Lynette Peddle
    Proton Rahman
    Giovanna Piras
    Maria Monne
    Sarah Murray
    Luana Galver
    Leena Peltonen
    Chiara Sabatti
    Andrew Collins
    Nelson Freimer
    [J]. Nature Genetics, 2006, 38 : 556 - 560
  • [9] Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies
    Service, S
    DeYoung, J
    Karayiorgou, M
    Roos, JL
    Pretorious, H
    Bedoya, G
    Ospina, J
    Ruiz-Linares, A
    Macedo, A
    Palha, JA
    Heutink, P
    Aulchenko, Y
    Oostra, B
    van Duijn, C
    Jarvelin, MR
    Varilo, T
    Peddle, L
    Rahman, P
    Piras, G
    Monne, M
    Murray, S
    Galver, L
    Peltonen, L
    Sabatti, C
    Collins, A
    Freimer, N
    [J]. NATURE GENETICS, 2006, 38 (05) : 556 - 560
  • [10] Linkage-Disequilibrium-Based Binning Misleads the Interpretation of Genome-wide Association Studies
    Zhu, Xiaofeng
    Feng, Tao
    Elston, Robert C.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 91 (05) : 965 - 968