Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies

Cited by: 24
Authors
Joiret, Marc [1 ,2 ]
John, Jestinah M. Mahachie [1 ]
Gusareva, Elena S. [1 ]
Van Steen, Kristel [1 ,3 ]
Affiliations
[1] GIGA R Med Gen, BIO3, Ave Hop 1-B34 CHU, B-4000 Liege, Belgium
[2] GIGA R In Silico Med, Biomech Res Unit, Ave Hop 1-B34 CHU, B-4000 Liege, Belgium
[3] Ave Hop 1-B34 CHU, B-4000 Liege, Belgium
Keywords
Genome-wide association interaction studies (GWAIS); Model-based multifactor-dimensionality reduction (MB-MDR); Gametic phase disequilibrium (GPD); Signal sensitivity; 1000 genomes project; Ankylosing spondylitis; MULTIFACTOR-DIMENSIONALITY REDUCTION; GENOME-WIDE ASSOCIATION; POPULATION-STRUCTURE; HAPLOTYPE BLOCKS; SIMULATION; EPISTASIS; MODEL; SAMPLE;
DOI
10.1186/s13040-019-0199-7
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: In Genome-Wide Association Studies (GWAS), the concept of linkage disequilibrium is important as it allows identifying genetic markers that tag the actual causal variants. In Genome-Wide Association Interaction Studies (GWAIS), similar principles hold for pairs of causal variants. However, Linkage Disequilibrium (LD) may also interfere with the detection of genuine epistasis signals in that there may be complete confounding between Gametic Phase Disequilibrium (GPD) and interaction. GPD may involve unlinked genetic markers, even residing on different chromosomes. Often GPD is eliminated in GWAIS, via feature selection schemes or so-called pruning algorithms, to obtain unconfounded epistasis results. However, little is known about the optimal degree of GPD/LD-pruning that gives a balance between false positive control and sufficient power of epistasis detection statistics. Here, we focus on Model-Based Multifactor Dimensionality Reduction (MB-MDR) as one large-scale epistasis detection tool. Its performance has been thoroughly investigated in terms of false positive control and power, under a variety of scenarios involving different trait types and study designs, as well as error-free and noisy data, but never with respect to multicollinear SNPs.

Results: Using real-life human LD patterns from a homogeneous subpopulation of British ancestry, we investigated the impact of LD-pruning on the statistical sensitivity of MB-MDR. We considered three different non-fully penetrant epistasis models with varying effect sizes. There is a clear advantage in pre-analysis pruning using sliding windows at r^2 of 0.75 or lower, but using a threshold of 0.20 has a detrimental effect on the power to detect a functional interactive SNP pair (power <25%). Signal sensitivity, directly using LD-block information to determine whether an epistasis signal is present or not, benefits from LD-pruning as well (average power across scenarios: 87%), but is largely hampered by functional loci residing at the boundaries of an LD-block.

Conclusions: Our results confirm that LD patterns and the position of causal variants in LD blocks do have an impact on epistasis detection, and that pruning strategies and LD-block definitions combined need careful attention, if we wish to maximize the power of large-scale epistasis screenings.
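To make the idea of pruning at an r^2 threshold concrete, the following is a minimal Python sketch. It is not the authors' pipeline and not the exact algorithm of any particular tool. It assumes genotypes coded as 0/1/2 allele counts, approximates r^2 by the squared Pearson correlation of those counts, and uses a simplified greedy rule in which the window is measured in SNP index positions. The function names `r_squared` and `ld_prune` and the toy genotype data are hypothetical, introduced only for illustration.

```python
from statistics import mean


def r_squared(x, y):
    """Squared Pearson correlation between two genotype vectors coded 0/1/2."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # monomorphic SNP: correlation is undefined, treated as 0 here
    return (sxy * sxy) / (sxx * syy)


def ld_prune(genotypes, threshold=0.75, window=50):
    """Greedy pruning sketch (an assumption, not a specific tool's algorithm).

    A SNP is kept only if its r^2 with every already-kept SNP located at most
    `window` index positions before it does not exceed `threshold`.
    Returns the indices of the SNPs that are kept.
    """
    kept = []
    for j, snp in enumerate(genotypes):
        neighbours = [i for i in kept if j - i <= window]
        if all(r_squared(genotypes[i], snp) <= threshold for i in neighbours):
            kept.append(j)
    return kept


# Toy data: rows are SNPs, columns are individuals (hypothetical values).
genotypes = [
    [0, 1, 2, 0, 1, 2, 0, 1],  # SNP 0
    [0, 1, 2, 0, 1, 2, 0, 2],  # SNP 1: nearly identical to SNP 0
    [2, 0, 1, 1, 0, 2, 1, 0],  # SNP 2: almost uncorrelated with SNP 0
]

print(ld_prune(genotypes, threshold=0.75, window=50))  # prints [0, 2]
```

In this toy example SNP 1 has r^2 of about 0.85 with SNP 0 and is therefore dropped at the 0.75 threshold, while it would be retained at a more permissive threshold such as 0.9. Lowering the threshold removes more markers, which is the trade-off between confounding control and power that the abstract describes.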
Pages: 23
Related papers
50 in total
  • [21] Entropy Based Genetic Association And Gene-Gene Interaction Tests
    Wang, Xin
    de Andrade, Mariza
    GENETIC EPIDEMIOLOGY, 2012, 36 (02) : 141 - 141
  • [22] Sample size calculations for association studies of gene-gene interaction.
    Gauderman, WJ
    AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 69 (04) : 401 - 401
  • [23] Robust Gene-Gene Interaction Analysis in Genome Wide Association Studies
    Kim, Yongkang
    Park, Taesung
    PLOS ONE, 2015, 10 (08):
  • [24] Mutational spectrum and linkage disequilibrium patterns at the ornithine transcarbamylase gene (OTC)
    Azevedo, L.
    Soares, P. A.
    Quental, R.
    Vilarinho, L.
    Teles, E. L.
    Martins, E.
    Diogo, L.
    Garcia, P.
    Cenni, B.
    Wermuth, B.
    Amorim, A.
    ANNALS OF HUMAN GENETICS, 2006, 70 : 797 - 801
  • [25] Common SNPs, haplotypes, and patterns of linkage disequilibrium in the FANCA gene.
    Taylor, JG
    Yamaguchi, H
    Young, NS
    Liu, JM
    Chanock, SJ
    BLOOD, 2003, 102 (11) : 506A - 506A
  • [26] Navigating gene-gene and drug-gene interaction landscapes underpinning the DNA damage response
    Durocher, Daniel
    MOLECULAR CANCER THERAPEUTICS, 2019, 18 (12)
  • [27] Allelic based Gene-Gene Interaction in Case-Control Study
    Jung, J.
    GENETIC EPIDEMIOLOGY, 2008, 32 (07) : 697 - 697
  • [28] Jackknife-based gene-gene interaction tests for untyped SNPs
    Song, Minsun
    BMC GENETICS, 2015, 16
  • [29] Environmental Confounding in Gene-Environment Interaction Studies
    VanderWeele, Tyler J.
    Ko, Yi-An
    Mukherjee, Bhramar
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2013, 178 (01) : 144 - 152
  • [30] Efficient estimation for large-scale linkage disequilibrium patterns of the human genome
    Huang, Xin
    Zhu, Tian-Neng
    Liu, Ying-Chao
    Qi, Guo-An
    Zhang, Jian-Nan
    Chen, Guo-Bo
    ELIFE, 2023, 12