Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies

被引:24
|
作者
Joiret, Marc [1 ,2 ]
John, Jestinah M. Mahachie [1 ]
Gusareva, Elena S. [1 ]
Van Steen, Kristel [1 ,3 ]
机构
[1] GIGA R Med Gen, BIO3, Ave Hop 1-B34 CHU, B-4000 Liege, Belgium
[2] GIGA R In Silico Med, Biomech Res Unit, Ave Hop 1-B34 CHU, B-4000 Liege, Belgium
[3] Ave Hop 1-B34 CHU, B-4000 Liege, Belgium
关键词
Genome-wide association interaction studies (GWAIS); Model-based multifactor-dimensionality reduction (MB-MDR); Gametic phase disequilibrium (GPD); Signal sensitivity; 1000 genomes project; Ankylosing spondylitis; MULTIFACTOR-DIMENSIONALITY REDUCTION; GENOME-WIDE ASSOCIATION; POPULATION-STRUCTURE; HAPLOTYPE BLOCKS; SIMULATION; EPISTASIS; MODEL; SAMPLE;
D O I
10.1186/s13040-019-0199-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
BackgroundIn Genome-Wide Association Studies (GWAS), the concept of linkage disequilibrium is important as it allows identifying genetic markers that tag the actual causal variants. In Genome-Wide Association Interaction Studies (GWAIS), similar principles hold for pairs of causal variants. However, Linkage Disequilibrium (LD) may also interfere with the detection of genuine epistasis signals in that there may be complete confounding between Gametic Phase Disequilibrium (GPD) and interaction. GPD may involve unlinked genetic markers, even residing on different chromosomes. Often GPD is eliminated in GWAIS, via feature selection schemes or so-called pruning algorithms, to obtain unconfounded epistasis results. However, little is known about the optimal degree of GPD/LD-pruning that gives a balance between false positive control and sufficient power of epistasis detection statistics. Here, we focus on Model-Based Multifactor Dimensionality Reduction as one large-scale epistasis detection tool. Its performance has been thoroughly investigated in terms of false positive control and power, under a variety of scenarios involving different trait types and study designs, as well as error-free and noisy data, but never with respect to multicollinear SNPs.ResultsUsing real-life human LD patterns from a homogeneous subpopulation of British ancestry, we investigated the impact of LD-pruning on the statistical sensitivity of MB-MDR. We considered three different non-fully penetrant epistasis models with varying effect sizes. There is a clear advantage in pre-analysis pruning using sliding windows at r(2) of 0.75 or lower, but using a threshold of 0.20 has a detrimental effect on the power to detect a functional interactive SNP pair (power <25%). Signal sensitivity, directly using LD-block information to determine whether an epistasis signal is present or not, benefits from LD-pruning as well (average power across scenarios: 87%), but is largely hampered by functional loci residing at the boundaries of an LD-block.ConclusionsOur results confirm that LD patterns and the position of causal variants in LD blocks do have an impact on epistasis detection, and that pruning strategies and LD-blocks definitions combined need careful attention, if we wish to maximize the power of large-scale epistasis screenings.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies
    Marc Joiret
    Jestinah M. Mahachie John
    Elena S. Gusareva
    Kristel Van Steen
    BioData Mining, 12
  • [2] Correction: Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies
    Marc Joiret
    Jestinah M. Mahachie John
    Elena S. Gusareva
    Kristel Van Steen
    BioData Mining, 15
  • [3] Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies (vol 12, 11 2019)
    Joiret, Marc
    Mahachie John, Jestinah M.
    Gusareva, Elena S.
    Van Steen, Kristel
    BIODATA MINING, 2022, 15 (01)
  • [4] DNA polymorphisms and linkage disequilibrium in the angiotensinogen gene
    Morgan, L
    Pipkin, FB
    Kalsheker, N
    HUMAN GENETICS, 1996, 98 (02) : 194 - 198
  • [5] Allelic Based Gene-Gene Interaction in Case-Control Studies
    Jung, Jeesun
    Zhao, Yiqiang
    HUMAN HEREDITY, 2010, 69 (01) : 14 - 27
  • [6] Potential for gene-gene confounding bias in case-parental control studies
    Lee, WC
    Ho, YY
    ANNALS OF EPIDEMIOLOGY, 2003, 13 (04) : 261 - 266
  • [7] THE BF SYSTEM IN DIABETES - GENE INTERACTION OR LINKAGE DISEQUILIBRIUM
    WOLF, E
    CUDWORTH, AG
    MARKWICK, JR
    GORSUCH, AN
    SPENCER, KM
    BODANSKY, HJ
    DIABETOLOGIA, 1982, 22 (02) : 85 - 88
  • [8] Variation of gene-based SNPs and linkage disequilibrium patterns in the human genome
    Tsunoda, T
    Lathrop, GM
    Sekine, A
    Yamada, R
    Takahashi, A
    Ohnishi, Y
    Tanaka, T
    Nakamura, Y
    HUMAN MOLECULAR GENETICS, 2004, 13 (15) : 1623 - 1632
  • [9] A New Correction for Multiple Testing in Gene-Gene Interaction Studies
    Babron, Marie-Claude
    Etcheto, Adrien
    Dizier, Marie-Helene
    ANNALS OF HUMAN GENETICS, 2015, 79 (05) : 380 - 384
  • [10] Gene-based interaction analysis by incorporating external linkage disequilibrium information
    He, Jing
    Wang, Kai
    Edmondson, Andrew C.
    Rader, Daniel J.
    Li, Chun
    Li, Mingyao
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2011, 19 (02) : 164 - 172