Quantifying the amount of missing information in genetic association studies

被引:18
|
作者
Nicolae, Dan L.
机构
[1] Univ Chicago, Dept Med, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Stat, Chicago, IL 60637 USA
关键词
information content; multi-locus linkage disequilibrium; asymptotic relative efficiency; association testing; case-control design;
D O I
10.1002/gepi.20181
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Many genetic analyses are done with incomplete information; for example, unknown phase in haplotype-based association studies. Measures of the amount of available information can be used for efficient planning of studies and/or analyses. In particular, the linkage disequilibrium (LD) between two sets of markers can be interpreted as the amount of information one set of markers contains for testing allele frequency differences in the second set, and measuring LD can be viewed as quantifying information in a missing data problem. We introduce a framework for measuring the association between two sets of variables; for example, genotype data for two distinct groups of markers, or haplotype and genotype data for a given set of polymorphisms. The goal is to quantify how much information is in one data set, e.g. genotype data for a set of SNPs, for estimating parameters that are functions of frequencies in the second data set, e.g. haplotype frequencies, relative to the ideal case of actually observing the complete data, e.g. haplotypes. In the case of genotype data on two mutually exclusive sets of markers, the measure determines the amount of multi-locus LD, and is equal to the classical measure r(2), if the sets consist each of one bi-allelic marker. In general, the measures are interpreted as the asymptotic ratio of sample sizes necessary to achieve the same power in case-control testing. The focus of this paper is on case-control allele/haplotype tests, but the framework can be extended easily to other settings like regressing quantitative traits on allele/haplotype counts, or tests on genotypes or diplotypes. We highlight applications of the approach, including tools for navigating the HapMap database [The International HapMap Consortium, 2003], and genotyping strategies for positional cloning studies. Genet. Epidemiol. 30:703-717, 2006. (c) 2006 Wiley-Liss, Inc.
引用
收藏
页码:703 / 717
页数:15
相关论文
共 50 条
  • [1] Quantifying the Fraction of Missing Information for Hypothesis Testing in Statistical and Genetic Studies
    Nicolae, Dan L.
    Meng, Xiao-Li
    Kong, Augustine
    [J]. STATISTICAL SCIENCE, 2008, 23 (03) : 287 - 312
  • [2] Comment: Quantifying the Fraction of Missing Information for Hypothesis Testing in Statistical and Genetic Studies
    Zheng, Tian
    Lo, Shaw-Hwa
    [J]. STATISTICAL SCIENCE, 2008, 23 (03) : 321 - 324
  • [3] Quantifying the Amount of Missing Tissue in a Digital Breast Tomosynthesis
    Goodenough, D.
    Olafsdottir, H.
    Olafsson, I.
    Fredriksson, J.
    Mallozzi, R.
    Kristinsson, S.
    Healy, A.
    Oskarsdottir, G.
    Kristbjornsson, A.
    Levy, J.
    [J]. MEDICAL PHYSICS, 2015, 42 (06) : 3233 - 3233
  • [4] Estimating Genetic Effects and Quantifying Missing Heritability for Rare Variant Complex Trait Association Studies via Sequence Data
    Leal, Suzanne M.
    Liu, Dajiang J.
    [J]. GENETIC EPIDEMIOLOGY, 2012, 36 (02) : 135 - 135
  • [5] Quantifying and Correcting for the Winner's Curse in Genetic Association Studies
    Xiao, Rui
    Boehnke, Michael
    [J]. GENETIC EPIDEMIOLOGY, 2009, 33 (05) : 453 - 462
  • [6] Quantifying risk of bias in systematic reviews of genetic association studies
    Sohani, Zahra N.
    Sarma, Shohinee
    de Souza, Russell J.
    Meyre, David
    Anand, Sonia S.
    [J]. GENETIC EPIDEMIOLOGY, 2016, 40 (07) : 663 - 663
  • [7] Quantifying missing heritability and estimating genetic effects for complex traits due to rare variants: application to sequence based association studies
    Liu, Dajiang J.
    Leal, Suzanne M.
    [J]. ANNALS OF HUMAN GENETICS, 2012, 76 : 420 - 421
  • [8] Genetic Association Studies: An Information Content Perspective
    Wu, Cen
    Li, Shaoyu
    Cui, Yuehua
    [J]. CURRENT GENOMICS, 2012, 13 (07) : 566 - 573
  • [9] Analysis of Genetic Association Studies Incorporating Prior Information of Genetic Models
    Gang Zheng
    Wei Zhang
    Qizhai Li
    Jinfeng Xu
    Ao Yuan
    Joseph L. Gastwirth
    [J]. Journal of Agricultural, Biological, and Environmental Statistics, 2015, 20 : 173 - 191
  • [10] Analysis of Genetic Association Studies Incorporating Prior Information of Genetic Models
    Zheng, Gang
    Zhang, Wei
    Li, Qizhai
    Xu, Jinfeng
    Yuan, Ao
    Gastwirth, Joseph L.
    [J]. JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2015, 20 (02) : 173 - 191