Cleaning Genotype Data from Diversity Outbred Mice

被引:13
|
作者
Broman, Karl W. [1 ]
Gatti, Daniel M. [2 ]
Svenson, Karen L. [2 ]
Sen, Saunak [3 ]
Churchill, Gary A. [2 ]
机构
[1] Univ Wisconsin, Dept Biostat & Med Informat, 2126 Genet Biotechnol Ctr,425 Henry Mall, Madison, WI 53706 USA
[2] Jackson Lab, 600 Main St, Bar Harbor, ME 04609 USA
[3] Univ Tennessee, Hlth Sci Ctr, Dept Prevent Med, Memphis, TN 38163 USA
来源
G3-GENES GENOMES GENETICS | 2019年 / 9卷 / 05期
基金
美国国家卫生研究院;
关键词
data cleaning; QTL; quantitative trait loci; data diagnostics; Multiparent Advanced Generation Inter-Cross (MAGIC); multiparental populations; MPP; QUANTITATIVE TRAIT LOCI; SAMPLE MIX-UPS; PROBABILITIES;
D O I
10.1534/g3.119.400165
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Data cleaning is an important first step in most statistical analyses, including efforts to map the genetic loci that contribute to variation in quantitative traits. Here we illustrate approaches to quality control and cleaning of array-based genotyping data for multiparent populations (experimental crosses derived from more than two founder strains), using MegaMUGA array data from a set of 291 Diversity Outbred (DO) mice. Our approach employs data visualizations that can reveal problems at the level of individual mice or with individual SNP markers. We find that the proportion of missing genotypes for each mouse is an effective indicator of sample quality. We use microarray probe intensities for SNPs on the X and Y chromosomes to confirm the sex of each mouse, and we use the proportion of matching SNP genotypes between pairs of mice to detect sample duplicates. We use a hidden Markov model (HMM) reconstruction of the founder haplotype mosaic across each mouse genome to estimate the number of crossovers and to identify potential genotyping errors. To evaluate marker quality, we find that missing data and genotyping error rates are the most effective diagnostics. We also examine the SNP genotype frequencies with markers grouped according to their minor allele frequency in the founder strains. For markers with high apparent error rates, a scatterplot of the allele-specific probe intensities can reveal the underlying cause of incorrect genotype calls. The decision to include or exclude low-quality samples can have a significant impact on the mapping results for a given study. We find that the impact of low-quality markers on a given study is often minimal, but reporting problematic markers can improve the utility of the genotyping array across many studies.
引用
收藏
页码:1571 / 1579
页数:9
相关论文
共 50 条
  • [21] Age and diet shape the genetic architecture of body weight in diversity outbred mice
    Wright, Kevin M.
    Deighan, Andrew G.
    Di Francesco, Andrea
    Freund, Adam
    Jojic, Vladimir
    Churchill, Gary A.
    Raj, Anil
    [J]. ELIFE, 2022, 11
  • [22] GENETIC ANALYSIS OF INTERMITTENT ACCESS VOLUNTARY ETHANOL CONSUMPTION IN DIVERSITY OUTBRED MICE
    Mignogna, K. M.
    Macleod, L. M.
    Wolstenholme, J. T.
    Jurmain, J. L.
    Miles, M. F.
    [J]. ALCOHOLISM-CLINICAL AND EXPERIMENTAL RESEARCH, 2017, 41 : 184A - 184A
  • [23] Genetic loci associated with voluntary ethanol consumption and preference in diversity outbred mice
    Mignogna, Kristin
    Macleod, Lorna
    Jurmain, Jessica
    Miles, Michael
    [J]. BEHAVIOR GENETICS, 2018, 48 (06) : 497 - 497
  • [24] The diversity outbred mouse population
    Gary A. Churchill
    Daniel M. Gatti
    Steven C. Munger
    Karen L. Svenson
    [J]. Mammalian Genome, 2012, 23 : 713 - 718
  • [25] The diversity outbred mouse population
    Churchill, Gary A.
    Gatti, Daniel M.
    Munger, Steven C.
    Svenson, Karen L.
    [J]. MAMMALIAN GENOME, 2012, 23 (9-10) : 713 - 718
  • [26] Analyses in diversity outbred mice identify new genetic loci associated with sleep phenotypes
    Keenan, B.
    Lian, J.
    Guo, X.
    Svenson, K.
    Churchill, G.
    Pack, A.
    [J]. JOURNAL OF SLEEP RESEARCH, 2022, 31
  • [27] Sensitivity to hepatotoxicity due to epigallocatechin gallate is affected by genetic background in diversity outbred mice
    Church, Rachel J.
    Gatti, Daniel M.
    Urban, Thomas J.
    Long, Nanye
    Yang, Xi
    Shi, Qiang
    Eaddy, J. Scott
    Mosedale, Merrie
    Ballard, Shawn
    Churchill, Gary A.
    Navarro, Victor
    Watkins, Paul B.
    Threadgill, David W.
    Harrill, Alison H.
    [J]. FOOD AND CHEMICAL TOXICOLOGY, 2015, 76 : 19 - 26
  • [28] Population Variation in Micronucleus Response to Inhaled Benzene in Diversity Outbred (J:DO) Mice
    Hobbs, C. A.
    Morgan, D. L.
    Shepard, K.
    Price, H. C.
    Kissling, G. E.
    Shockley, K. R.
    Recio, L.
    Witt, K. L.
    French, J. E.
    [J]. ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2012, 53 : S26 - S26
  • [29] Age-Associated Changes in Hippocampal-Dependent Cognition in Diversity Outbred Mice
    Koh, Ming Teng
    Spiegel, Amy M.
    Gallagher, Michela
    [J]. HIPPOCAMPUS, 2014, 24 (11) : 1300 - 1307
  • [30] Genetic variation and correlations between genotype and locomotor physiology in outbred laboratory house mice (Mus domesticus)
    Carter, PA
    Garland, T
    Dohm, MR
    Hayes, JP
    [J]. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY A-MOLECULAR & INTEGRATIVE PHYSIOLOGY, 1999, 123 (02): : 155 - 162