Identification and Frequency Estimation of Inversion Polymorphisms from Haplotype Data

Cited by: 20
Authors
Sindi, Suzanne S. [1, 2]
Raphael, Benjamin J. [1, 3]
Affiliations
[1] Brown Univ, Dept Comp Sci, Providence, RI 02912 USA
[2] Brown Univ, Div Appl Math, Providence, RI 02912 USA
[3] Brown Univ, Ctr Computat Mol Biol, Providence, RI 02912 USA
Keywords
algorithms; DNA; genetic variation; genomes; haplotypes; structural variation; deletion polymorphisms; common inversion; human genome; fine-scale; resolution; inference; blocks
DOI
10.1089/cmb.2009.0185
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Structural rearrangements, including copy-number alterations and inversions, are increasingly recognized as an important contributor to human genetic variation. Copy number variants are readily measured via array-based techniques like comparative genomic hybridization, but copy-neutral variants such as inversion polymorphisms remain difficult to identify without whole genome sequencing. We introduce a method to identify inversion polymorphisms and estimate their frequency in a population using readily available single nucleotide polymorphism (SNP) data. Our method uses a probabilistic model to describe a population as a mixture of forward and inverted chromosomes and identifies putative inversions by characteristic differences in haplotype frequencies around inversion breakpoints. On simulated data, our method accurately predicts inversions with frequencies as low as 25% in the population and reliably estimates inversion frequencies over a wide range. On the human HapMap Phase 2 data, we predict between 88 and 142 inversion polymorphisms with frequency ranging from 20 to 81 percent. Many of these correspond to known inversions or have other evidence supporting them, and the predicted inversion frequencies largely agree with the limited information presently available.
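The abstract describes a population as a mixture of forward and inverted chromosomes but gives no details of the model. As a rough illustration of the mixture idea only, and not the authors' method, the sketch below estimates the inverted fraction by expectation-maximization when the haplotype frequencies within each orientation are assumed to be known in advance. The function name `estimate_inverted_fraction`, its parameters, and the haplotype labels are all hypothetical.

```python
def estimate_inverted_fraction(counts, p_forward, p_inverted, iters=200, tol=1e-10):
    """EM estimate of the mixing weight of a two-component mixture.

    counts[h]     : observed number of chromosomes carrying haplotype h
    p_forward[h]  : ASSUMED frequency of h among forward chromosomes
    p_inverted[h] : ASSUMED frequency of h among inverted chromosomes
    Returns the estimated fraction of inverted chromosomes.
    """
    total = sum(counts.values())
    pi = 0.5  # initial guess for the inverted fraction
    for _ in range(iters):
        # E-step: expected number of inverted chromosomes under the current pi
        expected_inverted = 0.0
        for h, n in counts.items():
            inv = pi * p_inverted.get(h, 0.0)
            fwd = (1.0 - pi) * p_forward.get(h, 0.0)
            if inv + fwd > 0.0:
                expected_inverted += n * inv / (inv + fwd)
        # M-step: the new mixing weight is the expected inverted share
        new_pi = expected_inverted / total
        converged = abs(new_pi - pi) < tol
        pi = new_pi
        if converged:
            break
    return pi


# Toy data: the two orientations carry entirely different haplotypes,
# so the estimate is simply the observed share of the "inverted" haplotype.
toy_counts = {"AB": 75, "ab": 25}
toy_forward = {"AB": 1.0}
toy_inverted = {"ab": 1.0}
toy_estimate = estimate_inverted_fraction(toy_counts, toy_forward, toy_inverted)
print(toy_estimate)  # 0.25
```

In practice the per-orientation haplotype frequencies are not known and would have to be inferred together with the mixing weight, which this sketch does not attempt.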
Pages: 517-531
Page count: 15