A nearly linear-time general algorithm for genome-wide bi-allele haplotype phasing

被引:0
|
作者
Casey, W
Mishra, B
机构
[1] NYU, Courant Inst Math Sci, New York, NY 10003 USA
[2] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[3] Tata Inst Fundamental Res, Bombay 400005, Maharashtra, India
来源
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The determination of feature maps, such as STSs (sequence tag sites), SNPs (single nucleotide polymorphisms) or RFLP (restriction fragment length polymorphisms) maps, for each chromosome copy or haplotype in an individual has important potential applications to genetics, clinical biology and association studies. We consider the problem of reconstructing two haplotypes of a diploid individual from genotype data generated by mapping experiments, and present an algorithm to recover haplotypes. The problem of optimizing existing methods of SNP phasing with a population of diploid genotypes has been investigated in [7] and found to be NP-hard. In contrast, using single molecule methods, we show that although haplotypes are not known and data are further confounded by the mapping error model, reasonable assumptions on the mapping process allow us to recover the co-associations of allele types across consecutive loci and estimate the haplotypes with an efficient algorithm. The haplotype reconstruction algorithm requires two stages: Stage I is the detection of polymorphic marker types, this is done by modifying an EM-algorithm for Gaussian mixture models and an example is given for RFLP sizing. Stage II focuses on the problem of phasing and presents a method of local maximum likelihood for the inference of haplotypes in an individual. The algorithm presented is nearly linear in the number of polymorphic loci. The algorithm results, run on simulated RFLP sizing data, are encouraging, and suggest that the method will prove practical for haplotype phasing.
引用
收藏
页码:204 / 215
页数:12
相关论文
共 27 条
  • [11] A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on pedigrees without mating loops
    Liu, Lan
    Jiang, Tao
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2010, 19 (02) : 217 - 240
  • [12] A linear-time algorithm for reconstructing zero-recombinant haplotype configuration on pedigrees without mating loops
    Lan Liu
    Tao Jiang
    Journal of Combinatorial Optimization, 2010, 19 : 217 - 240
  • [13] Simultaneous Genotype Calling and Haplotype Phasing Improves Genotype Accuracy and Reduces False-Positive Associations for Genome-wide Association Studies
    Browning, Brian L.
    Yu, Zhaoxia
    AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 85 (06) : 847 - 861
  • [14] BOBEA : A Bi-Objective Biclustering Evolutionary Algorithm for Genome-Wide Association Analysis
    Maatouk, Ons
    Ayari, Emna
    Bouziri, Hend
    Ayadi, Wassim
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 344 - 347
  • [15] A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
    Chen, Heli
    Hao, Zhiyu
    Zhao, Yunfeng
    Yang, Runqing
    BMC GENOMICS, 2020, 21 (01)
  • [16] A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize
    Heli Chen
    Zhiyu Hao
    Yunfeng Zhao
    Runqing Yang
    BMC Genomics, 21
  • [17] Genome-wide exploratory analysis for NARAC dataset with preparation for haplotype block partitioning through minor allele frequency quality control viewpoint
    Mohamed N. Saad
    Galena W. Zareef
    Fatma S. Ibrahim
    Ashraf M. Said
    Hisham F. A. Hamed
    Iran Journal of Computer Science, 2023, 6 (4) : 387 - 396
  • [18] A Linear-Time Complexity Algorithm for Solving the Dyck-CFL Reachability Problem on Bi-directed Trees
    Sun Xiaoshan
    Zhang Yang
    Cheng Liang
    FIFTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2012): COMPUTER VISION, IMAGE ANALYSIS AND PROCESSING, 2013, 8783
  • [19] A linear-time self-stabilizing algorithm for the minimal 2-dominating set problem in general networks
    Huang, Tetz C.
    Chen, Chin-Yuan
    Wang, Cheng-Pin
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2008, 24 (01) : 175 - 187
  • [20] Fine-Scale Genetic Structure and Natural Selection Signatures of Southwestern Hans Inferred From Patterns of Genome-Wide Allele, Haplotype, and Haplogroup Lineages
    Wang, Mengge
    Yuan, Didi
    Zou, Xing
    Wang, Zheng
    Yeh, Hui-Yuan
    Liu, Jing
    Wei, Lan-Hai
    Wang, Chuan-Chao
    Zhu, Bofeng
    Liu, Chao
    He, Guanglin
    FRONTIERS IN GENETICS, 2021, 12