The combinatorics of tandem duplication trees

被引:19
|
作者
Gascuel, O
Hendy, MD
Jean-Marie, A
McLachlan, R
机构
[1] LIRMM, Dept Informat Fondamentale & Applicat, F-34392 Montpellier, France
[2] Massey Univ, Allan Wilson Ctr Mol Ecol & Evolut, Palmerston North, New Zealand
[3] Massey Univ, Inst Fundamental Sci, Palmerston North, New Zealand
关键词
asymptotic enumeration; random generation; recognition; recursion; tandem duplication trees;
D O I
10.1080/10635150390132821
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We developed a recurrence relation that counts the number of tandem duplication trees (either rooted or unrooted) that are consistent with a set of n tandemly repeated sequences generated under the standard unequal recombination (or crossover) model of tandem duplications. The number of rooted duplication trees is exactly twice the number of unrooted trees, which means that on average only two positions for a root on a duplication tree are possible. Using the recurrence, we tabulated these numbers for small values of n. We also developed an asymptotic formula that for large n provides estimates for these numbers. These numbers give a priori probabilities for phylogenies of the repeated sequences to be duplication trees. This work extends earlier studies where exhaustive counts of the numbers for small n were obtained. One application showed the significance of finding that most maximum-parsimony trees constructed from repeat sequences from human immunoglobins and T-cell receptors were tandem duplication trees. Those findings provided strong support to the proposed mechanisms of tandem gene duplication. The recurrence relation also suggests efficient algorithms to recognize duplication trees and to generate random duplication trees for simulation. We present a linear-time recognition algorithm.
引用
收藏
页码:110 / 118
页数:9
相关论文
共 50 条
  • [1] The combinatorics of tandem duplication
    Penso-Dolfin, L.
    Wu, T.
    Greenman, C. D.
    [J]. DISCRETE APPLIED MATHEMATICS, 2015, 194 : 1 - 22
  • [2] On counting tandem duplication trees
    Yang, YL
    Zhang, LX
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2004, 21 (06) : 1160 - 1163
  • [3] Combinatorics of Tandem Duplication Random Loss Mutations on Circular Genomes
    Hartmann, Tom
    Chu, An-Chiang
    Middendorf, Martin
    Bernt, Matthias
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (01) : 83 - 95
  • [4] A similarity measure between tandem duplication trees
    Koperwas, Jakub
    Walczak, Krzysztof
    [J]. INTELLIGENT INFORMATION PROCESSING AND WEB MINING, PROCEEDINGS, 2006, : 163 - +
  • [5] Topological rearrangements and local search method for tandem duplication trees
    Bertrand, D
    Gascuel, O
    [J]. ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS, 2004, 3240 : 374 - 387
  • [6] Topological rearrangements and local search method for tandem duplication trees
    Bertrand, D
    Gascuel, O
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2005, 2 (01) : 15 - 28
  • [7] On the combinatorics of leftist trees
    Nogueira, P
    [J]. DISCRETE APPLIED MATHEMATICS, 2001, 109 (03) : 253 - 278
  • [8] An efficient and accurate distance based algorithm to reconstruct tandem duplication trees
    Elemento, O
    Gascuel, O
    [J]. BIOINFORMATICS, 2002, 18 : S92 - S99
  • [9] COMBINATORICS ON IDEALS AND FORCING WITH TREES
    GROSZEK, MJ
    [J]. JOURNAL OF SYMBOLIC LOGIC, 1987, 52 (03) : 582 - 593
  • [10] The combinatorics of directed planar trees
    Poirier, Kate
    Tradler, Thomas
    [J]. JOURNAL OF COMBINATORIAL THEORY SERIES A, 2018, 160 : 31 - 61