The combinatorics of tandem duplication trees

被引:19
|
作者
Gascuel, O
Hendy, MD
Jean-Marie, A
McLachlan, R
机构
[1] LIRMM, Dept Informat Fondamentale & Applicat, F-34392 Montpellier, France
[2] Massey Univ, Allan Wilson Ctr Mol Ecol & Evolut, Palmerston North, New Zealand
[3] Massey Univ, Inst Fundamental Sci, Palmerston North, New Zealand
关键词
asymptotic enumeration; random generation; recognition; recursion; tandem duplication trees;
D O I
10.1080/10635150390132821
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We developed a recurrence relation that counts the number of tandem duplication trees (either rooted or unrooted) that are consistent with a set of n tandemly repeated sequences generated under the standard unequal recombination (or crossover) model of tandem duplications. The number of rooted duplication trees is exactly twice the number of unrooted trees, which means that on average only two positions for a root on a duplication tree are possible. Using the recurrence, we tabulated these numbers for small values of n. We also developed an asymptotic formula that for large n provides estimates for these numbers. These numbers give a priori probabilities for phylogenies of the repeated sequences to be duplication trees. This work extends earlier studies where exhaustive counts of the numbers for small n were obtained. One application showed the significance of finding that most maximum-parsimony trees constructed from repeat sequences from human immunoglobins and T-cell receptors were tandem duplication trees. Those findings provided strong support to the proposed mechanisms of tandem gene duplication. The recurrence relation also suggests efficient algorithms to recognize duplication trees and to generate random duplication trees for simulation. We present a linear-time recognition algorithm.
引用
收藏
页码:110 / 118
页数:9
相关论文
共 50 条
  • [31] Construction of tandem duplication correcting codes
    Zeraatpisheh, Mohamadbagher
    Esmaeili, Morteza
    Gulliver, T. Aaron
    [J]. IET COMMUNICATIONS, 2019, 13 (15) : 2217 - 2225
  • [32] A SPONTANEOUS TANDEM DUPLICATION IN A DROSOPHILA CHROMOSOME
    BAIMAI, V
    KITTHAWEE, S
    [J]. EXPERIENTIA, 1981, 37 (04): : 345 - 346
  • [33] GENERATION OF DIRECTED TREES AND 2-TREES WITHOUT DUPLICATION
    PAUL, AJ
    [J]. IEEE TRANSACTIONS ON CIRCUIT THEORY, 1967, CT14 (03): : 354 - &
  • [34] STRUCTURE OF RR TANDEM DUPLICATION IN MAIZE
    DOONER, HK
    KERMICLE, JL
    [J]. GENETICS, 1971, 67 (03) : 427 - +
  • [35] Capacity and Expressiveness of Genomic Tandem Duplication
    Jain, Siddharth
    Farnoud , Farzad
    Bruck, Jehoshua
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2017, 63 (10) : 6129 - 6138
  • [36] Capacity and Expressiveness of Genomic Tandem Duplication
    Jain, Siddharth
    Farnoud , Farzad
    Bruck, Jehoshua
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2015, : 1946 - 1950
  • [37] The combinatorics of discrete time-trees: theory and open problems
    Alex Gavryushkin
    Chris Whidden
    Frederick A. Matsen
    [J]. Journal of Mathematical Biology, 2018, 76 : 1101 - 1121
  • [38] The combinatorics of discrete time-trees: theory and open problems
    Gavryushkin, Alex
    Whidden, Chris
    Matsen, Frederick A.
    [J]. JOURNAL OF MATHEMATICAL BIOLOGY, 2018, 76 (05) : 1101 - 1121
  • [39] INVERTED TANDEM DUPLICATION GENERATES A DUPLICATION DEFICIENCY OF CHROMOSOME-8P
    DILL, FJ
    SCHERTZER, M
    SANDERCOCK, J
    TISCHLER, B
    WOOD, S
    [J]. CLINICAL GENETICS, 1987, 32 (02) : 109 - 113
  • [40] Reported tandem duplication/deletion of 9q is actually an inverted duplication
    Wyandt, HE
    [J]. AMERICAN JOURNAL OF MEDICAL GENETICS, 2001, 100 (01): : 82 - 83