The combinatorics of tandem duplication trees

Cited by: 19
Authors
Gascuel, O
Hendy, MD
Jean-Marie, A
McLachlan, R
Affiliations
[1] LIRMM, Dept Informat Fondamentale & Applicat, F-34392 Montpellier, France
[2] Massey Univ, Allan Wilson Ctr Mol Ecol & Evolut, Palmerston North, New Zealand
[3] Massey Univ, Inst Fundamental Sci, Palmerston North, New Zealand
Keywords
asymptotic enumeration; random generation; recognition; recursion; tandem duplication trees
DOI
10.1080/10635150390132821
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
We developed a recurrence relation that counts the number of tandem duplication trees (either rooted or unrooted) that are consistent with a set of n tandemly repeated sequences generated under the standard unequal recombination (or crossover) model of tandem duplications. The number of rooted duplication trees is exactly twice the number of unrooted trees, which means that on average only two positions for a root on a duplication tree are possible. Using the recurrence, we tabulated these numbers for small values of n. We also developed an asymptotic formula that provides estimates of these numbers for large n. These numbers give a priori probabilities for phylogenies of the repeated sequences to be duplication trees. This work extends earlier studies in which exhaustive counts were obtained for small n. One application showed the significance of the finding that most maximum-parsimony trees constructed from repeat sequences from human immunoglobulins and T-cell receptors were tandem duplication trees. Those findings provided strong support for the proposed mechanisms of tandem gene duplication. The recurrence relation also suggests efficient algorithms to recognize duplication trees and to generate random duplication trees for simulation. We present a linear-time recognition algorithm.
Pages: 110-118
Number of pages: 9