From Gene Trees to Species Trees II: Species Tree Inference by Minimizing Deep Coalescence Events

被引:40
|
作者
Zhang, Louxin [1 ]
机构
[1] Natl Univ Singapore, Dept Math, Singapore 119076, Singapore
关键词
Gene tree and species tree reconciliation; deep coalescence; gene duplication and loss; the parsimony principle; NP-hardness; DUPLICATION; PHYLOGENY; TIME; RECONSTRUCTION; PROBABILITY; DIVERGENCE; EVOLUTION; LINEAGE;
D O I
10.1109/TCBB.2011.83
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
When gene copies are sampled from various species, the resulting gene tree might disagree with the containing species tree. The primary causes of gene tree and species tree discord include incomplete lineage sorting, horizontal gene transfer, and gene duplication and loss. Each of these events yields a different parsimony criterion for inferring the (containing) species tree from gene trees. With incomplete lineage sorting, species tree inference is to find the tree minimizing extra gene lineages that had to coexist along species lineages; with gene duplication, it becomes to find the tree minimizing gene duplications and/or losses. In this paper, we present the following results: 1) The deep coalescence cost is equal to the number of gene losses minus two times the gene duplication cost in the reconciliation of a uniquely leaf labeled gene tree and a species tree. The deep coalescence cost can be computed in linear time for any arbitrary gene tree and species tree. 2) The deep coalescence cost is always not less than the gene duplication cost in the reconciliation of an arbitrary gene tree and a species tree. 3) Species tree inference by minimizing deep coalescence events is NP-hard.
引用
收藏
页码:1685 / 1691
页数:7
相关论文
共 50 条
  • [1] Species Tree Estimation from Gene Trees by Minimizing Deep Coalescence and Maximizing Quartet Consistency: A Comparative Study and the Presence of Pseudo Species Tree Terraces
    Farah, Ishrat Tanzila
    Islam, Muktadirul
    Zinat, Kazi Tasnim
    Rahman, Atif Hasan
    Bayzid, Shamsuzzoha
    [J]. SYSTEMATIC BIOLOGY, 2021, 70 (06) : 1213 - 1231
  • [2] Estimating Optimal Species Trees from Incomplete Gene Trees Under Deep Coalescence
    Bayzid, Md Shamsuzzoha
    Warnow, Tandy
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (06) : 591 - 605
  • [3] The Inference of Gene Trees with Species Trees
    Szoellosi, Gergely J.
    Tannier, Eric
    Daubin, Vincent
    Boussau, Bastien
    [J]. SYSTEMATIC BIOLOGY, 2015, 64 (01) : E42 - E62
  • [4] EXACT SOLUTIONS FOR SPECIES TREE INFERENCE FROM DISCORDANT GENE TREES
    Chang, Wen-Chieh
    Gorecki, Pawel
    Eulenstein, Oliver
    [J]. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2013, 11 (05)
  • [5] PRANC: ML species tree estimation from the ranked gene trees under coalescence
    Kim, Anastasiia
    Degnan, James H.
    [J]. BIOINFORMATICS, 2020, 36 (18) : 4819 - 4821
  • [6] From gene trees to species trees
    Ma, B
    Li, M
    Zhang, LX
    [J]. SIAM JOURNAL ON COMPUTING, 2000, 30 (03) : 729 - 752
  • [7] Species Tree Inference by Minimizing Deep Coalescences
    Than, Cuong
    Nakhleh, Luay
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (09)
  • [8] Reconciling gene trees to a species tree
    Bonizzoni, P
    Della Vedova, G
    Dondi, R
    [J]. ALGORITHMS AND COMPLEXITY, PROCEEDINGS, 2003, 2653 : 120 - 131
  • [9] STEM: species tree estimation using maximum likelihood for gene trees under coalescence
    Kubatko, Laura S.
    Carstens, Bryan C.
    Knowles, L. Lacey
    [J]. BIOINFORMATICS, 2009, 25 (07) : 971 - 973
  • [10] Gene trees in species trees
    Maddison, WP
    [J]. SYSTEMATIC BIOLOGY, 1997, 46 (03) : 523 - 536