Quartets MaxCut: A Divide and Conquer Quartets Algorithm

Cited by: 58
Authors
Snir, Sagi [1]
Rao, Satish [2]
Affiliations
[1] Univ Haifa, Inst Evolut, IL-31905 Haifa, Israel
[2] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
Funding
National Science Foundation (USA)
Keywords
Phylogenetic reconstruction; quartets; MaxCut; supertree; PHYLOGENETIC INFERENCE; LOGS SUFFICE; DATA SETS; TREES; LIKELIHOOD; SUPERTREE; PERFORMANCE; BUILD; CUT;
DOI
10.1109/TCBB.2008.133
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Accurate phylogenetic reconstruction methods are currently limited to a maximum of a few dozen taxa. Supertree methods construct a large tree over a large set of taxa from a set of small trees over overlapping subsets of the complete taxon set. Hence, in order to construct the tree of life over a million and a half different species, the use of a supertree method over the output of accurate methods is inevitable. Perhaps the simplest version of this task that is still widely applicable, yet quite challenging, is quartet-based reconstruction. This problem lies at the root of many tree reconstruction methods, and both theoretical and experimental results have been reported. Nevertheless, dealing with false, conflicting quartet trees remains problematic. In this paper, we describe an algorithm for constructing a tree from a set of input quartet trees, even when a significant fraction of them are erroneous. We show empirically that the algorithm handles conflicts in the input satisfactorily and that it significantly outperforms and outpaces the Matrix Representation with Parsimony (MRP) methods that have previously been most successful in dealing with supertrees. Our algorithm is a divide and conquer algorithm whose divide step uses a semidefinite programming (SDP) formulation of MaxCut. We remark that this builds on previous work of ours [29] for piecing together trees from rooted triplet trees. The recursion for unrooted quartets, however, is more complicated: even with a completely consistent set of quartet trees the problem is NP-hard, whereas for triplets there is a linear-time algorithm. This complexity leads to several issues and some solutions of possible independent interest.
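The divide and conquer idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it makes several assumptions that are not in the abstract: the SDP-based MaxCut step is swapped for an exhaustive search over bipartitions (feasible only for a handful of taxa); the cut score is taken to be the number of cut "good" pairs (taxa on opposite sides of a quartet) minus the number of cut "bad" pairs (siblings in a quartet); and quartets that straddle the chosen cut are simply dropped in the recursion. The names `best_cut` and `build_tree` are hypothetical.

```python
from itertools import combinations

# A quartet ab|cd is represented as (("a", "b"), ("c", "d")).

def best_cut(taxa, quartets):
    """Exhaustive stand-in for the MaxCut step (assumption: brute force, not SDP).

    Returns the bipartition maximizing (good pairs cut) - (bad pairs cut).
    """
    taxa = sorted(taxa)
    first, rest = taxa[0], taxa[1:]
    best, best_score = None, None
    # Keep `first` on the left to skip mirror-image cuts; right side stays non-empty.
    for k in range(len(rest)):
        for extra in combinations(rest, k):
            left = {first, *extra}
            score = 0
            for (a, b), (c, d) in quartets:
                # Bad pairs: siblings in the quartet; separating them is penalized.
                score -= ((a in left) != (b in left)) + ((c in left) != (d in left))
                # Good pairs: taxa from opposite sides; separating them is rewarded.
                score += sum((x in left) != (y in left)
                             for x in (a, b) for y in (c, d))
            if best_score is None or score > best_score:
                best, best_score = left, score
    return best, set(taxa) - best

def build_tree(taxa, quartets):
    """Recursively split the taxa and return the tree as nested tuples."""
    taxa = set(taxa)
    if len(taxa) == 1:
        return next(iter(taxa))
    if len(taxa) == 2:
        return tuple(sorted(taxa))
    left, right = best_cut(taxa, quartets)

    def restrict(side):
        # Simplification: keep only quartets whose four taxa all lie on one side.
        return [q for q in quartets if {*q[0], *q[1]} <= side]

    return (build_tree(left, restrict(left)), build_tree(right, restrict(right)))

# Two copies of ab|cd and one conflicting quartet ac|bd.
noisy = [(("a", "b"), ("c", "d")),
         (("a", "b"), ("c", "d")),
         (("a", "c"), ("b", "d"))]
print(build_tree("abcd", noisy))  # (('a', 'b'), ('c', 'd'))
```

In this toy input the majority quartet ab|cd determines the cut even though one input quartet conflicts with it. Subproblems with fewer than four taxa carry no quartet information in this simplified recursion, so their resolution is arbitrary here.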
Pages: 704-718
Page count: 15
Related papers
50 in total
  • [1] An experimental study of Quartets MaxCut and other supertree methods
    Swenson, M. Shel
    Suri, Rahul
    Linder, C. Randal
    Warnow, Tandy
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6
  • [2] An experimental study of Quartets MaxCut and other supertree methods
    M Shel Swenson
    Rahul Suri
    C Randal Linder
    Tandy Warnow
    Algorithms for Molecular Biology, 6
  • [3] 'QUARTETS'
    FULLER, R
    SOUTHERN REVIEW-BATON ROUGE, 1977, 13 (01): : 150 - 151
  • [4] THE HAYDN QUARTETS
    LANDON, HCR
    STRAD, 1989, 100 (1187): : 222 - 222
  • [5] Four Quartets
    Macaulay, Alastair
    DANCING TIMES, 2019, 109 (1307): : 55 - 55
  • [6] Targeting by quartets
    Laurie A Dempsey
    Nature Immunology, 2015, 16 (7) : 688 - 688
  • [7] More quartets
    Collier, C
    NATURAL HISTORY, 1998, 107 (04) : 6 - 6
  • [8] Four quartets
    Eliot, T. S.
    CARAVELLE-CAHIERS DU MONDE HISPANIQUE ET LUSO-BRESILIEN, 2016, (106): : 180 - 180
  • [9] ROMANTIC QUARTETS
    ANDERSON, R
    MUSICAL TIMES, 1985, 126 (1707): : 290 - 290
  • [10] Floral quartets
    Günter Theißen
    Heinz Saedler
    Nature, 2001, 409 : 469 - 471