Towards Improving Searches for Optimal Phylogenies

被引:2
|
作者
Ford, Eric [1 ]
St John, Katherine [1 ,2 ]
Wheeler, Ward C. [3 ]
机构
[1] CUNY, Dept Comp Sci, Grad Ctr, New York, NY 10016 USA
[2] CUNY Herbert H Lehman Coll, Dept Math & Comp Sci, Bronx, NY 10468 USA
[3] Amer Museum Nat Hist, Div Invertebrate Zool, New York, NY 10024 USA
基金
美国国家科学基金会;
关键词
Character compatibility; exact search; maximum-parsimony optimality criterion; phylogenetic islands; tree search; TREES; EVOLUTION; ALGORITHMS; COMPLEXITY;
D O I
10.1093/sysbio/syu065
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Finding the optimal evolutionary history for a set of taxa is a challenging computational problem, even when restricting possible solutions to be "tree-like" and focusing on the maximum-parsimony optimality criterion. This has led to much work on using heuristic tree searches to find approximate solutions. We present an approach for finding exact optimal solutions that employs and complements the current heuristic methods for finding optimal trees. Given a set of taxa and a set of aligned sequences of characters, there may be subsets of characters that are compatible, and for each such subset there is an associated (possibly partially resolved) phylogeny with edges corresponding to each character state change. These perfect phylogenies serve as anchor trees for our constrained search space. We show that, for sequences with compatible sites, the parsimony score of any tree T is at least the parsimony score of the anchor trees plus the number of inferred changes between T and the anchor trees. As the maximum-parsimony optimality score is additive, the sum of the lower bounds on compatible character partitions provides a lower bound on the complete alignment of characters. This yields a region in the space of trees within which the best tree is guaranteed to be found; limiting the search for the optimal tree to this region can significantly reduce the number of trees that must be examined in a search of the space of trees. We analyze this method empirically using four different biological data sets as well as surveying 400 data sets from the TreeBASE repository, demonstrating the effectiveness of our technique in reducing the number of steps in exact heuristic searches for trees under the maximum-parsimony optimality criterion.
引用
收藏
页码:56 / 65
页数:10
相关论文
共 50 条
  • [1] Improving Literature Searches
    Adorno, Marie
    Garbee, Deborah
    Marix, Mary L.
    [J]. CLINICAL NURSE SPECIALIST, 2016, 30 (02) : 74 - 80
  • [2] Improving animal phylogenies with genomic data
    Telford, Maximilian J.
    Copley, Richard R.
    [J]. TRENDS IN GENETICS, 2011, 27 (05) : 186 - 195
  • [3] OPTIMAL SEARCHES FOR ASTEROIDS
    TAFF, LG
    [J]. ICARUS, 1984, 57 (02) : 259 - 266
  • [4] Towards Assisted Moderation in Online Healthcare Social Networks: Improving Trust in YouTube Searches
    Canon, Daniel E.
    Lopez, Diego M.
    Blobel, Bernd
    [J]. PHEALTH 2014, 2014, 200 : 146 - 152
  • [5] Improving literature searches - Response
    Boks, Simone S.
    [J]. RADIOLOGY, 2007, 243 (01) : 301 - U16
  • [6] A SUGGESTION FOR IMPROVING PATENT OFFICE SEARCHES
    Thurber, W. L.
    [J]. JOURNAL OF THE PATENT OFFICE SOCIETY, 1918, 1 (04): : 167 - 168
  • [7] Improving conformational searches by geometric screening
    Zhang, M
    White, RA
    Wang, LQ
    Goldman, R
    Kavraki, L
    Hassett, B
    [J]. BIOINFORMATICS, 2005, 21 (05) : 624 - 630
  • [8] Improving reinforcement learning algorithms: Towards optimal learning rate policies
    Mounjid, Othmane
    Lehalle, Charles-Albert
    [J]. MATHEMATICAL FINANCE, 2024, 34 (02) : 588 - 621
  • [9] Improving Literature Searches in Gene Expression Studies
    Arrais, Joel P.
    Rodrigues, Joao G. L. M.
    Oliveira, Jose Luis
    [J]. 2ND INTERNATIONAL WORKSHOP ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY AND BIOINFORMATICS (IWPACBB 2008), 2009, 49 : 74 - 82
  • [10] Improving Web searches with distributed buckets structures
    Costa, V. Gil
    Printista, A. M.
    Marin, M.
    [J]. LA-WEB 06: FOURTH LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2006, : 119 - +