A divide-and-conquer method for scalable phylogenetic network inference from multilocus data

被引:11
|
作者
Zhu, Jiafan [1 ]
Liu, Xinhao [1 ]
Ogilvie, Huw A. [1 ]
Nakhleh, Luay K. [1 ,2 ]
机构
[1] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[2] Rice Univ, Dept BioSci, Houston, TX 77005 USA
关键词
TREES;
D O I
10.1093/bioinformatics/btz359
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Reticulate evolutionary histories, such as those arising in the presence of hybridization, are best modeled as phylogenetic networks. Recently developed methods allow for statistical inference of phylogenetic networks while also accounting for other processes, such as incomplete lineage sorting. However, these methods can only handle a small number of loci from a handful of genomes. Results In this article, we introduce a novel two-step method for scalable inference of phylogenetic networks from the sequence alignments of multiple, unlinked loci. The method infers networks on subproblems and then merges them into a network on the full set of taxa. To reduce the number of trinets to infer, we formulate a Hitting Set version of the problem of finding a small number of subsets, and implement a simple heuristic to solve it. We studied their performance, in terms of both running time and accuracy, on simulated as well as on biological datasets. The two-step method accurately infers phylogenetic networks at a scale that is infeasible with existing methods. The results are a significant and promising step towards accurate, large-scale phylogenetic network inference. Availability and implementation We implemented the algorithms in the publicly available software package PhyloNet (https://bioinfocs.rice.edu/PhyloNet). Supplementary information Supplementary data are available at Bioinformatics online.
引用
收藏
页码:I370 / I378
页数:9
相关论文
共 50 条
  • [41] N-graphs: Scalable topology and design of balanced divide-and-conquer algorithms
    Gorlatch, S
    [J]. PARALLEL COMPUTING, 1997, 23 (06) : 687 - 698
  • [42] A scalable divide-and-conquer algorithm combining coarse and fine-grain parallelization
    Goh, SK
    Sosa, CP
    St-Amant, A
    [J]. THEORETICAL CHEMISTRY ACCOUNTS, 1998, 99 (03) : 197 - 206
  • [43] Genetic-fuzzy data mining with divide-and-conquer strategy
    Hong, Tzung-Pei
    Chen, Chun-Hao
    Lee, Yeong-Chyi
    Wu, Yu-Lung
    [J]. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2008, 12 (02) : 252 - 265
  • [44] A scalable divide-and-conquer algorithm combining coarse and fine-grain parallelization
    Goh S.K.
    Sosa C.P.
    St-Amant A.
    [J]. Theoretical Chemistry Accounts, 1998, 99 (3) : 197 - 206
  • [45] A divide-and-conquer strategy in shape from shading problem
    Cho, SI
    Saito, H
    Ozawa, S
    [J]. 1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, : 413 - 419
  • [46] A STRUCTURE-PRESERVING DIVIDE-AND-CONQUER METHOD FOR PSEUDOSYMMETRIC MATRICES
    Benner, Peter
    Nakatsukasa, Yuji
    Penke, Carolin
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2023, 44 (03) : 1245 - 1270
  • [47] A divide-and-conquer method with approximate Fermi levels for parallel computations
    Takeshi Yoshikawa
    Hiromi Nakai
    [J]. Theoretical Chemistry Accounts, 2015, 134
  • [48] Applications of the semiempirical divide-and-conquer molecular orbital method.
    Dixon, SL
    Vincent, JJ
    van der Vaart, A
    Merz, KM
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1998, 215 : U207 - U207
  • [49] A divide-and-conquer method for space-time series prediction
    Deng, Min
    Yang, Wentao
    Liu, Qiliang
    Zhang, Yunfei
    [J]. JOURNAL OF GEOGRAPHICAL SYSTEMS, 2017, 19 (01) : 1 - 19
  • [50] The divide-and-conquer subgoal-ordering algorithm for speeding up logic inference
    Ledeniov, O
    Markovitch, S
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1998, 9 : 37 - 97