Fast Fitch-parsimony algorithms for large data sets

被引:23
|
作者
Ronquist, F [1 ]
机构
[1] Univ Uppsala, Dept Zool, SE75236 Uppsala, Sweden
关键词
D O I
10.1111/j.1096-0031.1998.tb00346.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The speed of analytical algorithms becomes increasingly important as systematists accumulate larger data sets. In this paper I discuss several time-saving modifications to published Fitch-parsimony tree search algorithms, including shortcuts that allow rapid evaluation of tree lengths and fast reoptimization of trees after clipping or joining of subtrees, as well as search strategies that allows one to successively increase the exhaustiveness of branch swapping. I also describe how Fitch-parsimony algorithms can be restructured to take full advantage of the computing power of modern microprocessors by horizontal or vertical packing of characters, allowing simultaneous processing of many characters, and by avoidance of conditional branches that disturb instruction flow. These new multicharacter algorithms are particularly useful for large data sets of characters with a small number of states, such as nucleotide characters. As an example, the multicharacter algorithms are estimated to be 3.6-10 times faster than single-character equivalents on a PowerPC 604. The speed gain is even larger on processors using MMX, Altivec or similar technologies allowing single instructions to be performed on multiple data simultaneously. (C) 1998 The Willi Hennig Society.
引用
收藏
页码:387 / 400
页数:14
相关论文
共 50 条
  • [1] Fast algorithms for nonparametric population modeling of large data sets
    Pillonetto, Gianluigi
    De Nicolao, Giuseppe
    Chierici, Marco
    Cobelli, Claudio
    [J]. AUTOMATICA, 2009, 45 (01) : 173 - 179
  • [2] Fast Dual Selection using Genetic Algorithms for Large Data Sets
    Ros, Frederic
    Harba, Rachid
    Pintore, Marco
    [J]. 2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, : 815 - 820
  • [3] Efficient algorithms for fast integration on large data sets from multiple sources
    Mi, Tian
    Rajasekaran, Sanguthevar
    Aseltine, Robert
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2012, 12
  • [4] Efficient algorithms for fast integration on large data sets from multiple sources
    Tian Mi
    Sanguthevar Rajasekaran
    Robert Aseltine
    [J]. BMC Medical Informatics and Decision Making, 12
  • [5] Engineering Algorithms for Large Data Sets
    Sanders, Peter
    [J]. SOFSEM 2013: Theory and Practice of Computer Science, 2013, 7741 : 29 - 32
  • [6] Empirical comparison of fast partitioning-based clustering algorithms for large data sets
    Wei, CP
    Lee, YH
    Hsu, CM
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2003, 24 (04) : 351 - 363
  • [7] Clustering Algorithms for Large Temporal Data Sets
    Scepi, Germana
    [J]. DATA ANALYSIS AND CLASSIFICATION, 2010, : 369 - 377
  • [8] GPU acceleration of Fitch’s parsimony on protein data: from Kepler to Turing
    Sergio Santander-Jiménez
    Miguel A. Vega-Rodríguez
    Antonio Zahinos-Márquez
    Leonel Sousa
    [J]. The Journal of Supercomputing, 2020, 76 : 9827 - 9853
  • [9] GPU acceleration of Fitch's parsimony on protein data: from Kepler to Turing
    Santander-Jimenez, Sergio
    Vega-Rodriguez, Miguel A.
    Zahinos-Marquez, Antonio
    Sousa, Leonel
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (12): : 9827 - 9853
  • [10] Fast algorithms using minimal data structures for common topological relationships in large, irregularly spaced topographic data sets
    Meyer, Thomas H.
    [J]. COMPUTERS & GEOSCIENCES, 2007, 33 (03) : 325 - 334