Multiple sequence alignment accuracy and phylogenetic inference

Cited by: 172
Authors
Ogden, TH [1]
Rosenberg, MS
Affiliations
[1] Arizona State Univ, Biodesign Inst, Ctr Evolut Funct Genom, Tempe, AZ 85287 USA
[2] Arizona State Univ, Sch Life Sci, Tempe, AZ 85287 USA
Keywords
Bayesian; maximum likelihood; maximum parsimony; multiple sequence alignment; neighbor joining; phylogenetics; simulation; tree reconstruction
DOI
10.1080/10635150500541730
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Phylogenies are often thought to be more dependent upon the specifics of the sequence alignment rather than on the method of reconstruction. Simulation of sequences containing insertion and deletion events was performed in order to determine the role that alignment accuracy plays during phylogenetic inference. Data sets were simulated for pectinate, balanced, and random tree shapes under different conditions (ultrametric equal branch length, ultrametric random branch length, nonultrametric random branch length). Comparisons between hypothesized alignments and true alignments enabled determination of two measures of alignment accuracy, that of the total data set and that of individual branches. In general, our results indicate that as alignment error increases, topological accuracy decreases. This trend was much more pronounced for data sets derived from more pectinate topologies. In contrast, for balanced, ultrametric, equal branch length tree shapes, alignment inaccuracy had little average effect on tree reconstruction. These conclusions are based on average trends of many analyses under different conditions, and any one specific analysis, independent of the alignment accuracy, may recover very accurate or inaccurate topologies. Maximum likelihood and Bayesian methods, in general, outperformed neighbor joining and maximum parsimony in terms of tree reconstruction accuracy. Results also indicated that as the length of the branch and of the neighboring branches increases, alignment accuracy decreases, and the length of the neighboring branches is the major factor in topological accuracy. Thus, multiple-sequence alignment can be an important factor in downstream effects on topological reconstruction.
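The abstract compares hypothesized alignments against true (simulated) alignments but does not give the formula used. As an illustration only, the Python sketch below assumes a sum-of-pairs style measure: the fraction of residue pairs aligned in the true alignment that are also aligned in the hypothesized one. The function names `aligned_pairs` and `pair_accuracy` are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def aligned_pairs(alignment, gap="-"):
    """Return the set of residue pairs placed in the same column.

    alignment: list of equal-length gapped strings, one per sequence.
    Each pair is ((seq_i, residue_i), (seq_j, residue_j)) with seq_i < seq_j,
    where residue_* counts ungapped positions within that sequence.
    """
    n_cols = len(alignment[0])
    if any(len(row) != n_cols for row in alignment):
        raise ValueError("all rows must have the same length")
    counters = [0] * len(alignment)  # next ungapped residue index per sequence
    pairs = set()
    for col in range(n_cols):
        present = []
        for seq_idx, row in enumerate(alignment):
            if row[col] != gap:
                present.append((seq_idx, counters[seq_idx]))
                counters[seq_idx] += 1
        # Every pair of residues sharing this column is treated as aligned.
        pairs.update(combinations(present, 2))
    return pairs


def pair_accuracy(true_alignment, hypothesized_alignment):
    """Fraction of true aligned residue pairs recovered by the hypothesis.

    Assumed measure for illustration; not necessarily the paper's definition.
    """
    true_pairs = aligned_pairs(true_alignment)
    if not true_pairs:
        return 1.0  # nothing to recover
    hyp_pairs = aligned_pairs(hypothesized_alignment)
    return len(true_pairs & hyp_pairs) / len(true_pairs)


if __name__ == "__main__":
    true_aln = ["ACGT", "ACGT"]
    hyp_aln = ["ACG-T", "AC-GT"]  # the G residues are wrongly split apart
    print(pair_accuracy(true_aln, hyp_aln))  # 0.75
```

Restricting the pair set to sequences on either side of a given branch would give a per-branch variant, in the spirit of the second measure the abstract mentions, though the details of that restriction are likewise an assumption here.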
Pages: 314-328
Page count: 15