Do Alignment and Trimming Methods Matter for Phylogenomic (UCE) Analyses?

Cited by: 24
Authors
Portik, Daniel M. [1, 2]
Wiens, John J. [1]
Affiliations
[1] Univ Arizona, Dept Ecol & Evolutionary Biol, Tucson, AZ 85721 USA
[2] Calif Acad Sci, San Francisco, CA 94118 USA
Funding
U.S. National Science Foundation
Keywords
alignment; concatenated analysis; phylogenomics; sequence length heterogeneity; species-tree analysis; trimming; multiple sequence alignment; species tree estimation; squamate reptiles lizards; ultraconserved elements; revised classification; rapid radiation; nuclear loci; exon capture; missing data; snakes
DOI
10.1093/sysbio/syaa064
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Alignment is a crucial issue in molecular phylogenetics because different alignment methods can potentially yield very different topologies for individual genes. But it is unclear if the choice of alignment methods remains important in phylogenomic analyses, which incorporate data from hundreds or thousands of genes. For example, problematic biases in alignment might be multiplied across many loci, whereas alignment errors in individual genes might become irrelevant. The issue of alignment trimming (i.e., removing poorly aligned regions or missing data from individual genes) is also poorly explored. Here, we test the impact of 12 different combinations of alignment and trimming methods on phylogenomic analyses. We compare these methods using published phylogenomic data from ultraconserved elements (UCEs) from squamate reptiles (lizards and snakes), birds, and tetrapods. We compare the properties of alignments generated by different alignment and trimming methods (e.g., length, informative sites, missing data). We also test whether these data sets can recover well-established clades when analyzed with concatenated (RAxML) and species-tree methods (ASTRAL-III), using the full data (~5000 loci) and subsampled data sets (10% and 1% of loci). We show that different alignment and trimming methods can significantly impact various aspects of phylogenomic data sets (e.g., length, informative sites). However, these different methods generally had little impact on the recovery and support values for well-established clades, even across very different numbers of loci. Nevertheless, our results suggest several "best practices" for alignment and trimming. Intriguingly, the choice of phylogenetic methods impacted the phylogenetic results most strongly, with concatenated analyses recovering significantly more well-established clades (with stronger support) than the species-tree analyses.
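The alignment properties named in the abstract (length, informative sites, missing data) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function name `alignment_stats`, the choice to count `-`, `?`, and `N` as missing, and the treatment of other ambiguity codes as ordinary states are all assumptions made for illustration.

```python
from collections import Counter

# Assumption: gaps, '?', and 'N' are all treated as missing data.
MISSING = set("-?N")


def alignment_stats(seqs):
    """Summarize one aligned locus.

    seqs: dict mapping taxon name -> aligned sequence (all the same length).
    Returns alignment length, number of parsimony-informative sites,
    and the fraction of cells that are missing.
    """
    rows = [s.upper() for s in seqs.values()]
    if not rows:
        raise ValueError("alignment is empty")
    length = len(rows[0])
    if any(len(r) != length for r in rows):
        raise ValueError("sequences must be aligned to equal length")

    informative = 0
    missing = 0
    for column in zip(*rows):
        counts = Counter(c for c in column if c not in MISSING)
        missing += len(column) - sum(counts.values())
        # Parsimony-informative: at least two states, each seen in >= 2 taxa.
        if sum(1 for n in counts.values() if n >= 2) >= 2:
            informative += 1

    total_cells = length * len(rows)
    return {
        "length": length,
        "informative_sites": informative,
        "missing_fraction": missing / total_cells if total_cells else 0.0,
    }


# Hypothetical toy alignment, for illustration only.
toy = {
    "taxon_a": "ACGT-",
    "taxon_b": "ACGTN",
    "taxon_c": "ATGA?",
    "taxon_d": "ATGAA",
}
stats = alignment_stats(toy)
```

Running the same function on the output of each alignment and trimming combination would give per-locus summaries that can be compared across methods, which is the kind of comparison the abstract describes.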
Pages: 440-462
Number of pages: 23