trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses

被引:6741
|
作者
Capella-Gutierrez, Salvador [1 ]
Silla-Martinez, Jose M. [1 ]
Gabaldon, Toni [1 ]
机构
[1] Ctr Genom Regulat CRG, Comparat Genom Grp, Bioinformat & Genom Programme, Barcelona 08003, Spain
关键词
SEQUENCE ALIGNMENTS; BLOCKS;
D O I
10.1093/bioinformatics/btp348
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Multiple sequence alignments are central to many areas of bioinformatics. It has been shown that the removal of poorly aligned regions from an alignment increases the quality of subsequent analyses. Such an alignment trimming phase is complicated in large-scale phylogenetic analyses that deal with thousands of alignments. Here, we present trimAl, a tool for automated alignment trimming, which is especially suited for large-scale phylogenetic analyses. trimAl can consider several parameters, alone or in multiple combinations, for selecting the most reliable positions in the alignment. These include the proportion of sequences with a gap, the level of amino acid similarity and, if several alignments for the same set of sequences are provided, the level of consistency across different alignments. Moreover, trimAl can automatically select the parameters to be used in each specific alignment so that the signal-to-noise ratio is optimized.
引用
下载
收藏
页码:1972 / 1973
页数:2
相关论文
共 50 条
  • [1] SMA: An efficient tool for large-scale multiple alignment
    Shen, Shiyi
    Dong, Liuhuan
    Wang, Kui
    Hu, Gang
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2836 - 2838
  • [2] CLAST: CUDA implemented large-scale alignment search tool
    Yano, Masahiro
    Mori, Hiroshi
    Akiyama, Yutaka
    Yamada, Takuji
    Kurokawa, Ken
    BMC BIOINFORMATICS, 2014, 15
  • [3] CLAST: CUDA implemented large-scale alignment search tool
    Masahiro Yano
    Hiroshi Mori
    Yutaka Akiyama
    Takuji Yamada
    Ken Kurokawa
    BMC Bioinformatics, 15
  • [4] ParBaum: Large-Scale Maximum Likelihood-Based Phylogenetic Analyses
    Ott, Michael
    Zola, Jaroslaw
    Aluru, Srinivas
    Stamatakis, Alexandros
    HIGH PERFORMANCE COMPUTING IN SCIENCE AND ENGINEERING, GARCH/MUNICH 2007, 2009, : 111 - +
  • [5] STEM: A software tool for large-scale proteomic data analyses
    Shinkawa, T
    Taoka, M
    Yamauchi, Y
    Ichimura, T
    Kaji, H
    Takahashi, N
    Isobe, T
    JOURNAL OF PROTEOME RESEARCH, 2005, 4 (05) : 1826 - 1831
  • [6] Large-scale phylogenetic analyses reveal the causes of high tropical amphibian diversity
    Pyron, R. Alexander
    Wiens, John J.
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2013, 280 (1770)
  • [7] DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony
    Wehe, Andre
    Bansal, Mukul S.
    Burleigh, J. Gordon
    Eulenstein, Oliver
    BIOINFORMATICS, 2008, 24 (13) : 1540 - 1541
  • [8] MINERVA: An automated resource provisioning tool for large-scale storage systems
    Alvarez, GA
    Borowsky, E
    Go, S
    Romer, TH
    Becker-Szendy, R
    Golding, R
    Merchant, A
    Spasojevic, M
    Veitch, A
    Wilkes, J
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2001, 19 (04): : 483 - 518
  • [9] VinJ: An Automated Tool for Large-Scale Software Vulnerability Data Generation
    Nong, Yu
    Yang, Haoran
    Chen, Feng
    Cai, Haipeng
    COMPANION PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, FSE COMPANION 2024, 2024, : 567 - 571
  • [10] Uncovering the boundaries of Campylobacter species through large-scale phylogenetic and nucleotide identity analyses
    Wu, Ruochen
    Payne, Michael
    Zhang, Li
    Lan, Ruiting
    MSYSTEMS, 2024,