Protein multiple alignments: sequence-based versus structure-based programs

被引:21
|
作者
Carpentier, Mathilde [1 ]
Chomilier, Jacques [2 ]
机构
[1] Sorbonne Univ, MNHN, CNRS, Inst Systemat Evolut Biodiversite ISYEBI,EPHE, F-75005 Paris, France
[2] Sorbonne Univ, MNHN, CNRS,IRD, IMPMC, F-75005 Paris, France
基金
美国国家科学基金会;
关键词
CLUSTAL-W; ALGORITHM; DATABASE; CLASSIFICATION; DIALIGN; TOOL; PERFORMANCE; BENCHMARKS; MODEL; MAFFT;
D O I
10.1093/bioinformatics/btz236
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Multiple sequence alignment programs have proved to be very useful and have already been evaluated in the literature yet not alignment programs based on structure or both sequence and structure. In the present article we wish to evaluate the added value provided through considering structures. Results:"We compared the multiple alignments resulting from 25 programs either based on sequence, structure or both, to reference alignments deposited in five databases (BALIBASE 2 and 3, HOMSTRAD, OXBENCH and SISYPHUS). On the whole, the structure-based methods compute more reliable alignments than the sequence-based ones, and even than the sequence+structure-based programs whatever the databases. Two programs lead, MAMMOTH and MATRAS, nevertheless the performances of MUSTANG, MATT, 3DCOMB, TCOFFEE+TM_ALIGN and TCOFFEE+SAP are better for some alignments. The advantage of structure-based methods increases at low levels of sequence identity, or for residues in regular secondary structures or buried ones. Concerning gap management, sequence-based programs set less gaps than structure-based programs. Concerning the databases, the alignments of the manually built databases are more challenging for the programs.
引用
收藏
页码:3970 / 3980
页数:11
相关论文
共 50 条
  • [1] Comparative mapping of sequence-based and structure-based protein domains
    Ya Zhang
    John-Marc Chandonia
    Chris Ding
    Stephen R Holbrook
    [J]. BMC Bioinformatics, 6
  • [2] Comparative mapping of sequence-based and structure-based protein domains
    Zhang, Y
    Chandonia, JM
    Ding, C
    Holbrook, SR
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [3] HOMSTRAD: adding sequence information to structure-based alignments of homologous protein families
    de Bakker, PIW
    Bateman, A
    Burke, DF
    Miguel, RN
    Mizuguchi, K
    Shi, J
    Shirai, H
    Blundell, TL
    [J]. BIOINFORMATICS, 2001, 17 (08) : 748 - 749
  • [4] Comparing protein sequence-based and predicted secondary structure-based methods for identification of remote homologs
    Geetha, V
    Di Francesco, V
    Garnier, J
    Munson, PJ
    [J]. PROTEIN ENGINEERING, 1999, 12 (07): : 527 - 534
  • [5] Comparison of sequence-based and structure-based phylogenetic trees of homologous proteins: Inferences on protein evolution
    S Balaji
    N Srinivasan
    [J]. Journal of Biosciences, 2007, 32 : 83 - 96
  • [6] Comparison of sequence-based and structure-based phylogenetic trees of homologous proteins: Inferences on protein evolution
    Balaji, S.
    Srinivasan, N.
    [J]. JOURNAL OF BIOSCIENCES, 2007, 32 (01) : 83 - 96
  • [7] Iterative refinement of structure-based sequence alignments by Seed Extension
    Changhoon Kim
    Chin-Hsien Tai
    Byungkook Lee
    [J]. BMC Bioinformatics, 10
  • [8] Iterative refinement of structure-based sequence alignments by Seed Extension
    Kim, Changhoon
    Tai, Chin-Hsien
    Lee, Byungkook
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [9] S4: structure-based sequence alignments of SCOP superfamilies
    Casbon, J
    Saqi, MAS
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : D219 - D222
  • [10] Comparison of sequence-based and structure-based energy functions for the reversible folding of a peptide
    Cavalli, A
    Vendruscolo, M
    Paci, E
    [J]. BIOPHYSICAL JOURNAL, 2005, 88 (05) : 3158 - 3166