BCL::Align-Sequence alignment and fold recognition with a custom scoring function online

被引:13
|
作者
Dong, Elizabeth
Smith, Jarrod
Heinze, Sten
Alexander, Nathan
Meiler, Jens
机构
[1] Vanderbilt Univ, Dept Chem, Nashville, TN USA
[2] Vanderbilt Univ, Struct Biol Ctr, Nashville, TN USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
dynamic programming; fold recognition; multiple sequence alignment; Needleman-Wunsch algorithm; parametric sequence alignment;
D O I
10.1016/j.gene.2008.06.006
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
BCL::Align is a multiple sequence alignment tool that utilizes the dynamic programming method in combination with a customizable scoring function for sequence alignment and fold recognition. The scoring function is a weighted sum of the traditional PAM and BLOSUM scoring matrices, position-specific scoring matrices output by PSI-BIAST, secondary structure predicted by a variety of methods, chemical properties, and gap penalties. By adjusting the weights, the method can be tailored for fold recognition or sequence alignment tasks at different levels of sequence identity. A Monte Carlo algorithm was used to determine optimized weight sets for sequence alignment and fold recognition that most accurately reproduced the SABmark reference alignment test set. In an evaluation of sequence alignment performance, BCL::Align ranked best in alignment accuracy (Cline score of 22.90 for sequences in the Twilight Zone) when compared with Align-m, ClustalW, T-Coffee, and MUSCLE. ROC curve analysis indicates BCL::Align's ability to correctly recognize protein folds with over 80% accuracy. The flexibility of the program allows it to be optimized for specific classes of proteins (e.g. membrane proteins) or fold families (e.g. TIM-barrel proteins). BCL::Align is free for academic use and available online at http://www.meilerlab.org/. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:41 / 46
页数:6
相关论文
共 6 条
  • [1] Sequence alignment and fold recognition of fucosyltransferases
    Breton, C
    Oriol, R
    Imberty, A
    GLYCOBIOLOGY, 1996, 6 (07) : R7 - R12
  • [2] Fast protein fold recognition and accurate sequence-structure alignment
    Zimmer, R
    Thiele, R
    BIOINFORMATICS, 1997, 1278 : 137 - 146
  • [3] Structure-based evaluation of sequence comparison and fold recognition alignment accuracy
    Domingues, FS
    Lackner, P
    Andreeva, A
    Sippl, MJ
    JOURNAL OF MOLECULAR BIOLOGY, 2000, 297 (04) : 1003 - 1013
  • [4] Fold recognition and accurate sequence-structure alignment of sequences directing β-sheet proteins
    McDonnell, Andrew V.
    Menke, Matthew
    Palmer, Nathan
    King, Jonathan
    Cowen, Lenore
    Berger, Bonnie
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 63 (04) : 976 - 985
  • [5] Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments
    Zhou, HY
    Zhou, YQ
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 58 (02) : 321 - 328
  • [6] DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins
    Zhang, Chengxin
    Zheng, Wei
    Mortuza, S. M.
    Li, Yang
    Zhang, Yang
    BIOINFORMATICS, 2020, 36 (07) : 2105 - 2112