SSALN: An alignment algorithm using structure-dependent substitution matrices and gap penalties learned from structurally aligned protein pairs

被引:69
|
作者
Qiu, J [1 ]
Elber, R [1 ]
机构
[1] Cornell Univ, Dept Comp Sci, Ithaca, NY 14853 USA
关键词
alignment accuracy; secondary structure; relative solvent accessibility; position-specific gap penalties; structurally aligned protein pairs;
D O I
10.1002/prot.20854
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In template-based modeling of protein structures, the generation of the alignment between the target and the template is a critical step that significantly affects the accuracy of the final model. This paper proposes an alignment algorithm SSALN that learns substitution matrices and position-specific gap penalties from a database of structurally aligned protein pairs. In addition to the amino acid sequence information, secondary structure and solvent accessibility information of a position are used to derive substitution scores and position-specific gap penalties. In a test set of CASP5 targets, SSALN outperforms sequence alignment methods such as a Smith-Waterman algorithm with BLOSUM50 and PSI BLAST. SSALN also generates better alignments than PSI BLAST in the CASP6 test set. LOOPP server prediction based on an SSALN alignment is ranked the best for target T0280_1 in CASP6. SSALN is also compared with several threading methods and sequence alignment methods on the ProSup benchmark. SSALN has the highest alignment accuracy among the methods compared. On the Fischer's benchmark, SSALN performs better than CLUSTALW and GenTHREADER, and generates more alignments with accuracy > 50%, > 60% or > 70% than FUGUE, but fewer alignments with accuracy > 80% than FUGUE. All the supplemental materials can be found at http://www.cs.cornell. edu/-similar to jianq/research.htm.
引用
收藏
页码:881 / 891
页数:11
相关论文
共 4 条