A comparison of scoring functions for protein sequence profile alignment

被引:81
|
作者
Edgar, RC [1 ]
Sjölander, K [1 ]
机构
[1] Univ Calif Berkeley, Dept Bioengn, Berkeley, CA 94720 USA
关键词
D O I
10.1093/bioinformatics/bth090
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation:In recent years, several methods have been proposed for aligning two protein sequence profiles, with reported improvements in alignment accuracy and homolog discrimination versus sequence-sequence methods (e.g. BLAST) and profile-sequence methods (e.g. PSI-BLAST). Profile-profile alignment is also the iterated step in progressive multiple sequence alignment algorithms such as CLUSTALW. However, little is known about the relative performance of different profile-profile scoring functions. In this work, we evaluate the alignment accuracy of 23 different profile-profile scoring functions by comparing alignments of 488 pairs of sequences with identity less than or equal to30% against structural alignments. We optimize parameters for all scoring functions on the same training set and use profiles of alignments from both PSI-BLAST and SAM-T99. Structural alignments are constructed from a consensus between the FSSP database and CE structural aligner. We compare the results with sequence-sequence and sequence-profile methods, including BLAST and PSI-BLAST. Results: We find that profile-profile alignment gives an average improvement over our test set of typically 2-3% over profile-sequence alignment and similar to40% over sequence-sequence alignment. No statistically significant difference is seen in the relative performance of most of the scoring functions tested. Significantly better results are obtained with profiles constructed from SAM-T99 alignments than from PSI-BLAST alignments.
引用
收藏
页码:1301 / 1308
页数:8
相关论文
共 50 条
  • [31] Dynamic Scoring-Based Sequence Alignment for Process Diagnostics
    Esgin, Eren
    Karagoz, Pinar
    CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE, 2015, 9101 : 742 - 752
  • [32] Scoring Consensus of Multiple ECG Annotators by Optimal Sequence Alignment
    Haghpanahi, Masoumeh
    Sameni, Reza
    Borkholder, David A.
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 1855 - 1859
  • [33] Improved algorithms for DNA sequence alignment and revision of scoring matrix
    Bandyopadhyay, SS
    Paul, S
    Konar, A
    2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 485 - 490
  • [34] Learning Scoring Schemes for Sequence Alignment from Partial Examples
    Kim, Eagu
    Kececioglu, John
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2008, 5 (04) : 546 - 556
  • [35] Multiple DNA and protein sequence alignment based on segment-to-segment comparison
    Morgenstern, B
    Dress, A
    Werner, T
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (22) : 12098 - 12103
  • [36] Large-scale comparison of protein sequence alignment algorithms with structure alignments
    Sauder, JM
    Arthur, JW
    Dunbrack, RL
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2000, 40 (01) : 6 - 22
  • [37] A new approach to sequence comparison:: normalired sequence alignment
    Arslan, AN
    Egecioglu, Ö
    Pevzner, PA
    BIOINFORMATICS, 2001, 17 (04) : 327 - 337
  • [38] Scoring functions in protein folding and design
    Dima, RI
    Banavar, JR
    Maritan, A
    PROTEIN SCIENCE, 2000, 9 (04) : 812 - 819
  • [39] Parallel hardware for sequence comparison and alignment
    Hughey, R
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1996, 12 (06): : 473 - 479
  • [40] Derivative-free neural network for optimizing the scoring functions associated with dynamic programming of pairwise-profile alignment
    Yamada, Kazunori D.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2018, 13