Evaluation measures of multiple sequence alignments

Cited by: 20
Authors
Gonnet, GH [1]
Korostensky, C [1]
Benner, S [1]
Affiliations
[1] ETH Zurich, Inst Comp Sci, CH-8092 Zurich, Switzerland
Keywords
multiple sequence alignment; phylogenetic tree; scoring function; TSP; evolution;
DOI
10.1089/10665270050081513
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Multiple sequence alignments (MSAs) are frequently used in the study of families of protein sequences or DNA/RNA sequences. They are a fundamental tool for understanding the structure, functionality and, ultimately, the evolution of proteins. A new algorithm, the Circular Sum (CS) method, is presented for formally evaluating the quality of an MSA. It is based on the use of a solution to the Traveling Salesman Problem, which identifies a circular tour through an evolutionary tree connecting the sequences in a protein family. With this approach, the calculation of an evolutionary tree and the errors that it would introduce can be avoided altogether. The algorithm gives an upper bound, the best score that can possibly be achieved by any MSA for a given set of protein sequences. Alternatively, if presented with a specific MSA, the algorithm provides a formal score for the MSA, which serves as an absolute measure of the quality of the MSA. The CS measure yields a direct connection between an MSA and the associated evolutionary tree. The measure can be used as a tool for evaluating different methods for producing MSAs. A brief example of the last application is provided. Because it weights all evolutionary events on a tree identically, but does not require the reconstruction of a tree, the CS algorithm has advantages over the frequently used sum-of-pairs measures for scoring MSAs, which weight some evolutionary events more strongly than others. Compared to other weighted sum-of-pairs measures, it has the advantage that no evolutionary tree must be constructed, because we can find a circular tour without knowing the tree.
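The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions made for illustration only: the toy scoring values (`MATCH`, `MISMATCH`, `GAP`), the linear gap cost, the brute-force tour search (practical only for a handful of sequences), and the choice of the tour that maximizes the summed optimal pairwise scores. Because a circular tour around a tree crosses each edge twice, the summed tour score is halved here. All function names are hypothetical.

```python
from itertools import permutations

# Toy scoring values; assumptions for illustration, not from the paper.
MATCH, MISMATCH, GAP = 1, -1, -2

def induced_pair_score(row_a, row_b):
    """Score the pairwise alignment that an MSA induces on two of its rows."""
    score = 0
    for a, b in zip(row_a, row_b):
        if a == "-" and b == "-":
            continue  # a column gapped in both rows is dropped
        if a == "-" or b == "-":
            score += GAP
        else:
            score += MATCH if a == b else MISMATCH
    return score

def optimal_pair_score(s, t):
    """Global (Needleman-Wunsch) alignment score with a linear gap cost."""
    prev = [j * GAP for j in range(len(t) + 1)]
    for i in range(1, len(s) + 1):
        cur = [i * GAP]
        for j in range(1, len(t) + 1):
            sub = MATCH if s[i - 1] == t[j - 1] else MISMATCH
            cur.append(max(prev[j - 1] + sub, prev[j] + GAP, cur[j - 1] + GAP))
        prev = cur
    return prev[-1]

def best_circular_tour(seqs):
    """Brute-force search for the circular order with the highest summed score."""
    n = len(seqs)
    opt = [[optimal_pair_score(seqs[i], seqs[j]) for j in range(n)]
           for i in range(n)]
    best_tour, best_total = None, None
    for rest in permutations(range(1, n)):  # fix sequence 0 as the start
        tour = (0,) + rest
        total = sum(opt[tour[k]][tour[(k + 1) % n]] for k in range(n))
        if best_total is None or total > best_total:
            best_tour, best_total = tour, total
    return best_tour, best_total

def circular_sum_scores(msa_rows):
    """Return (tour, upper bound, score of the given MSA) along the tour."""
    seqs = [row.replace("-", "") for row in msa_rows]
    tour, total = best_circular_tour(seqs)
    n = len(tour)
    upper_bound = total / 2  # each tree edge is crossed twice by the tour
    msa_score = sum(
        induced_pair_score(msa_rows[tour[k]], msa_rows[tour[(k + 1) % n]])
        for k in range(n)
    ) / 2
    return tour, upper_bound, msa_score
```

In this sketch the score of a given MSA can never exceed the upper bound, since each induced pairwise alignment scores at most as much as the optimal pairwise alignment of the same two sequences. The gap between the two values is one way to read the "absolute measure of quality" the abstract mentions.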
Pages: 261-276
Number of pages: 16