Evolving better multiple sequence alignments

被引:0
|
作者
Sheneman, L [1 ]
Foster, JA [1 ]
机构
[1] Univ Idaho, Dept Comp Sci, IBEST, Moscow, ID 83844 USA
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Aligning multiple DNA or protein sequences is a fundamental step in the analyses of phylogeny, homology and molecular structure. Heuristic algorithms are applied because optimal multiple sequence alignment is prohibitively expensive. Heuristic alignment algorithms represent a practical trade-off between speed and accuracy, but they can be improved. We present EVALYN (EVolved ALYNments), a novel approach to multiple sequence alignment in which sequences are progressively aligned based on a guide tree optimized by a genetic algorithm. We hypothesize that a genetic algorithm can find better guide trees than traditional, deterministic clustering algorithms. We compare our novel evolutionary approach to CLUSTAL W and find that EVALYN performs consistently and significantly better as measured by a common alignment scoring technique. Additionally, we hypothesize that evolutionary guide tree optimization is inherently efficient and has less time complexity than the commonly-used neighbor-joining algorithm. We present a compelling analysis in support of this scalability hypothesis.
引用
收藏
页码:449 / 460
页数:12
相关论文
共 50 条
  • [1] Multiple sequence alignments
    Wallace, IM
    Blackshields, G
    Higgins, DG
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2005, 15 (03) : 261 - 266
  • [2] Multithreaded multiple sequence alignments
    Bai, Joanne
    Rezael, Siamak
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 2863 - 2866
  • [3] Building multiple sequence alignments with a flavor of HSSP alignments
    Higa, Roberto Hiroshi
    Braga da Cruz, Sergio Aparecido
    Kuser, Paula Regina
    Beleza Yamagishi, Michel Eduardo
    Fileto, Renato
    de Medeiros Oliveira, Stanley Robson
    Mazoni, Ivan
    dos Santos, Edgard Henrique
    Mancini, Adauto Luiz
    Neshich, Goran
    GENETICS AND MOLECULAR RESEARCH, 2006, 5 (01): : 127 - 137
  • [4] Using CLUSTAL for multiple sequence alignments
    Higgins, DG
    Thompson, JD
    Gibson, TJ
    COMPUTER METHODS FOR MACROMOLECULAR SEQUENCE ANALYSIS, 1996, 266 : 383 - 402
  • [5] Assessing the Discordance of Multiple Sequence Alignments
    Prakash, Amol
    Tompa, Martin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2009, 6 (04) : 542 - 551
  • [6] Evaluation measures of multiple sequence alignments
    Gonnet, GH
    Korostensky, C
    Benner, S
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (1-2) : 261 - 276
  • [7] Parallel computation for multiple sequence alignments
    Du, ZH
    Lin, F
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 300 - 303
  • [8] ADVANTAGES OF USING MULTIPLE SEQUENCE ALIGNMENTS OVER PAIRWISE ALIGNMENTS WHEN SEQUENCE SIMILARITY IS LOW
    BABBITT, PC
    DUNAWAYMARIANO, D
    KENYON, GL
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1992, 203 : 60 - BIOL
  • [9] ADVANTAGES OF USING MULTIPLE SEQUENCE ALIGNMENTS OVER PAIRWISE ALIGNMENTS WHEN SEQUENCE SIMILARITY IS LOW
    BABBITT, PC
    DUNAWAYMARIANO, D
    KENYON, GL
    BIOCHEMISTRY, 1992, 31 (07) : 2198 - 2198
  • [10] Progressive multiple sequence alignments from triplets
    Matthias Kruspe
    Peter F Stadler
    BMC Bioinformatics, 8