Markovian structures in biological sequence alignments

Cited by: 50
Authors
Liu, JS [1]
Neuwald, AF
Lawrence, CE
Affiliations
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[3] New York State Dept Hlth, Wadsworth Ctr Labs & Res, Biometr Lab, Albany, NY 12201 USA
Keywords
DNA sequence; evolution; Gibbs sampler; GTPase; hidden Markov model; MAP criterion; model selection; protein sequence; sequence comparisons
DOI
10.2307/2669673
Chinese Library Classification number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Subject classification code
020208; 070103; 0714
Abstract
The alignment of multiple homologous biopolymer sequences is crucial in research on protein modeling and engineering, molecular evolution, and prediction in terms of both gene function and gene product structure. In this article we provide a coherent view of the two recent models used for multiple sequence alignment, the hidden Markov model (HMM) and the block-based motif model, to develop a set of new algorithms that have both the sensitivity of the block-based model and the flexibility of the HMM. In particular, we decompose the standard HMM into two components: the insertion component, which is captured by the so-called "propagation model," and the deletion component, which is described by a deletion vector. Such a decomposition serves as a basis for rational compromise between biological specificity and model flexibility. Furthermore, we introduce a Bayesian model selection criterion that, in combination with the propagation model, genetic algorithm, and other computational aspects, forms the core of PROBE, a multiple alignment and database search methodology. The application of our method to a GTPase family of protein sequences yields an alignment that is confirmed by comparison with known tertiary structures.
Pages: 1-15
Number of pages: 15
Related papers
50 in total
  • [1] A comparative analysis of multiple sequence alignments for biological data
    Manzoor, Umar
    Shahid, Sarosh
    Zafar, Bassam
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26: S1781-S1789
  • [2] Parallel biological sequence alignments on the Cell Broadband Engine
    Sarje, Abhinav
    Aluru, Srinivas
    2008 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-8, 2008: 2047-2057
  • [3] Multiple Sequence Alignments Enhance Boundary Definition of RNA Structures
    Sabarinathan, Radhakrishnan
    Anthon, Christian
    Gorodkin, Jan
    Seemann, Stefan E.
    GENES, 2018, 9 (12)
  • [4] Multiple sequence alignments
    Wallace, IM
    Blackshields, G
    Higgins, DG
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2005, 15 (03): 261-266
  • [5] OPTIMAL SEQUENCE ALIGNMENTS
    FITCH, WM
    SMITH, TF
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1983, 80 (05): 1382-1386
  • [6] Biological Sequence Alignments: A Review of Hardware Accelerators and a New PE Computing Strategy
    Isa, M. N.
    Ahmad, M. I.
    Murad, S. A. Z.
    Ismail, R. C.
    Benkrid, K.
    2014 IEEE REGION 10 SYMPOSIUM, 2014: 39-44
  • [7] SEQUENCE DATABASE SEARCHES AND SEQUENCE ALIGNMENTS
    WATERMAN, M
    FASEB JOURNAL, 1994, 8 (07): A1260
  • [8] A Hardware Accelerator for the Fast Retrieval of DIALIGN Biological Sequence Alignments in Linear Space
    Boukerche, Azzedine
    Correa, Jan M.
    de Melo, Alba Cristina M. A.
    Jacobi, Ricardo P.
    IEEE TRANSACTIONS ON COMPUTERS, 2010, 59 (06): 808-821
  • [9] Building multiple sequence alignments with a flavor of HSSP alignments
    Higa, Roberto Hiroshi
    Braga da Cruz, Sergio Aparecido
    Kuser, Paula Regina
    Beleza Yamagishi, Michel Eduardo
    Fileto, Renato
    de Medeiros Oliveira, Stanley Robson
    Mazoni, Ivan
    dos Santos, Edgard Henrique
    Mancini, Adauto Luiz
    Neshich, Goran
    GENETICS AND MOLECULAR RESEARCH, 2006, 5 (01): 127-137
  • [10] Alignments of RNA Structures
    Blin, Guillaume
    Denise, Alain
    Dulucq, Serge
    Herrbach, Claire
    Touzet, Helene
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (02): 309-322