Probabilistic description of protein alignments for sequences and structures

被引:4
|
作者
Koike, R
Kinoshita, K
Kidera, A
机构
[1] Yokohama City Univ, Grad Sch Integrated Sci, Yokohama, Kanagawa 2300045, Japan
[2] Kyoto Univ, Grad Sch Sci, Dept Chem, Sakyo Ku, Kyoto 6068502, Japan
[3] Japan Sci & Technol Agcy, PRESTO, Struct & Funct Biomol, Kawaguchi 3320012, Japan
关键词
protein sequence comparison; protein structure comparison; probabilistic alignment; periodic boundary; circular permutation;
D O I
10.1002/prot.20067
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A number of equally optimal alignments inherently exist in the sequence and structure comparisons among proteins. To represent the sub-optimal alignments systematically, we have developed a method of generating probabilistic alignments for sequences and structures, by which the correspondence between pairs of residues is evaluated in a probabilistic manner. Our method uses the periodic boundary condition to avoid the entropy artifact favoring full-length matches. In the structure comparison, the environmental effects are incorporated by the mean-field approximation. We applied this method in comparisons of two pairs of proteins with internal symmetry; the first set were proteins of TIM-barrel fold and the second were R-trefoil fold. These pairs are expected to have distinct sub-optimal alignments suitable for probabilistic description with the periodic boundary. It was shown that the sequence and structure alignments are consistent with each other and that the alignments with the highest probability represent circular permutation. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:157 / 166
页数:10
相关论文
共 50 条
  • [31] Automatic classification of protein structures relying on similarities between alignments
    Santini, Guillaume
    Soldano, Henry
    Pothier, Joel
    BMC BIOINFORMATICS, 2012, 13
  • [32] MotEvo: integrated Bayesian probabilistic methods for inferring regulatory sites and motifs on multiple alignments of DNA sequences
    Arnold, Phil
    Erb, Ionas
    Pachkov, Mikhail
    Molina, Nacho
    van Nimwegen, Erik
    BIOINFORMATICS, 2012, 28 (04) : 487 - 494
  • [33] NcDNAlign:: Plausible multiple alignments of non-protein-coding genomic sequences
    Rose, Dominic
    Hertel, Jana
    Reiche, Kristin
    Stadler, Peter F.
    Hackermueller, Joerg
    GENOMICS, 2008, 92 (01) : 65 - 74
  • [34] An approach to improving multiple alignments of protein sequences using predicted secondary structure
    Jennings, AJ
    Edge, CM
    Sternberg, MJE
    PROTEIN ENGINEERING, 2001, 14 (04): : 227 - 231
  • [35] A Tool for Computing Probabilistic Trace Alignments
    Bergami, Giacomo
    Maggi, Fabrizio Maria
    Montali, Marco
    Penaloza, Rafael
    INTELLIGENT INFORMATION SYSTEMS, CAISE FORUM 2021, 2021, 424 : 118 - 126
  • [36] Probabilistic annotation of protein sequences based on functional classifications
    Levy, ED
    Ouzounis, CA
    Gilks, WR
    Audit, B
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [37] Using the T-Coffee package to build multiple sequence alignments of protein, RNA, DNA sequences and 3D structures
    Jean-Francois Taly
    Cedrik Magis
    Giovanni Bussotti
    Jia-Ming Chang
    Paolo Di Tommaso
    Ionas Erb
    Jose Espinosa-Carrasco
    Carsten Kemena
    Cedric Notredame
    Nature Protocols, 2011, 6 : 1669 - 1682
  • [38] Alignments of RNA Structures
    Blin, Guillaume
    Denise, Alain
    Dulucq, Serge
    Herrbach, Claire
    Touzet, Helene
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2010, 7 (02) : 309 - 322
  • [39] Probabilistic annotation of protein sequences based on functional classifications
    Emmanuel D Levy
    Christos A Ouzounis
    Walter R Gilks
    Benjamin Audit
    BMC Bioinformatics, 6
  • [40] Using the T-Coffee package to build multiple sequence alignments of protein, RNA, DNA sequences and 3D structures
    Taly, Jean-Francois
    Magis, Cedrik
    Bussotti, Giovanni
    Chang, Jia-Ming
    Di Tommaso, Paolo
    Erb, Ionas
    Espinosa-Carrasco, Jose
    Kemena, Carsten
    Notredame, Cedric
    NATURE PROTOCOLS, 2011, 6 (11) : 1669 - 1682