A unified statistical model of protein multiple sequence alignment integrating direct coupling and insertions

被引:5
|
作者
Kinjo, Akira R. [1 ]
机构
[1] Osaka Univ, Inst Prot Res, 3-2 Yamadaoka, Suita, Osaka 5650871, Japan
关键词
long-range interactions; short-range interactions; molecular evolution; protein structure; sequence conservation;
D O I
10.2142/biophysico.13.0_45
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
The multiple sequence alignment (MSA) of a protein family provides a wealth of information in terms of the conservation pattern of amino acid residues not only at each alignment site but also between distant sites. In order to statistically model the MSA incorporating both short-range and long-range correlations as well as insertions, I have derived a lattice gas model of the MSA based on the principle of maximum entropy. The partition function, obtained by the transfer matrix method with a mean-field approximation, accounts for all possible alignments with all possible sequences. The model parameters for short-range and long-range interactions were determined by a self-consistent condition and by a Gaussian approximation, respectively. Using this model with and without long-range interactions, I analyzed the globin and V-set domains by increasing the "temperature" and by "mutating" a site. The correlations between residue conservation and various measures of the system's stability indicate that the long-range interactions make the conservation pattern more specific to the structure, and increasingly stabilize better conserved residues.
引用
收藏
页码:45 / 62
页数:18
相关论文
共 50 条
  • [41] A parallel hybrid genetic algorithm for multiple protein sequence alignment
    Nguyen, HD
    Yoshihara, I
    Yamamori, K
    Yasunaga, M
    [J]. CEC'02: PROCEEDINGS OF THE 2002 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2002, : 309 - 314
  • [42] Protein Multiple Sequence Alignment Based on Secondary Structure Similarity
    Hamidi, Sarvenaz
    Naghibzadeh, Mahmoud
    Sadri, Javad
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1224 - 1229
  • [43] OXBench: A benchmark for evaluation of protein multiple sequence alignment accuracy
    GPS Raghava
    Stephen MJ Searle
    Patrick C Audley
    Jonathan D Barber
    Geoffrey J Barton
    [J]. BMC Bioinformatics, 4
  • [44] PROTEIN MULTIPLE SEQUENCE ALIGNMENT AND FLEXIBLE PATTERN-MATCHING
    BARTON, GJ
    [J]. METHODS IN ENZYMOLOGY, 1990, 183 : 403 - 428
  • [45] Influence of Parameters in Multiple Sequence Alignment Methods for Protein Sequences
    Manikandan, P.
    Ramyachitra, D.
    [J]. PROGRESS IN COMPUTING, ANALYTICS AND NETWORKING, ICCAN 2017, 2018, 710 : 183 - 191
  • [46] Intuitionistic fuzzy approach improve protein multiple sequence alignment
    Hajieghrari, Behzad
    Farrokhi, Naser
    Kamalizadeh, Mojahed
    [J]. NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2021, 10 (01):
  • [47] Splice-Aware Multiple Sequence Alignment of Protein Isoforms
    Nord, Alex
    Hornbeck, Peter
    Carey, Kaitlin
    Wheeler, Travis
    [J]. ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 200 - 210
  • [48] MULTIPLE DNA AND PROTEIN-SEQUENCE ALIGNMENT ON A WORKSTATION AND A SUPERCOMPUTER
    TAJIMA, K
    [J]. COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1988, 4 (04): : 467 - 471
  • [49] Evidence of Statistical Inconsistency of Phylogenetic Methods in the Presence of Multiple Sequence Alignment Uncertainty
    Hossain, A. S. Md Mukarram
    Blackburne, Benjamin P.
    Shah, Abhijeet
    Whelan, Simon
    [J]. GENOME BIOLOGY AND EVOLUTION, 2015, 7 (08): : 2102 - 2116
  • [50] A probabilistic model of local sequence alignment that simplifies statistical significance estimation
    Eddy, Sean R.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (05)