The utility of artificially evolved sequences in protein threading and fold recognition

被引:7
|
作者
Brylinski, Michal [1 ,2 ]
机构
[1] Louisiana State Univ, Dept Biol Sci, Baton Rouge, LA 70803 USA
[2] Louisiana State Univ, Ctr Computat & Technol, Baton Rouge, LA 70803 USA
关键词
Artificial sequences; Evolved sequences; Protein threading; Protein structure modeling; Template-based modeling; HIDDEN MARKOV-MODELS; STRUCTURE PREDICTION; LIKELY COMPLETENESS; DATA FUSION; ALIGNMENTS; RESIDUES; CLASSIFICATION; FINDSITE; LIBRARY; SERVER;
D O I
10.1016/j.jtbi.2013.03.018
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Template-based protein structure prediction plays an important role in Functional Genomics by providing structural models of gene products, which can be utilized by structure-based approaches to function inference. From a systems level perspective, the high structural coverage of gene products in a given organism is critical. Despite continuous efforts towards the development of more sensitive threading approaches, confident structural models cannot be constructed for a considerable fraction of proteins due to difficulties in recognizing low-sequence identity templates with a similar fold to the target. Here we introduce a new modeling stratagem, which employs a library of synthetic sequences to improve template ranking in fold recognition by sequence profile-based methods. We developed a new method for the optimization of generic protein-like amino acid sequences to stabilize the respective structures using a combined empirical scoring function, which is compatible with these commonly used in protein threading and fold recognition. We show that the artificially evolved sequences, whose average sequence identity to the wild-type sequences is as low as 13.8%, have significant capabilities to recognize the correct structures. Importantly, the quality of the corresponding threading alignments is comparable to these constructed using conventional wild-type approaches (the average TM-score is 0.48 and 0.54, respectively). Fold recognition that uses data fusion to combine ranks calculated for both wild-type and synthetic template libraries systematically improves the detection of structural analogs. Depending on the threading algorithm used, it yields on average 4-16% higher recognition rates than using the wildtype template library alone. Synthetic sequences artificially evolved for the template structures provide an orthogonal source of signal that could be exploited to detect these templates unrecognized by standard modeling techniques. It opens up new directions in the development of more sensitive threading methods with the enhanced capabilities of targeting difficult, midnight zone templates. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:77 / 88
页数:12
相关论文
共 50 条
  • [31] Protein fold recognition by total alignment probability
    Bienkowska, JR
    Yu, LH
    Zarakhovich, S
    Rogers, RG
    Smith, TF
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2000, 40 (03) : 451 - 462
  • [32] Classification Modeling and Recognition of Protein Fold Type
    Liu Yue
    Li Xiao-Qin
    Xu Hai-Song
    Qiao Hui
    ACTA PHYSICO-CHIMICA SINICA, 2009, 25 (12) : 2558 - 2564
  • [33] Improving protein fold recognition by random forest
    Taeho Jo
    Jianlin Cheng
    BMC Bioinformatics, 15
  • [34] Classification and recognition of Rossmann-fold protein
    Liu, Yue
    Li, Xiaoqin
    Xu, Haisong
    Qiao, Hui
    FBIE: 2008 INTERNATIONAL SEMINAR ON FUTURE BIOMEDICAL INFORMATION ENGINEERING, PROCEEDINGS, 2008, : 78 - 81
  • [35] A NEW CLASSIFIER FOR PROTEIN FOLD CLASS RECOGNITION
    Yanev, Nicola
    Traykov, Metodi
    Milanov, Peter
    Yurukov, Borislav
    COMPTES RENDUS DE L ACADEMIE BULGARE DES SCIENCES, 2018, 71 (07): : 885 - 892
  • [36] A novel ensemble of classifiers for protein fold recognition
    Nanni, Loris
    NEUROCOMPUTING, 2006, 69 (16-18) : 2434 - 2437
  • [37] Ensemble classifier for protein fold pattern recognition
    Shen, Hong-Bin
    Chou, Kuo-Chen
    BIOINFORMATICS, 2006, 22 (14) : 1717 - 1722
  • [38] A parallel classification framework for protein fold recognition
    Elham Hekmatnia
    Hedieh Sajedi
    Ali Habib Agahi
    Evolutionary Intelligence, 2020, 13 : 525 - 535
  • [39] A parallel classification framework for protein fold recognition
    Hekmatnia, Elham
    Sajedi, Hedieh
    Habib Agahi, Ali
    EVOLUTIONARY INTELLIGENCE, 2020, 13 (03) : 525 - 535
  • [40] Recognition of a protein fold in the context of the SCOP classification
    Dubchak, I
    Muchnik, I
    Mayor, C
    Dralyuk, I
    Kim, SH
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1999, 35 (04) : 401 - 407