Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information

被引:44
|
作者
Chauhan, Jagat S. [1 ]
Mishra, Nitish K. [1 ]
Raghava, Gajendra P. S. [1 ]
机构
[1] Inst Microbial Technol IMTECH, Bioinformat Ctr, Chandigarh 160036, India
来源
BMC BIOINFORMATICS | 2010年 / 11卷
关键词
SUPPORT VECTOR MACHINES; AMINO-ACID-COMPOSITION; DNA-BINDING PROTEINS; ATP-BINDING; GUANINE; SITES; DISCRIMINATION; IDENTIFICATION; RECEPTOR; ADENINE;
D O I
10.1186/1471-2105-11-301
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Guanosine triphosphate (GTP)-binding proteins play an important role in regulation of G-protein. Thus prediction of GTP interacting residues in a protein is one of the major challenges in the field of the computational biology. In this study, an attempt has been made to develop a computational method for predicting GTP interacting residues in a protein with high accuracy (Acc), precision (Prec) and recall (Rc). Result: All the models developed in this study have been trained and tested on a non-redundant (40% similarity) dataset using five-fold cross-validation. Firstly, we have developed neural network based models using single sequence and PSSM profile and achieved maximum Matthews Correlation Coefficient (MCC) 0.24 (Acc 61.30%) and 0.39 (Acc 68.88%) respectively. Secondly, we have developed a support vector machine (SVM) based models using single sequence and PSSM profile and achieved maximum MCC 0.37 (Prec 0.73, Rc 0.57, Acc 67.98%) and 0.55 (Prec 0.80, Rc 0.73, Acc 77.17%) respectively. In this work, we have introduced a new concept of predicting GTP interacting dipeptide (two consecutive GTP interacting residues) and tripeptide (three consecutive GTP interacting residues) for the first time. We have developed SVM based model for predicting GTP interacting dipeptides using PSSM profile and achieved MCC 0.64 with precision 0.87, recall 0.74 and accuracy 81.37%. Similarly, SVM based model have been developed for predicting GTP interacting tripeptides using PSSM profile and achieved MCC 0.70 with precision 0.93, recall 0.73 and accuracy 83.98%. Conclusion: These results show that PSSM based method performs better than single sequence based method. The prediction models based on dipeptides or tripeptides are more accurate than the traditional model based on single residue. A web server "GTPBinder" http://www.imtech.res.in/raghava/gtpbinder/based on above models has been developed for predicting GTP interacting residues in a protein.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Robust and accurate prediction of protein-protein interactions by exploiting evolutionary information
    Li, Yang
    Wang, Zheng
    Li, Li-Ping
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhan, Xin-Ke
    Wang, Yan-Bin
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [32] Prediction of DNA-binding residues from protein sequence information using random forests
    Wang, Liangjiang
    Yang, Mary Qu
    Yang, Jack Y.
    BMC GENOMICS, 2009, 10
  • [33] Prediction of DNA-binding residues from protein sequence information using random forests
    Liangjiang Wang
    Mary Qu Yang
    Jack Y Yang
    BMC Genomics, 10
  • [34] Prediction of protein-protein interactions by label propagation with protein evolutionary and chemical information derived from heterogeneous network
    Wen, Yu-Ting
    Lei, Hai-Jun
    You, Zhu-Hong
    Lei, Bai-Ying
    Chen, Xing
    Li, Li-Ping
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 430 : 9 - 20
  • [35] Ensemble Architecture for Prediction of Enzyme-ligand Binding Residues Using Evolutionary Information
    Pai, Priyadarshini P.
    Dattatreya, Rohit Kadam
    Mondal, Sukanta
    MOLECULAR INFORMATICS, 2017, 36 (11)
  • [36] Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs
    Varnai, Csilla
    Burkoff, Nikolas S.
    Wild, David L.
    PLOS ONE, 2017, 12 (02):
  • [37] Protein folding computed from evolutionary information
    Sander, Chris
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2021, 77 : A256 - A256
  • [38] CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information
    Li, Li-Ping
    Zhang, Bo
    Cheng, Li
    FRONTIERS IN GENETICS, 2022, 13
  • [39] PAIRpred: Partner-specific prediction of interacting residues from sequence and structure
    Minhas, Fayyaz ul Amir Afsar
    Geiss, Brian J.
    Ben-Hur, Asa
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2014, 82 (07) : 1142 - 1155
  • [40] TERTIARY STRUCTURAL CONSTRAINTS ON PROTEIN EVOLUTIONARY DIVERSITY - TEMPLATES, KEY RESIDUES AND STRUCTURE PREDICTION
    OVERINGTON, J
    JOHNSON, MS
    SALI, A
    BLUNDELL, TL
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1990, 241 (1301) : 132 - 145