Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information

Cited by: 44
Authors
Chauhan, Jagat S. [1 ]
Mishra, Nitish K. [1 ]
Raghava, Gajendra P. S. [1 ]
Affiliations
[1] Inst Microbial Technol IMTECH, Bioinformat Ctr, Chandigarh 160036, India
Source
BMC BIOINFORMATICS | 2010 / Volume 11
Keywords
SUPPORT VECTOR MACHINES; AMINO-ACID-COMPOSITION; DNA-BINDING PROTEINS; ATP-BINDING; GUANINE; SITES; DISCRIMINATION; IDENTIFICATION; RECEPTOR; ADENINE;
DOI
10.1186/1471-2105-11-301
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Guanosine triphosphate (GTP)-binding proteins play an important role in the regulation of G-proteins. Predicting the GTP-interacting residues in a protein is therefore one of the major challenges in computational biology. In this study, we attempt to develop a computational method that predicts GTP-interacting residues in a protein with high accuracy (Acc), precision (Prec) and recall (Rc). Results: All models developed in this study were trained and tested on a non-redundant (40% similarity) dataset using five-fold cross-validation. First, we developed neural network based models using the single sequence and the PSSM profile as input, which achieved a maximum Matthews Correlation Coefficient (MCC) of 0.24 (Acc 61.30%) and 0.39 (Acc 68.88%), respectively. Second, we developed support vector machine (SVM) based models using the single sequence and the PSSM profile, which achieved a maximum MCC of 0.37 (Prec 0.73, Rc 0.57, Acc 67.98%) and 0.55 (Prec 0.80, Rc 0.73, Acc 77.17%), respectively. This work also introduces, for the first time, the concept of predicting GTP-interacting dipeptides (two consecutive GTP-interacting residues) and tripeptides (three consecutive GTP-interacting residues). An SVM based model for predicting GTP-interacting dipeptides from the PSSM profile achieved an MCC of 0.64 with precision 0.87, recall 0.74 and accuracy 81.37%. Similarly, an SVM based model for predicting GTP-interacting tripeptides from the PSSM profile achieved an MCC of 0.70 with precision 0.93, recall 0.73 and accuracy 83.98%. Conclusion: These results show that the PSSM based method performs better than the single-sequence based method, and that prediction models based on dipeptides or tripeptides are more accurate than the traditional model based on single residues. A web server, "GTPBinder" (http://www.imtech.res.in/raghava/gtpbinder/), based on the above models has been developed for predicting GTP-interacting residues in a protein.
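The abstract defines a GTP-interacting dipeptide (tripeptide) as two (three) consecutive GTP-interacting residues and reports MCC, precision, recall and accuracy. The following is a minimal Python sketch of those two ideas, not the authors' implementation. It assumes per-residue annotations are given as a list of 0/1 flags; the function names `label_kmers` and `binary_metrics` are hypothetical and introduced only for illustration.

```python
import math

def label_kmers(residue_labels, k):
    """Label every window of k consecutive residues.

    Assumption: residue_labels is a list of 0/1 flags (1 = GTP-interacting).
    A window is positive (1) only if all k residues in it are interacting,
    matching the abstract's definition for k = 2 and k = 3.
    """
    return [int(all(residue_labels[i:i + k]))
            for i in range(len(residue_labels) - k + 1)]

def binary_metrics(y_true, y_pred):
    """Compute precision, recall, accuracy and MCC for binary labels."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    tn = sum(t == 0 and p == 0 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    accuracy = (tp + tn) / len(y_true)
    # Matthews Correlation Coefficient; defined as 0 when the denominator is 0.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"precision": precision, "recall": recall,
            "accuracy": accuracy, "mcc": mcc}

# Toy annotation for an 8-residue sequence (illustrative values only).
residues = [0, 1, 1, 1, 0, 1, 1, 0]
dipeptides = label_kmers(residues, 2)
tripeptides = label_kmers(residues, 3)
```

Because a window is positive only when every residue in it interacts, tripeptide positives are a stricter subset of the signal than dipeptide positives, which is one plausible reason the reported precision rises from single residues to dipeptides to tripeptides.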
Pages: 9
Related papers
50 items in total
  • [41] Improving the accuracy of transmembrane protein topology prediction using evolutionary information
    Jones, David T.
    BIOINFORMATICS, 2007, 23 (05) : 538 - 544
  • [43] SVM based prediction of RNA-binding proteins using binding residues and evolutionary information
    Kumar, Manish
    Gromiha, M. Michael
    Raghava, Gajendra P. S.
    JOURNAL OF MOLECULAR RECOGNITION, 2011, 24 (02) : 303 - 313
  • [44] Enhancing the prediction of protein pairings between interacting families using orthology information
    Izarzugaza, Jose M. G.
    Juan, David
    Pons, Carles
    Pazos, Florencio
    Valencia, Alfonso
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [45] Improving prediction of protein subcellular localization using evolutionary information and sequence-order information
    Wang, Minghui
    Li, Ao
    Xie, Dan
    Fan, Zhewen
    Jiang, Zhaohui
    Feng, Huanqing
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 4434 - 4436
  • [46] Predicting Self-Interacting Proteins Using a Recurrent Neural Network and Protein Evolutionary Information
    An, Ji-Yong
    Zhou, Yong
    Yan, Zi-Ji
    Zhao, Yu-Jun
    EVOLUTIONARY BIOINFORMATICS, 2020, 16
  • [48] Predicting RNA-binding residues from evolutionary information and sequence conservation
    Huang, Yu-Feng
    Chiu, Li-Yuan
    Huang, Chun-Chin
    Huang, Chien-Kang
    BMC GENOMICS, 2010, 11
  • [49] CHARACTERIZATION OF A PEPTIDASE FROM LACTOCOCCUS-LACTIS SSP CREMORIS HP THAT HYDROLYZES DIPEPTIDES AND TRIPEPTIDES CONTAINING PROLINE OR HYDROPHOBIC RESIDUES AS THE AMINOTERMINAL AMINO-ACID
    BAANKREIS, R
    EXTERKATE, FA
    SYSTEMATIC AND APPLIED MICROBIOLOGY, 1991, 14 (04) : 317 - 323
  • [50] Combining Evolutionary Information and an Iterative Sampling Strategy for Accurate Protein Structure Prediction
    Braun, Tatjana
    Leman, Julia Koehler
    Lange, Oliver F.
    PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (12)