Prediction of vitamin interacting residues in a vitamin binding protein using evolutionary information

被引:20
|
作者
Panwar, Bharat [1 ]
Gupta, Sudheer [1 ]
Raghava, Gajendra P. S. [1 ]
机构
[1] CSIR, Inst Microbial Technol, Bioinformat Ctr, Sect 39A, Chandigarh, India
来源
BMC BIOINFORMATICS | 2013年 / 14卷
关键词
Vitamin-interacting residue; Pyridoxal-5-phosphate; SVM; PSSM; VitaPred; PARASITE PLASMODIUM-FALCIPARUM; MACHINE-BASED METHOD; AMINO-ACID-RESIDUES; SUBCELLULAR-LOCALIZATION; DOPA DECARBOXYLASE; CRYSTAL-STRUCTURE; PRIMARY SEQUENCE; SITES; DNA; IDENTIFICATION;
D O I
10.1186/1471-2105-14-44
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The vitamins are important cofactors in various enzymatic-reactions. In past, many inhibitors have been designed against vitamin binding pockets in order to inhibit vitamin-protein interactions. Thus, it is important to identify vitamin interacting residues in a protein. It is possible to detect vitamin-binding pockets on a protein, if its tertiary structure is known. Unfortunately tertiary structures of limited proteins are available. Therefore, it is important to develop in-silico models for predicting vitamin interacting residues in protein from its primary structure. Results: In this study, first we compared protein-interacting residues of vitamins with other ligands using Two Sample Logo (TSL). It was observed that ATP, GTP, NAD, FAD and mannose preferred {G, R, K, S, H}, {G, K, T, S, D, N}, {T, G, Y}, {G, Y, W} and {Y, D, W, N, E} residues respectively, whereas vitamins preferred {Y, F, S, W, T, G, H} residues for the interaction with proteins. Furthermore, compositional information of preferred and non-preferred residues along with patterns-specificity was also observed within different vitamin-classes. Vitamins A, B and B6 preferred {F, I, W, Y, L, V}, {S, Y, G, T, H, W, N, E} and {S, T, G, H, Y, N} interacting residues respectively. It suggested that protein-binding patterns of vitamins are different from other ligands, and motivated us to develop separate predictor for vitamins and their sub-classes. The four different prediction modules, (i) vitamin interacting residues (VIRs), (ii) vitamin-A interacting residues (VAIRs), (iii) vitamin-B interacting residues (VBIRs) and (iv) pyridoxal-5-phosphate (vitamin B6) interacting residues (PLPIRs) have been developed. We applied various classifiers of SVM, BayesNet, NaiveBayes, ComplementNaiveBayes, NaiveBayesMultinomial, RandomForest and IBk etc., as machine learning techniques, using binary and Position-Specific Scoring Matrix (PSSM) features of protein sequences. Finally, we selected best performing SVM modules and obtained highest MCC of 0.53, 0.48, 0.61, 0.81 for VIRs, VAIRs, VBIRs, PLPIRs respectively, using PSSM-based evolutionary information. All the modules developed in this study have been trained and tested on non-redundant datasets and evaluated using five-fold cross-validation technique. The performances were also evaluated on the balanced and different independent datasets. Conclusions: This study demonstrates that it is possible to predict VIRs, VAIRs, VBIRs and PLPIRs from evolutionary information of protein sequence. In order to provide service to the scientific community, we have developed web-server and standalone software VitaPred (http://crdd.osdd.net/raghava/vitapred/).
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Prediction of vitamin interacting residues in a vitamin binding protein using evolutionary information
    Bharat Panwar
    Sudheer Gupta
    Gajendra P S Raghava
    BMC Bioinformatics, 14
  • [2] MOWGLI: prediction of protein-MannOse interacting residues With ensemble classifiers usinG evoLutionary Information
    Pai, Priyadarshini P.
    Mondal, Sukanta
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2016, 34 (10): : 2069 - 2083
  • [3] Prediction of FAD interacting residues in a protein from its primary sequence using evolutionary information
    Nitish K Mishra
    Gajendra PS Raghava
    BMC Bioinformatics, 11
  • [4] Prediction of FAD interacting residues in a protein from its primary sequence using evolutionary information
    Mishra, Nitish K.
    Raghava, Gajendra P. S.
    BMC BIOINFORMATICS, 2010, 11
  • [5] Random Forests for Prediction of DNA-Binding Residues in Protein Sequences Using Evolutionary Information
    Wang, Liangjiang
    FGCN: PROCEEDINGS OF THE 2008 SECOND INTERNATIONAL CONFERENCE ON FUTURE GENERATION COMMUNICATION AND NETWORKING, VOLS 1 AND 2, 2008, : 976 - 981
  • [6] Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information
    Chauhan, Jagat S.
    Mishra, Nitish K.
    Raghava, Gajendra P. S.
    BMC BIOINFORMATICS, 2010, 11
  • [7] Prediction of GTP interacting residues, dipeptides and tripeptides in a protein from its evolutionary information
    Jagat S Chauhan
    Nitish K Mishra
    Gajendra PS Raghava
    BMC Bioinformatics, 11
  • [8] Prediction of RNA-interacting residues in a protein using CNN and evolutionary profile
    Patiyal, Sumeet
    Dhall, Anjali
    Bajaj, Khushboo
    Sahu, Harshita
    Raghava, Gajendra P. S.
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [9] SVM based prediction of RNA-binding proteins using binding residues and evolutionary information
    Kumar, Manish
    Gromiha, M. Michael
    Raghava, Gajendra P. S.
    JOURNAL OF MOLECULAR RECOGNITION, 2011, 24 (02) : 303 - 313
  • [10] Multidomain protein structure prediction using information about residues interacting on multimeric protein interfaces
    Matsuno, Shumpei
    Ohue, Masahito
    Akiyama, Yutaka
    BIOPHYSICS AND PHYSICOBIOLOGY, 2020, 17 : 2 - 13