Classification of conformational stability of protein mutants from 2D graph representation of protein sequences using support vector machines

被引:5
|
作者
Fernandez, M. [1 ]
Caballero, J.
Fernandez, L.
Abreu, J. I.
Acostas, G.
机构
[1] Univ Matanzas, Ctr Biotechnol Studies, Fac Agron, Mol Modelling Grp, Matanzas 44740, Cuba
[2] Univ Talca, Ctr Bioinformat & Simulac Mol, Talca, Chile
[3] Univ Matanzas, Fac Informat, Artificial Intelligence Lab, Matanzas 44740, Cuba
[4] Natl Bioinformat Ctr, Havana 10200, Cuba
关键词
protein stability prediction; point mutations; kernel-based methods; graph similarity;
D O I
10.1080/08927020701377070
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Euclidean distance counts derived from the protein 2D graphs were used for encoding protein structural information. A total of 35 amino acid 2D distance count (AA2DC) descriptors were calculated from the Euclidean distance matrices (EDM) derived from the 2D graphs at distances ranging from 0.05 to 1.8 units with a lag of 0.05 units. AA2DC descriptors were tested for building predictive classification model of the signs of the change of thermal unfolding Gibbs free energy change (Delta Delta G) of a large data set of 2048 single point mutations on 64 proteins. A support vector machine (SVM) classifier with a Radial Basis Function kernel was implemented for classifying the conformational stability of protein mutants. Temperature and pH of the Delta Delta G experimental measurements were also conveniently used for SVM training in addition to calculated AA2DC descriptors. The optimum SVM model correctly predicted about 72% of Delta Delta G signs in crossvalidation test for all the dataset and also for stable and unstable mutant separately. To the best of our knowledge, this level of accuracy for stable mutant recognition is the highest ever reported for a predictor using sequence information. Furthermore, the classifier adequately recognized unstable mutants of human prion protein and human transthyretin associated to diseases.
引用
收藏
页码:889 / 896
页数:8
相关论文
共 50 条
  • [21] Classification of LIBS Protein Spectra Using Support Vector Machines and Adaptive Local Hyperplanes
    Vance, Tia
    Reljin, Natasa
    Lazarevic, Aleksandar
    Pokrajac, Dragoljub
    Kecman, Vojislav
    Melikechi, Noureddine
    Marcano, Aristides
    Markushin, Yuri
    McDaniel, Samantha
    [J]. 2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [22] PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences
    Wang, Yanbin
    You, Zhuhong
    Li, Xiao
    Chen, Xing
    Jiang, Tonghai
    Zhang, Jingting
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (05)
  • [23] Multi-class protein subcellular localization classification using support vector machines
    Meng, PW
    Rajapakse, JC
    [J]. PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2005, : 526 - 533
  • [24] Prediction of protein-protein interaction sites using support vector machines
    Minakuchi, Y
    Satou, K
    Konagaya, A
    [J]. METMBS'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2003, : 22 - 28
  • [25] Prediction of protein-protein interaction sites using support vector machines
    Koike, A
    Takagi, T
    [J]. PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (02): : 165 - 173
  • [26] Feature extraction and wall motion classification of 2D stress echocardiography with support vector machines
    Chykeyuk, Kiryl
    Clifton, David A.
    Noble, J. Alison
    [J]. MEDICAL IMAGING 2011: COMPUTER-AIDED DIAGNOSIS, 2011, 7963
  • [27] Prediction of protein domains from sequence information using support vector machines
    Zou, Shuxue
    Huang, Yanxin
    Wang, Yan
    Zhou, Chunguang
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 3, PROCEEDINGS, 2006, 3973 : 674 - 681
  • [28] g-MARS: Protein Classification Using Gapped Markov Chains and Support Vector Machines
    Ji, Xiaonan
    Bailey, James
    Ramamohanarao, Kotagiri
    [J]. PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2008, 5265 : 165 - 177
  • [29] Text Classification Using Combined Sparse Representation Classifiers and Support Vector Machines
    Sharma, Neeraj
    Sharma, Anshu
    Thenkanidiyoor, Veena
    Dileep, A. D.
    [J]. 2016 4TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2016, : 181 - 185
  • [30] Graph segmentation and support vector machines for bare earth classification from LiDAR
    Shorter, Nicholas S.
    Smith, O'Neil
    Smith, Philip
    Rahmes, Mark
    [J]. LASER RADAR TECHNOLOGY AND APPLICATIONS XIX; AND ATMOSPHERIC PROPAGATION XI, 2014, 9080