SMpred: A Support Vector Machine Approach to Identify Structural Motifs in Protein Structure Without Using Evolutionary Information

被引:4
|
作者
Pugalenthi, Ganesan [3 ]
Kandaswamy, Krishna Kumar [4 ,5 ]
Suganthan, P. N. [1 ]
Sowdhamini, R. [2 ]
Martinetz, Thomas [4 ]
Kolatkar, Prasanna R. [3 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Univ Agr Sci Bangalore, Natl Ctr Biol Sci, Bangalore 560065, Karnataka, India
[3] Genome Inst Singapore, Lab Struct Biochem, Singapore 138672, Singapore
[4] Univ Lubeck, Inst Neuro & Bioinformat, D-23538 Lubeck, Germany
[5] Univ Lubeck, Grad Sch Comp Med & Life Sci, D-23538 Lubeck, Germany
来源
关键词
Protein folding; Structural motifs; Support vector machine; Fingerprint; Protein function; FUNCTIONAL DOMAIN COMPOSITION; MYCOBACTERIUM-TUBERCULOSIS; SEQUENCE MOTIFS; CLEAVAGE SITES; RNA-POLYMERASE; PREDICTION; SUPERFAMILIES; DATABASE; CLASSIFICATION; IDENTIFICATION;
D O I
10.1080/07391102.2010.10507369
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Knowledge of three dimensional structure is essential to understand the function of a protein. Although the overall fold is made from the whole details of its sequence, a small group of residues, often called as structural motifs, play a crucial role in determining the protein fold and its stability. Identification of such structural motifs requires sufficient number of sequence and structural homologs to define conservation and evolutionary information. Unfortunately, there are many structures in the protein structure databases have no homologous structures or sequences. In this work, we report an SVM method, SMpred, to identify structural motifs from single protein structure without using sequence and structural homologs. SMpred method was trained and tested using 132 proteins domains containing 581 motifs. SMpred method achieved 78.79% accuracy with 79.06% sensitivity and 78.53% specificity. The performance of SMpred was evaluated with MegaMotifBase using 188 proteins containing 1161 motifs. Out of 1161 motifs, SMpred correctly identified 1503 structural motifs reported in MegaMotifBase. Further, we showed that SMpred is useful approach for the length deviant superfamilies and single member superfamilies. This result suggests the usefulness of our approach for facilitating the identification of structural motifs in protein structure in the absence of sequence and structural homologs. The dataset and executable for the SMpred algorithm is available at http://www3.ntu.edu.sg/home/EPNSugan/index_files/SMpred.htm.
引用
收藏
页码:405 / 414
页数:10
相关论文
共 50 条
  • [41] Stochastic gradient support vector machine with local structural information for pattern recognition
    Liu, Liming
    Li, Ping
    Chu, Maoxiang
    Cai, Hongbin
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (08) : 2237 - 2254
  • [42] Prediction of Protein Residue Contact Using Support Vector Machine
    Chan, Weng Howe
    Mohamad, Mohd Saberi
    [J]. KNOWLEDGE TECHNOLOGY, 2012, 295 : 323 - 332
  • [43] β-Barrel Transmembrane Protein Predicting Using Support Vector Machine
    Chen, Cheng
    Wu, Hongjie
    Bian, Kaihui
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 360 - 368
  • [44] Classify a Protein Domain using Sigmoid Support Vector Machine
    Hassan, Umi Kalsum
    Nawi, Nazri Mohd.
    Kasim, Shahreen
    [J]. 2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA), 2014,
  • [45] Selecting bankruptcy predictors using a Support Vector Machine approach
    Fan, A
    Palaniswami, M
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL VI, 2000, : 354 - 359
  • [46] Detecting the number of clusters using a support vector machine approach
    Moguerza, JM
    Muñoz, A
    Martín-Merino, M
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 763 - 768
  • [47] Project success prediction using an evolutionary support vector machine inference model
    Cheng, Min-Yuan
    Wu, Yu-Wei
    Wu, Ching-Fang
    [J]. AUTOMATION IN CONSTRUCTION, 2010, 19 (03) : 302 - 307
  • [48] Document Class Recognition Using A Support Vector Machine Approach
    Mtimet, Jassem
    Amiri, Hamid
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 161 - 166
  • [49] Evolutionary fuzzy decision model for construction management using support vector machine
    Cheng, Min-Yuan
    Roy, Andreas F. V.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (08) : 6061 - 6069
  • [50] A multi-objective evolutionary approach to training set selection for support vector machine
    Acampora, Giovanni
    Herrera, Francisco
    Tortora, Genoveffa
    Vitiello, Autilia
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 147 : 94 - 108