Prediction of protein structural classes using support vector machines

Cited by: 133
Authors
Sun, X.-D. [1]
Huang, R.-B. [1]
Affiliations
[1] Guangxi Univ, Coll Life Sci & Biotechnol, Nanning 530004, Guangxi, Peoples R China
Keywords
support vector machines; CATH; multi-class; protein structural class prediction; jackknifing
DOI
10.1007/s00726-005-0239-0
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The support vector machine, a machine-learning method, is used to predict four structural classes, i.e. mainly alpha, mainly beta, alpha-beta and fss (few secondary structures), using data from the topology level of the CATH protein structure database. In binary classification, any two structural classes that do not share secondary structure elements, such as alpha and beta elements, can be classified with an accuracy as high as 90%. The accuracy, however, decreases to less than 70% if the structural classes to be classified have structure elements in common. Our study also shows that feature spaces of dimension 20^2 = 400 (dipeptide) and 20^3 = 8000 (tripeptide) give nearly the same prediction accuracy. Across these four structural classes, multi-class classification gives an overall accuracy of about 52%, indicating that the multi-class classification technique for support vector machines may still need to be improved in future investigations.
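The feature-space dimensions quoted in the abstract follow from counting every possible k-residue window over the 20 standard amino acids. The abstract does not specify the exact encoding, so the following is a minimal sketch under stated assumptions: overlapping windows, counts normalized to frequencies, and windows containing non-standard residues skipped. The function name `kmer_composition` and the example sequence are hypothetical.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def kmer_composition(sequence, k=2):
    """Return the normalized k-peptide frequency vector of a sequence.

    Assumption: overlapping windows, frequencies summing to 1. The
    encoding used in the paper may differ.
    """
    kmers = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    index = {kmer: i for i, kmer in enumerate(kmers)}
    counts = [0] * len(kmers)
    total = 0
    for i in range(len(sequence) - k + 1):
        pos = index.get(sequence[i:i + k])
        if pos is not None:  # skip windows with non-standard residues
            counts[pos] += 1
            total += 1
    if total == 0:
        return [0.0] * len(kmers)
    return [c / total for c in counts]


# Hypothetical toy sequence, only to show the vector lengths.
dipeptide = kmer_composition("ACDAC", k=2)
tripeptide = kmer_composition("ACDAC", k=3)
print(len(dipeptide), len(tripeptide))  # 400 8000
```

Vectors of this kind could then be passed to any SVM implementation. The tripeptide vector is 20 times longer than the dipeptide one, which is relevant given that the abstract reports nearly the same accuracy for both.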
Pages: 469-475
Number of pages: 7
Related papers
50 in total
  • [41] PIPELINE DEFECT PREDICTION USING SUPPORT VECTOR MACHINES
    Isa, Dino
    Rajkumar, Rajprasad
    APPLIED ARTIFICIAL INTELLIGENCE, 2009, 23 (08) : 758 - 771
  • [42] Hydrocarbon reservoir prediction using support vector machines
    Yao, KF
    Lu, WK
    Zhang, SW
    Xiao, HQ
    Li, YD
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 537 - 542
  • [43] Diabetes Diagnostic Prediction Using Vector Support Machines
    Viloria, Amelec
    Herazo-Beltran, Yaneth
    Cabrera, Danelys
    Pineda, Omar Bonerge
    11TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT) / THE 3RD INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40) / AFFILIATED WORKSHOPS, 2020, 170 : 376 - 381
  • [44] Prediction of contact maps using support vector machines
    Zhao, Y
    Karypis, G
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2005, 14 (05) : 849 - 865
  • [45] Prediction of arterial hypertension using support vector machines
    Seijas, Cesar
    Caralli, Antonino
    Villasana, Sergio
    Saenz, Laura
    Arteaga, Francisco
    INGENIERIA UC, 2006, 13 (03) : 13 - 18
  • [46] Prediction of repeat visitation using support vector machines
    Department of Logistics Engineering and Management, National Taichung Institute of Technology, No. 129, Sec. 3, Sanmin Rd., Taichung 404, Taiwan
    Unknown
    WSEAS Trans. Inf. Sci. Appl., 2007, 2 (369-376):
  • [47] Clustering support vector machines for protein local structure prediction
    Zhong, Wei
    He, Jieyue
    Harrison, Robert
    Tai, Phang C.
    Pan, Yi
    EXPERT SYSTEMS WITH APPLICATIONS, 2007, 32 (02) : 518 - 526
  • [48] Prediction of backbone dihedral angles and protein secondary structure using support vector machines
    Kountouris, Petros
    Hirst, Jonathan D.
    BMC BIOINFORMATICS, 2009, 10
  • [49] Protein Secondary Structure Prediction Using Support Vector Machines and a Codon Encoding Scheme
    Zamani, Masood
    Kremer, Stefan C.
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [50] PROTEIN SECONDARY STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINES AND A NEW FEATURE REPRESENTATION
    Gubbi, Jayavardhana
    Lai, Daniel T. H.
    Palaniswami, Marimuthu
    Parker, Michael
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2006, 6 (04) : 551 - 567