A new hybrid coding for protein secondary structure prediction based on primary structure similarity

被引:16
|
作者
Li, Zhong [1 ]
Wang, Jing [1 ]
Zhang, Shunpu [2 ]
Zhang, Qifeng [1 ]
Wu, Wuming [1 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 30018, Zhejiang, Peoples R China
[2] Univ Cent Florida, Dept Stat, Orlando, FL 32816 USA
基金
中国国家自然科学基金;
关键词
Hybrid code; Protein secondary structure prediction; Protein primary structure; Support vector machine; GRAPHICAL REPRESENTATION;
D O I
10.1016/j.gene.2017.03.011
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The coding pattern of protein can greatly affect the prediction accuracy of protein secondary structure. In this paper, a novel hybrid coding method based on the physicochemical properties of amino acids and tendency factors is proposed for the prediction of protein secondary structure. The principal component analysis (PCA) is first applied to the physicochemical properties of amino acids to construct a 3-bit-code, and then the 3 tendency factors of amino acids are calculated to generate another 3-bit-code. Two 3-bit-codes are fused to form a novel hybrid 6-bit-code. Furthermore, we make a geometry-based similarity comparison of the protein primary structure between the reference set and the test set before the secondary structure prediction. We finally use the support vector machine (SVM) to predict those amino acids which are not detected by the primary structure similarity comparison. Experimental results show that our method achieves a satisfactory improvement in accuracy in the prediction of protein secondary structure. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:8 / 13
页数:6
相关论文
共 50 条
  • [31] Protein secondary structure prediction based on Ramachandran maps
    Chen, Yen-Ru
    Peng, Sheng-Lung
    Tsay, Yu-Wei
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, PROCEEDINGS: WITH ASPECTS OF THEORETICAL AND METHODOLOGICAL ISSUES, 2008, 5226 : 204 - +
  • [32] A novel hybrid GMM/SVM architecture for protein secondary structure prediction
    Samani, Emad Bahrami
    Homayounpour, M. Mehdi
    Gu, Hong
    APPLICATIONS OF FUZZY SETS THEORY, 2007, 4578 : 491 - +
  • [33] Association classification algorithm based on structure sequence in protein secondary structure prediction
    Zhou, Zhun
    Yang, Bingru
    Hou, Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (09) : 6381 - 6389
  • [34] HYPROSP II - A knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence
    Lin, HN
    Chang, JM
    Wu, KP
    Sung, TY
    Hsu, WL
    BIOINFORMATICS, 2005, 21 (15) : 3227 - 3233
  • [35] Porter: a new, accurate server for protein secondary structure prediction
    Pollastri, G
    McLysaght, A
    BIOINFORMATICS, 2005, 21 (08) : 1719 - 1720
  • [36] Accurate Prediction of Docked Protein Structure Similarity
    Akbal-Delibas, Bahar
    Pomplun, Marc
    Haspel, Nurit
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (09) : 892 - 904
  • [37] PROTEIN SECONDARY STRUCTURE - ANALYSIS AND PREDICTION
    HIDER, RC
    HODGES, SJ
    BIOCHEMICAL EDUCATION, 1984, 12 (01): : 9 - 18
  • [38] Parallelized protein secondary structure prediction
    Qi, YT
    Lin, F
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2074 - 2077
  • [39] SECONDARY STRUCTURE PREDICTION AND PROTEIN DESIGN
    GARNIER, J
    LEVIN, JM
    GIBRAT, JF
    BIOU, V
    BIOCHEMICAL SOCIETY SYMPOSIA, 1990, (57) : 11 - 24
  • [40] Prediction of protein secondary structure content
    Liu, WM
    Chou, KC
    PROTEIN ENGINEERING, 1999, 12 (12): : 1041 - 1050