A new hybrid coding for protein secondary structure prediction based on primary structure similarity

被引:16
|
作者
Li, Zhong [1 ]
Wang, Jing [1 ]
Zhang, Shunpu [2 ]
Zhang, Qifeng [1 ]
Wu, Wuming [1 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 30018, Zhejiang, Peoples R China
[2] Univ Cent Florida, Dept Stat, Orlando, FL 32816 USA
基金
中国国家自然科学基金;
关键词
Hybrid code; Protein secondary structure prediction; Protein primary structure; Support vector machine; GRAPHICAL REPRESENTATION;
D O I
10.1016/j.gene.2017.03.011
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The coding pattern of protein can greatly affect the prediction accuracy of protein secondary structure. In this paper, a novel hybrid coding method based on the physicochemical properties of amino acids and tendency factors is proposed for the prediction of protein secondary structure. The principal component analysis (PCA) is first applied to the physicochemical properties of amino acids to construct a 3-bit-code, and then the 3 tendency factors of amino acids are calculated to generate another 3-bit-code. Two 3-bit-codes are fused to form a novel hybrid 6-bit-code. Furthermore, we make a geometry-based similarity comparison of the protein primary structure between the reference set and the test set before the secondary structure prediction. We finally use the support vector machine (SVM) to predict those amino acids which are not detected by the primary structure similarity comparison. Experimental results show that our method achieves a satisfactory improvement in accuracy in the prediction of protein secondary structure. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:8 / 13
页数:6
相关论文
共 50 条
  • [21] A Hybrid Method for Prediction of Protein Secondary Structure Based on Multiple Artificial Neural Networks
    Hasic, Haris
    Buza, Emir
    Akagic, Amila
    2017 40TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2017, : 1195 - 1200
  • [22] HYPROSP: a hybrid protein secondary structure prediction algorithm - a knowledge-based approach
    Wu, KP
    Lin, HN
    Chang, JM
    Sung, TY
    Hsu, WL
    NUCLEIC ACIDS RESEARCH, 2004, 32 (17) : 5059 - 5065
  • [23] PREDICTION OF PROTEIN SECONDARY STRUCTURE
    MRAZEK, J
    KYPR, J
    CHEMICKE LISTY, 1991, 85 (12): : 1203 - 1218
  • [24] PREDICTION OF PROTEIN SECONDARY STRUCTURE
    CHOU, PY
    FASMAN, GD
    BIOPHYSICAL JOURNAL, 1977, 17 (02) : A53 - A53
  • [25] PROTEIN SECONDARY STRUCTURE PREDICTION
    BARTON, GJ
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 1995, 5 (03) : 372 - 376
  • [26] PESPAD: a new tool for protein secondary structure prediction based on decision trees
    Mazo, Claudia X.
    Bedoya, Oscar F.
    INGENIERIA Y COMPETITIVIDAD, 2010, 12 (02): : 9 - 22
  • [27] INVARIANT FEATURES OF GLOBIN PRIMARY STRUCTURE AND CODING OF THEIR SECONDARY STRUCTURE
    PTITSYN, OB
    JOURNAL OF MOLECULAR BIOLOGY, 1974, 88 (02) : 287 - 300
  • [28] The prediction of protein secondary structure based on auto encoder
    Zhang Shuai-yan
    Liu Yi-hui
    Cheng Jin-yong
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 2065 - 2069
  • [29] Protein Secondary Structure Prediction Based on Deep Learning
    Zheng, Lin
    Li, Hong-ling
    Wu, Nan
    Ao, Li
    3RD INTERNATIONAL SYMPOSIUM ON MECHATRONICS AND INDUSTRIAL INFORMATICS, (ISMII 2017), 2017, : 171 - 177
  • [30] Protein Secondary Structure Prediction Based on Statistical Dictionaries
    Yang, Wei
    Wang, Kuan-Quan
    Zuo, Wang-Meng
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 780 - 783