Identify Protein 8-Class Secondary Structure with Quadratic Discriminant Algorithm based on the Feature Combination

被引:7
|
作者
Zhao Wei [1 ]
Feng Yonge [1 ]
机构
[1] Inner Mongolia Agr Univ, Coll Sci, Hohhot 010018, Peoples R China
基金
美国国家科学基金会;
关键词
Chemical shifts; quadratic discriminant analysis; protein 8-class secondary structure; measure of diversity; hydrophobic; hydrophilic; CHEMICAL-SHIFTS; MEMBRANE-PROTEINS; GENERAL-FORM; PREDICTION; IDENTIFICATION; CLASSIFICATION; SEQUENCES;
D O I
10.2174/1570178614666170609105326
中图分类号
O62 [有机化学];
学科分类号
070303 ; 081704 ;
摘要
Background: The research of protein structure is one of the most important subjects in the 21 st century. However, the prediction of protein secondary structure is a key step in the prediction of protein three-dimensional structure. Protein eight-class secondary structure (SS) prediction has gained less attention and the implementation of three-class secondary structure (SS) prediction has been done in the past. Method: We introduced a model for the prediction of protein eight-class secondary structure using quadratic discriminant algorithm (QDA) based on the feature combination. We combined chemical shifts with the measure of diversity as features. The measure of diversity is based on the hydrophilic-hydrophobic residues and their dipeptides respectively. Firstly, we extracted the chemical shifts in protein as features. Then, we implemented the eight-class secondary structures prediction using these chemical shifts as features. In order to improve the accuracy, we constructed the measure of diversity based on the hydrophilic-hydrophobic residue. Finally, we combined chemical shifts with the measure of diversity to predict protein eight-class secondary structures. Results: We achieved the best accuracy of eight-class secondary structures (Q(8)) 80.7% in seven-fold cross-validation combining chemical shifts with the measure of diversity. In the same data set, we performed the prediction by C8-Scorpion sever, support vector machine (SVM) and random forest (RF) and the results showed that our prediction model is superior to other algorithms in terms of accuracy. Conclusion: The finding suggested that our model is an effective model for the prediction of protein eight-class secondary structures.
引用
收藏
页码:625 / 631
页数:7
相关论文
共 50 条
  • [42] Two-Stage Distance Feature-based Optimization Algorithm for De novo Protein Structure Prediction
    Zhang, Gui-Jun
    Wang, Xiao-Qi
    Ma, Lai-Fa
    Wang, Liu-Jing
    Hu, Jun
    Zhou, Xiao-Gen
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (06) : 2119 - 2130
  • [43] Prediction of protein secondary structure based on residue pair types and conformational states using dynamic programming algorithm
    Sadeghi, M
    Parto, S
    Arab, S
    Ranjbar, B
    [J]. FEBS LETTERS, 2005, 579 (16) : 3397 - 3400
  • [44] Low-homology protein structural class prediction from secondary structure based on visibility and horizontal visibility network
    Zhao Z.-Q.
    Luo L.
    Liu X.-Y.
    [J]. Zhao, Zhi-Qin (zhiqin_zhao2014@163.com), 2018, Science Publications (14): : 67 - 75
  • [45] High-accuracy prediction of protein structural class for low-similarity sequences based on predicted secondary structure
    Zhang, Shengli
    Ding, Shuyan
    Wang, Tianming
    [J]. BIOCHIMIE, 2011, 93 (04) : 710 - 714
  • [46] Protein Structural Class Prediction Based on Distance-related Statistical Features from Graphical Representation of Predicted Secondary Structure
    Kong, Liang
    Zhang, Lichao
    Han, Xiaodong
    Lv, Jinfeng
    [J]. LETTERS IN ORGANIC CHEMISTRY, 2019, 16 (04) : 317 - 324
  • [47] Template-based C8-SCORPION: a protein 8-state secondary structure prediction method using structural information and context-based features
    Yaseen, Ashraf
    Li, Yaohang
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [48] Template-based C8-SCORPION: a protein 8-state secondary structure prediction method using structural information and context-based features
    Ashraf Yaseen
    Yaohang Li
    [J]. BMC Bioinformatics, 15
  • [49] A new protein folding algorithm based on hydrophobic compactness: Rigid unconnected secondary structure iterative assembly (RUSSIA). I: Methodology
    Znamenskiy, D
    Chomilier, J
    Le Tuan, K
    Mornon, JP
    [J]. PROTEIN ENGINEERING, 2003, 16 (12): : 925 - 935
  • [50] Protein secondary structure optimization using an improved artificial bee colony algorithm based on AB off-lattice model
    Li, Bai
    Li, Ya
    Gong, Ligang
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 27 : 70 - 79