Prediction of Protein Secondary Structure Using Feature Selection and Analysis Approach

被引:14
|
作者
Feng, Yonge [1 ]
Lin, Hao [2 ]
Luo, Liaofu [3 ]
机构
[1] Inner Mongolia Agr Univ, Coll Sci, Hohhot 010018, Peoples R China
[2] Univ Elect Sci & Technol China, Key Lab NeuroInformat, Minist Educ, Ctr Bioinformat,Sch Life Sci & Technol, Chengdu 610054, Peoples R China
[3] Inner Mongolia Univ, Sch Phys Sci & Technol, Hohhot 010021, Peoples R China
关键词
Protein secondary structure prediction; Confidence level; Tetrapeptide structural words; Increment of diversity; Quadratic discriminant analysis; Error allowed scope; ACCESSIBLE SURFACE-AREA; MULTI-LABEL CLASSIFIER; AMINO-ACID-COMPOSITION; SUBCELLULAR-LOCALIZATION; FOLD RECOGNITION; GENERAL-FORM; BETA-SHEET; SINGLE; OPTIMIZATION; INFORMATION;
D O I
10.1007/s10441-013-9203-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The prediction of the secondary structure of a protein from its amino acid sequence is an important step towards the prediction of its three-dimensional structure. However, the accuracy of ab initio secondary structure prediction from sequence is about 80 % currently, which is still far from satisfactory. In this study, we proposed a novel method that uses binomial distribution to optimize tetrapeptide structural words and increment of diversity with quadratic discriminant to perform prediction for protein three-state secondary structure. A benchmark dataset including 2,640 proteins with sequence identity of less than 25 % was used to train and test the proposed method. The results indicate that overall accuracy of 87.8 % was achieved in secondary structure prediction by using ten-fold cross-validation. Moreover, the accuracy of predicted secondary structures ranges from 84 to 89 % at the level of residue. These results suggest that the feature selection technique can detect the optimized tetrapeptide structural words which affect the accuracy of predicted secondary structures.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [1] Prediction of Protein Secondary Structure Using Feature Selection and Analysis Approach
    Yonge Feng
    Hao Lin
    Liaofu Luo
    [J]. Acta Biotheoretica, 2014, 62 : 1 - 14
  • [2] Prediction of secondary protein structure content from primary sequence alone - A feature selection based approach
    Kurgan, L
    Homaeian, L
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2005, 3587 : 334 - 345
  • [3] Protein Secondary Structure Prediction Using Cascaded Feature Learning Model
    Geethu, S.
    Vimina, E. R.
    [J]. APPLIED SOFT COMPUTING, 2023, 140
  • [4] Prediction of Protein Quaternary Structure with Feature Selection and Analysis Based on Protein Biological Features
    Hu, Le-Le
    Feng, Kai-Yan
    Gu, Lei
    Liu, Xiao-Jun
    [J]. PROTEIN AND PEPTIDE LETTERS, 2012, 19 (01): : 23 - 28
  • [5] Protein secondary structure prediction with ICA feature extraction
    Melo, JCB
    Cavalcanti, GDC
    Guimaraes, KS
    [J]. 2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03, 2003, : 13 - 22
  • [6] Feature selection algorithm based on graph theory and random forests for protein secondary structure prediction
    Altun, Gulsah
    Hu, Hae-Jin
    Gremalschi, Stefan
    Harrison, Robert W.
    Pan, Yi
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4463 : 590 - +
  • [7] PROTEIN SECONDARY STRUCTURE - ANALYSIS AND PREDICTION
    HIDER, RC
    HODGES, SJ
    [J]. BIOCHEMICAL EDUCATION, 1984, 12 (01): : 9 - 18
  • [8] Combined approach to protein secondary structure prediction
    Amirova, S. R.
    Machavariani, M. A.
    Filatov, L., V
    Milchevsky, Ju. V.
    Esipova, N. G.
    Tumanyan, V. G.
    [J]. Proceedings of the Fourth International Conference on Bioinformatics of Genome Regulation and Structure, Vol 1, 2004, : 231 - 234
  • [9] PROTEIN SECONDARY STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINES AND A NEW FEATURE REPRESENTATION
    Gubbi, Jayavardhana
    Lai, Daniel T. H.
    Palaniswami, Marimuthu
    Parker, Michael
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2006, 6 (04) : 551 - 567
  • [10] Prediction of protein amidation sites by feature selection and analysis
    Weiren Cui
    Shen Niu
    Lulu Zheng
    Lele Hu
    Tao Huang
    Lei Gu
    Kaiyan Feng
    Ning Zhang
    Yudong Cai
    Yixue Li
    [J]. Molecular Genetics and Genomics, 2013, 288 : 391 - 400