An Integrated-OFFT Model for the Prediction of Protein Secondary Structure Class

Cited by: 10
Authors
Panda, Bishnupriya [1]
Majhi, Babita [2]
Thakur, Abhimanyu [3]
Affiliations
[1] Siksha O Anusandhan Univ, Dept Comp Sci & Engn, Inst Tech Educ & Res, Bhubaneswar, Orissa, India
[2] Guru Ghashidas Vishwavidyalaya, Dept Comp Sci & Informat Technol, Bilaspur, Chhattisgarh, India
[3] Birla Inst Technol Mesra, Dept Pharmaceut Sci & Technol, Ranchi, Bihar, India
Keywords
Protein; secondary structure prediction class; Gaussian noise; computational biology; bioinformatics; SVM; AMINO-ACID-COMPOSITION; REPRESENTATION
DOI
10.2174/1573409914666180828105228
Chinese Library Classification (CLC) number
R914 [Medicinal Chemistry]
Discipline classification code
100701
Abstract
Background: Proteins are among the most versatile macromolecules and play a crucial role in many biological processes. The arrangement of amino acids in a sequence has long been used to predict protein secondary structure. However, the major methods for predicting protein secondary structure class have not so far considered the impact of Gaussian noise on the sequence representation of amino acids, even though secondary structure is one of the important constraints on the functionality of a protein.
Methods: In the present research, protein secondary structure class was predicted by applying the Stockwell transformation together with Amino Acid Composition (AAC) to the equivalent Electron-ion Interaction Potential (EIIP) representation of the raw amino acid sequence. The method was evaluated on 4 benchmark datasets of low sequence homology, namely PDB25, 498, 277, and 204. A random forest classifier with the out-of-bag error estimate and a Support Vector Machine (SVM) with k-fold cross-validation both demonstrated the high feature-representation potential of the reported approach.
Results: The overall prediction accuracy for the PDB25, 498, 277, and 204 datasets was 92.5%, 94.79%, 92.45%, and 88.04% respectively with the random forest classifier, and 84.66%, 95.32%, 89.29%, and 84.37% respectively with SVM.
Conclusion: An integrated order-function-frequency-time (OFFT) model has been proposed for the prediction of protein secondary structure class. This work reports, for the first time, the effect of Gaussian noise on the prediction accuracy of protein secondary structure class and proposes a robust integrated-OFFT model that is effectively noise resistant.
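The feature steps named in the Methods can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the noise level `sigma`, and the seed are hypothetical, and the EIIP table holds the commonly tabulated values, which should be checked against the paper before use. The sketch covers only the EIIP encoding, the AAC vector, and the addition of Gaussian noise; the Stockwell transformation and the classifiers are left out.

```python
import random
from collections import Counter

# The 20 standard amino acids, in a fixed order for the AAC vector.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Commonly tabulated EIIP values (assumption: verify against the paper).
EIIP = {
    "A": 0.0373, "R": 0.0959, "N": 0.0036, "D": 0.1263, "C": 0.0829,
    "Q": 0.0761, "E": 0.0058, "G": 0.0050, "H": 0.0242, "I": 0.0000,
    "L": 0.0000, "K": 0.0371, "M": 0.0823, "F": 0.0946, "P": 0.0198,
    "S": 0.0829, "T": 0.0941, "W": 0.0548, "Y": 0.0516, "V": 0.0057,
}


def eiip_encode(sequence):
    """Map each residue to its EIIP value, giving a numeric signal."""
    return [EIIP[residue] for residue in sequence.upper()]


def amino_acid_composition(sequence):
    """Return the 20-dimensional vector of residue frequencies."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    counts = Counter(sequence.upper())
    return [counts[aa] / len(sequence) for aa in AMINO_ACIDS]


def add_gaussian_noise(signal, sigma, seed=0):
    """Add zero-mean Gaussian noise with standard deviation sigma."""
    rng = random.Random(seed)  # seeded so the example is reproducible
    return [x + rng.gauss(0.0, sigma) for x in signal]


if __name__ == "__main__":
    seq = "MKTAYIAKQR"  # made-up sequence, for illustration only
    signal = eiip_encode(seq)
    noisy = add_gaussian_noise(signal, sigma=0.01)
    print(len(signal), len(noisy), len(amino_acid_composition(seq)))
```

In a full pipeline, the EIIP signal (clean or noisy) would be passed through a Stockwell transform and the resulting features combined with the AAC vector before classification.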
Pages: 45-54
Page count: 10