Improving protein fold recognition using the amalgamation of evolutionary-based and structural based information

被引:0
|
作者
Kuldip K Paliwal
Alok Sharma
James Lyons
Abdollah Dehzangi
机构
[1] Griffith University,School of Engineering
[2] University of the South Pacific,School of Engineering and Physics
[3] Institute for Integrated and Intelligent Systems (IIIS),undefined
[4] National ICT Australia (NICTA),undefined
来源
关键词
Support Vector Machine; Support Vector Machine Classifier; Feature Extraction Method; Fold Recognition; Position Specific Score Matrix;
D O I
暂无
中图分类号
学科分类号
摘要
Deciphering three dimensional structure of a protein sequence is a challenging task in biological science. Protein fold recognition and protein secondary structure prediction are transitional steps in identifying the three dimensional structure of a protein. For protein fold recognition, evolutionary-based information of amino acid sequences from the position specific scoring matrix (PSSM) has been recently applied with improved results. On the other hand, the SPINE-X predictor has been developed and applied for protein secondary structure prediction. Several reported methods for protein fold recognition have only limited accuracy. In this paper, we have developed a strategy of combining evolutionary-based information (from PSSM) and predicted secondary structure using SPINE-X to improve protein fold recognition. The strategy is based on finding the probabilities of amino acid pairs (AAP). The proposed method has been tested on several protein benchmark datasets and an improvement of 8.9% recognition accuracy has been achieved. We have achieved, for the first time over 90% and 75% prediction accuracies for sequence similarity values below 40% and 25%, respectively. We also obtain 90.6% and 77.0% prediction accuracies, respectively, for the Extended Ding and Dubchak and Taguchi and Gromiha benchmark protein fold recognition datasets widely used for in the literature.
引用
收藏
相关论文
共 50 条
  • [1] Improving protein fold recognition using the amalgamation of evolutionary-based and structural based information
    Paliwal, Kuldip K.
    Sharma, Alok
    Lyons, James
    Dehzangi, Abdollah
    BMC BIOINFORMATICS, 2014, 15
  • [2] A mixture of physicochemical and evolutionary-based feature extraction approaches for protein fold recognition
    Dehzangi, Abdollah
    Sharma, Alok
    Lyons, James
    Paliwal, Kuldip K.
    Sattar, Abdul
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 11 (01) : 115 - 138
  • [3] Structural protein fold recognition based on secondary structure and evolutionary information using machine learning algorithms
    Qin, Xinyi
    Liu, Min
    Zhang, Lu
    Liu, Guangzhong
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2021, 91
  • [4] A Segmentation-Based Method to Extract Structural and Evolutionary Features for Protein Fold Recognition
    Dehzangi, Abdollah
    Paliwal, Kuldip
    Lyons, James
    Sharma, Alok
    Sattar, Abdul
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (03) : 510 - 519
  • [5] Improving the performance of evolutionary-based complex detection models in protein–protein interaction networks
    Bara’a A. Attea
    Qusay Z. Abdullah
    Soft Computing, 2018, 22 : 3721 - 3744
  • [6] Improving the performance of evolutionary-based complex detection models in protein-protein interaction networks
    Attea, Bara'a A.
    Abdullah, Qusay Z.
    SOFT COMPUTING, 2018, 22 (11) : 3721 - 3744
  • [7] Evolutionary-based framework for optimizing the spread of information on Twitter
    Butakov, Nikolay
    Chuprova, Yulia
    Knyazkov, Konstantin
    Shindyapina, Natalya
    Boukhanovsky, Alexander
    4TH INTERNATIONAL YOUNG SCIENTIST CONFERENCE ON COMPUTATIONAL SCIENCE, 2015, 66 : 287 - 296
  • [8] Improving taxonomy-based protein fold recognition by using global and local features
    Yang, Jian-Yi
    Chen, Xin
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (07) : 2053 - 2064
  • [9] A structural pattern-based method for protein fold recognition
    Taylor, WR
    Jonassen, I
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 56 (02) : 222 - 234
  • [10] A novel fusion based on the evolutionary features for protein fold recognition using support vector machines
    Refahi, Mohammad Saleh
    Mir, A.
    Nasiri, Jalal A.
    SCIENTIFIC REPORTS, 2020, 10 (01)