Effect of Various Sequence Descriptors in Predicting Human Protein-protein Interactions Using ANN-based Prediction Models

被引:2
|
作者
Dholaniya, Pankaj Singh [1 ]
Rizvi, Samreen [1 ]
机构
[1] Univ Hyderabad, Sch Life Sci, Dept Biotechnol & Bioinformat, Hyderabad 500046, India
关键词
Protein-protein interactions; machine learning; protein descriptors; artificial neural network; PPI-predictions; hu-man PPI; CONSERVATION; ORDER;
D O I
10.2174/1574893616666210402114623
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Aims: A number of sequence-based descriptors for proteins have been proposed by many researchers. This study aims to evaluate the performance of these descriptors in predicting protein-protein interactions on the benchmark dataset. Background: The behavior of a protein inside or outside the cell is defined by its interaction with the elements present in the surrounding environment, which include small metabolites to the macromolecules such as RNA, DNA, or proteins. Of these, understanding protein-protein interactions (PPIs) is one of the important aspects to investigate the biological role of a protein. The interactions of a protein are determined by how it folds in 3-dimensional space, and this threedimensional folding of a protein largely depends on the linear sequence of amino acids. This information makes it possible to exploit the sequences for proteins to computationally determine the possible interactions among them. Objective: This study aims at studying the efficacy of various sequence-based descriptors in predicting protein-protein interactions. Methods: In this study, we have used the benchmark dataset of interacting and non-interacting protein pairs provided by Pan et al. to build the PPI prediction models using artificial neural networks. We have compared the efficacy of different descriptors on two types of datasets, one with all the protein pairs and the second with proteins having less than 25% identity. Result: The results show that conjoint-triad descriptors performed better than other descriptors in predicting PPIs. The feature selection on the conjoint triad was performed and the effect on the prediction model with reduced features versus all feature sets was studied. Conclusion: The classification model with conjoint-triad descriptors obtained the highest accuracy. The feature ranking for the conjoint triad descriptor was utilized and the model performance was compared with all and selected features. The model with reduced features shows less overfitting.
引用
收藏
页码:1024 / 1033
页数:10
相关论文
共 50 条
  • [1] Prediction of Protein-Protein Interactions from Protein Sequence Using Local Descriptors
    Yang, Lei
    Xia, Jun-Feng
    Gui, Jie
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (09): : 1085 - 1090
  • [2] Prediction of Protein-Protein Interactions Using An Effective Sequence Based Combined Method
    Goktepe, Yunus Emre
    Kodaz, Halife
    NEUROCOMPUTING, 2018, 303 : 68 - 74
  • [3] Efficient prediction of protein-protein interactions using sequence information
    Guarracino, Mario R.
    Nebbia, Adriano
    Manna, Valeria
    Chinchuluun, Altannar
    Pardalos, Panos M.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2010), 2010, : 677 - 682
  • [4] A novel matrix of sequence descriptors for predicting protein-protein interactions from amino acid sequences
    Wang, Xue
    Wu, Yuejin
    Wang, Rujing
    Wei, Yuanyuan
    Gui, Yuanmiao
    PLOS ONE, 2019, 14 (06):
  • [5] Protein-Protein Interactions Prediction Based on Graph Energy and Protein Sequence Information
    Xu, Da
    Xu, Hanxiao
    Zhang, Yusen
    Chen, Wei
    Gao, Rui
    MOLECULES, 2020, 25 (08):
  • [6] Sequence Representations and Their Utility for Predicting Protein-Protein Interactions
    Kimothi, Dhananjay
    Biyani, Pravesh
    Hogan, James M.
    Davis, Melissa J.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 646 - 657
  • [7] Recent developments of sequence-based prediction of protein-protein interactions
    Murakami, Yoichi
    Mizuguchi, Kenji
    BIOPHYSICAL REVIEWS, 2022, 14 (06) : 1393 - 1411
  • [8] Sequence and functional annotations-based prediction of protein-protein interactions
    Ma, Yiwu
    Chen, Juan
    Chen, Haowen
    Li, Bo
    Cai, Lijun
    Journal of Computational and Theoretical Nanoscience, 2015, 12 (11) : 4679 - 4685
  • [9] ANN Based Protein Function Prediction Using Integrated Protein-Protein Interaction Data
    Shi, Lei
    Cho, Young-Rae
    Zhang, Aidong
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 271 - 277
  • [10] Predicting protein-protein interactions through sequence-based deep learning
    Hashemifar, Somaye
    Neyshabur, Behnam
    Khan, Aly A.
    Xu, Jinbo
    BIOINFORMATICS, 2018, 34 (17) : 802 - 810