Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme

被引:43
|
作者
Chen, Kuan-Hsi [1 ]
Wang, Tsai-Feng [2 ]
Hu, Yuh-Jyh [3 ]
机构
[1] Natl Chiao Tung Univ, Coll Comp Sci, Hsinchu 300, Taiwan
[2] Natl Chiao Tung Univ, Inst Data Sci & Engn, Hsinchu 300, Taiwan
[3] Natl Chiao Tung Univ, Inst Biomed Engn, Coll Comp Sci, Hsinchu 300, Taiwan
关键词
Protein-protein interaction; Stacked generalization; Gene ontology; Network topology; SEMANTIC SIMILARITY MEASURES; GENE ONTOLOGY; SEQUENCES; SCALE; TOOL; RESIDUES; CELL;
D O I
10.1186/s12859-019-2907-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundAlthough various machine learning-based predictors have been developed for estimating protein-protein interactions, their performances vary with dataset and species, and are affected by two primary aspects: choice of learning algorithm, and the representation of protein pairs. To improve the performance of predicting protein-protein interactions, we exploit the synergy of multiple learning algorithms, and utilize the expressiveness of different protein-pair features.ResultsWe developed a stacked generalization scheme that integrates five learning algorithms. We also designed three types of protein-pair features based on the physicochemical properties of amino acids, gene ontology annotations, and interaction network topologies. When tested on 19 published datasets collected from eight species, the proposed approach achieved a significantly higher or comparable overall performance, compared with seven competitive predictors.ConclusionWe introduced an ensemble learning approach for PPI prediction that integrated multiple learning algorithms and different protein-pair representations. The extensive comparisons with other state-of-the-art prediction tools demonstrated the feasibility and superiority of the proposed method.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A general protein-protein interaction extraction architecture based on word representation and feature selection
    Jiang, Zhenchao
    Li, Lishuang
    Huang, Degen
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 14 (03) : 276 - 291
  • [22] Human protein-protein interaction prediction
    Mark D McDowall
    Michelle S Scott
    Geoffrey J Barton
    BMC Bioinformatics, 11 (Suppl 10)
  • [23] Protein-protein interaction and site prediction using transfer learning
    Liu, Tuoyu
    Gao, Han
    Ren, Xiaopu
    Xu, Guoshun
    Liu, Bo
    Wu, Ningfeng
    Luo, Huiying
    Wang, Yuan
    Tu, Tao
    Yao, Bin
    Guan, Feifei
    Teng, Yue
    Huang, Huoqing
    Tian, Jian
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [24] Prediction of protein-protein interaction using graph neural networks
    Jha, Kanchan
    Saha, Sriparna
    Singh, Hiteshi
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [25] Protein-Protein Interaction Prediction Using Single Class SVM
    Lei, Hairong
    Kniss, Joe Michael
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 883 - +
  • [26] Prediction of Protein-Protein Interaction Relevance of Articles Using References
    Calli, Cagatay
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 189 - 192
  • [27] Prediction of protein-protein interaction sites using an ensemble method
    Lei Deng
    Jihong Guan
    Qiwen Dong
    Shuigeng Zhou
    BMC Bioinformatics, 10
  • [28] Prediction of protein-protein interaction sites using patch analysis
    Jones, S
    Thornton, JM
    JOURNAL OF MOLECULAR BIOLOGY, 1997, 272 (01) : 133 - 143
  • [29] Prediction of protein-protein interaction sites using an ensemble method
    Deng, Lei
    Guan, Jihong
    Dong, Qiwen
    Zhou, Shuigeng
    BMC BIOINFORMATICS, 2009, 10
  • [30] Protein-protein Interaction Prediction using Arabic Semantic Analysis
    Zaki, Nazar M.
    Alawar, Kalthoom A.
    Al Dhaheri, Amna A.
    Harous, Saad
    2013 9TH INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY (IIT), 2013,