Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme

被引:43
|
作者
Chen, Kuan-Hsi [1 ]
Wang, Tsai-Feng [2 ]
Hu, Yuh-Jyh [3 ]
机构
[1] Natl Chiao Tung Univ, Coll Comp Sci, Hsinchu 300, Taiwan
[2] Natl Chiao Tung Univ, Inst Data Sci & Engn, Hsinchu 300, Taiwan
[3] Natl Chiao Tung Univ, Inst Biomed Engn, Coll Comp Sci, Hsinchu 300, Taiwan
关键词
Protein-protein interaction; Stacked generalization; Gene ontology; Network topology; SEMANTIC SIMILARITY MEASURES; GENE ONTOLOGY; SEQUENCES; SCALE; TOOL; RESIDUES; CELL;
D O I
10.1186/s12859-019-2907-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundAlthough various machine learning-based predictors have been developed for estimating protein-protein interactions, their performances vary with dataset and species, and are affected by two primary aspects: choice of learning algorithm, and the representation of protein pairs. To improve the performance of predicting protein-protein interactions, we exploit the synergy of multiple learning algorithms, and utilize the expressiveness of different protein-pair features.ResultsWe developed a stacked generalization scheme that integrates five learning algorithms. We also designed three types of protein-pair features based on the physicochemical properties of amino acids, gene ontology annotations, and interaction network topologies. When tested on 19 published datasets collected from eight species, the proposed approach achieved a significantly higher or comparable overall performance, compared with seven competitive predictors.ConclusionWe introduced an ensemble learning approach for PPI prediction that integrated multiple learning algorithms and different protein-pair representations. The extensive comparisons with other state-of-the-art prediction tools demonstrated the feasibility and superiority of the proposed method.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Prediction of protein-protein interaction sites by means of ensemble learning and weighted feature descriptor
    Du, Xiuquan
    Sun, Shiwei
    Hu, Changlin
    Li, Xinrui
    Xia, Junfeng
    JOURNAL OF BIOLOGICAL RESEARCH-THESSALONIKI, 2016, 23
  • [42] A hybrid method for protein-protein interface prediction
    Hwang, Howook
    Petrey, Donald
    Honig, Barry
    PROTEIN SCIENCE, 2016, 25 (01) : 159 - 165
  • [43] Prediction of contact matrix for protein-protein interaction
    Gonzalez, Alvaro J.
    Liao, Li
    Wu, Cathy H.
    BIOINFORMATICS, 2013, 29 (08) : 1018 - 1025
  • [44] Prediction of hot spots residues in protein-protein interface using network feature and microenvironment feature
    Ye, Ling
    Kuang, Qifan
    Jiang, Lin
    Luo, Jiesi
    Jiang, Yanping
    Ding, Zhanling
    Li, Yizhou
    Li, Menglong
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 131 : 16 - 21
  • [45] NOXclass: prediction of protein-protein interaction types
    Zhu, HB
    Domingues, FS
    Sommer, I
    Lengauer, T
    BMC BIOINFORMATICS, 2006, 7
  • [46] NOXclass: Prediction of protein-protein interaction types
    Max-Planck-Institut für Informatik, Stuhlsatzenhausweg 85, 66123 Saarbrücken, Germany
    BMC Bioinform., 2006,
  • [47] NOXclass: prediction of protein-protein interaction types
    Hongbo Zhu
    Francisco S Domingues
    Ingolf Sommer
    Thomas Lengauer
    BMC Bioinformatics, 7 (1)
  • [48] Construction and prediction of protein-protein interaction maps
    Schächter, V
    BIOINFORMATICS AND GENOME ANALYSIS, 2002, 38 : 191 - 220
  • [49] Protein-Protein Interaction Prediction: Recent Advances
    Shatnawi, Maad
    2017 28TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2017, : 69 - 73
  • [50] Protein-Protein Interaction: Prediction, Design, and Modulation
    Zhang Chang-Sheng
    Lai Lu-Hua
    ACTA PHYSICO-CHIMICA SINICA, 2012, 28 (10) : 2363 - 2380