Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme

被引:43
|
作者
Chen, Kuan-Hsi [1 ]
Wang, Tsai-Feng [2 ]
Hu, Yuh-Jyh [3 ]
机构
[1] Natl Chiao Tung Univ, Coll Comp Sci, Hsinchu 300, Taiwan
[2] Natl Chiao Tung Univ, Inst Data Sci & Engn, Hsinchu 300, Taiwan
[3] Natl Chiao Tung Univ, Inst Biomed Engn, Coll Comp Sci, Hsinchu 300, Taiwan
关键词
Protein-protein interaction; Stacked generalization; Gene ontology; Network topology; SEMANTIC SIMILARITY MEASURES; GENE ONTOLOGY; SEQUENCES; SCALE; TOOL; RESIDUES; CELL;
D O I
10.1186/s12859-019-2907-1
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundAlthough various machine learning-based predictors have been developed for estimating protein-protein interactions, their performances vary with dataset and species, and are affected by two primary aspects: choice of learning algorithm, and the representation of protein pairs. To improve the performance of predicting protein-protein interactions, we exploit the synergy of multiple learning algorithms, and utilize the expressiveness of different protein-pair features.ResultsWe developed a stacked generalization scheme that integrates five learning algorithms. We also designed three types of protein-pair features based on the physicochemical properties of amino acids, gene ontology annotations, and interaction network topologies. When tested on 19 published datasets collected from eight species, the proposed approach achieved a significantly higher or comparable overall performance, compared with seven competitive predictors.ConclusionWe introduced an ensemble learning approach for PPI prediction that integrated multiple learning algorithms and different protein-pair representations. The extensive comparisons with other state-of-the-art prediction tools demonstrated the feasibility and superiority of the proposed method.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme
    Kuan-Hsi Chen
    Tsai-Feng Wang
    Yuh-Jyh Hu
    BMC Bioinformatics, 20
  • [2] A novel feature extraction scheme for prediction of protein-protein interaction sites
    Du, Xiuquan
    Jing, Anqi
    Hu, Xinying
    MOLECULAR BIOSYSTEMS, 2015, 11 (02) : 475 - 485
  • [3] A Novel Feature Extraction Scheme with Ensemble Coding for Protein-Protein Interaction Prediction
    Du, Xiuquan
    Cheng, Jiaxing
    Zheng, Tingting
    Duan, Zheng
    Qian, Fulan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2014, 15 (07) : 12731 - 12749
  • [4] Applying Feature Coupling Generalization for Protein-Protein Interaction Extraction
    Li, Yanpeng
    Lin, Hongfei
    Yang, Zhihao
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 396 - 400
  • [5] MM-StackEns: A new deep multimodal stacked generalization approach for protein-protein interaction prediction
    Albu, Alexandra-Ioana
    Bocicor, Maria-Iuliana
    Czibula, Gabriela
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [6] A mixture of feature experts approach for protein-protein interaction prediction
    Yanjun Qi
    Judith Klein-Seetharaman
    Ziv Bar-Joseph
    BMC Bioinformatics, 8
  • [7] A mixture of feature experts approach for protein-protein interaction prediction
    Qi, Yanjun
    Klein-Seetharaman, Judith
    Bar-Joseph, Ziv
    BMC BIOINFORMATICS, 2007, 8 (Suppl 10)
  • [8] Prediction of Protein-Protein Interaction Based on Weighted Feature Fusion
    Zhang, Chunhua
    Guo, Sijia
    Zhang, Jingbo
    Jin, Xizi
    Li, Yanwen
    Du, Ning
    Sun, Pingping
    Jiang, Baohua
    LETTERS IN ORGANIC CHEMISTRY, 2019, 16 (04) : 263 - 274
  • [9] Protein-protein interaction site prediction by model ensembling with hybrid feature and self-attention
    Cong, Hanhan
    Liu, Hong
    Cao, Yi
    Liang, Cheng
    Chen, Yuehui
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [10] Improving protein-protein interactions prediction accuracy using XGBoost feature selection and stacked ensemble classifier
    Chen, Cheng
    Zhang, Qingmei
    Yu, Bin
    Yu, Zhaomin
    Lawrence, Patrick J.
    Ma, Qin
    Zhang, Yan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 123