CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information

被引:0
|
作者
Li, Li-Ping [1 ,2 ]
Zhang, Bo [1 ,2 ]
Cheng, Li [3 ]
机构
[1] Xinjiang Agr Univ, Coll Grassland & Environm Sci, Urumqi, Peoples R China
[2] Xinjiang Key Lab Grassland Resources & Ecol, Urumqi, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China
基金
美国国家科学基金会;
关键词
plant; proteinprotein interactions; machine learning; sequence; evolutionary information; PSI-BLAST; DATABASE; BIOLOGY;
D O I
10.3389/fgene.2022.857839
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identification and characterization of plant protein-protein interactions (PPIs) are critical in elucidating the functions of proteins and molecular mechanisms in a plant cell. Although experimentally validated plant PPIs data have become increasingly available in diverse plant species, the high-throughput techniques are usually expensive and labor-intensive. With the incredibly valuable plant PPIs data accumulating in public databases, it is progressively important to propose computational approaches to facilitate the identification of possible PPIs. In this article, we propose an effective framework for predicting plant PPIs by combining the position-specific scoring matrix (PSSM), local optimal-oriented pattern (LOOP), and ensemble rotation forest (ROF) model. Specifically, the plant protein sequence is firstly transformed into the PSSM, in which the protein evolutionary information is perfectly preserved. Then, the local textural descriptor LOOP is employed to extract texture variation features from PSSM. Finally, the ROF classifier is adopted to infer the potential plant PPIs. The performance of CPIELA is evaluated via cross-validation on three plant PPIs datasets: Arabidopsis thaliana, Zea mays, and Oryza sativa. The experimental results demonstrate that the CPIELA method achieved the high average prediction accuracies of 98.63%, 98.09%, and 94.02%, respectively. To further verify the high performance of CPIELA, we also compared it with the other state-of-the-art methods on three gold standard datasets. The experimental results illustrate that CPIELA is efficient and reliable for predicting plant PPIs. It is anticipated that the CPIELA approach could become a useful tool for facilitating the identification of possible plant PPIs.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Computational prediction of protein-protein interactions' network in Arabidopsis thaliana
    Hekmati, Zhale
    Zahiri, Javad
    Aalami, Ali
    ACTA PHYSIOLOGIAE PLANTARUM, 2023, 45 (12)
  • [32] Predicting protein-protein interactions via multivariate mutual information of protein sequences
    Yijie Ding
    Jijun Tang
    Fei Guo
    BMC Bioinformatics, 17
  • [33] Predicting protein-protein interactions via multivariate mutual information of protein sequences
    Ding, Yijie
    Tang, Jijun
    Guo, Fei
    BMC BIOINFORMATICS, 2016, 17
  • [34] Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model
    An, Ji-Yong
    Meng, Fan-Rong
    You, Zhu-Hong
    Chen, Xing
    Yan, Gui-Ying
    Hu, Ji-Pu
    PROTEIN SCIENCE, 2016, 25 (10) : 1825 - 1833
  • [35] Protein-protein interactions prediction based on ensemble deep neural networks
    Zhang, Long
    Yu, Guoxian
    Xia, Dawen
    Wang, Jun
    NEUROCOMPUTING, 2019, 324 : 10 - 19
  • [36] Highly Accurate Prediction of Protein-Protein Interactions via Incorporating Evolutionary Information and Physicochemical Characteristics
    Li, Zheng-Wei
    You, Zhu-Hong
    Chen, Xing
    Gui, Jie
    Nie, Ru
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (09):
  • [37] Computational Testing of Protein-Protein Interactions
    Katebi, Ataur R.
    Kloczkowski, Andrzej
    Jernigan, Robert L.
    BIBMW: 2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOP, 2009, : 142 - 149
  • [38] Detecting protein function and protein-protein interactions from genome sequences
    Marcotte, EM
    Pellegrini, M
    Ng, HL
    Rice, DW
    Yeates, TO
    Eisenberg, D
    SCIENCE, 1999, 285 (5428) : 751 - 753
  • [39] Computational design of protein-protein interactions
    Schreiber, Gideon
    Fleishman, Sarel J.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2013, 23 (06) : 903 - 910
  • [40] Computational design of protein-protein interactions
    Kortemme, T
    Baker, D
    CURRENT OPINION IN CHEMICAL BIOLOGY, 2004, 8 (01) : 91 - 97