CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information

被引:0
|
作者
Li, Li-Ping [1 ,2 ]
Zhang, Bo [1 ,2 ]
Cheng, Li [3 ]
机构
[1] Xinjiang Agr Univ, Coll Grassland & Environm Sci, Urumqi, Peoples R China
[2] Xinjiang Key Lab Grassland Resources & Ecol, Urumqi, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China
基金
美国国家科学基金会;
关键词
plant; proteinprotein interactions; machine learning; sequence; evolutionary information; PSI-BLAST; DATABASE; BIOLOGY;
D O I
10.3389/fgene.2022.857839
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identification and characterization of plant protein-protein interactions (PPIs) are critical in elucidating the functions of proteins and molecular mechanisms in a plant cell. Although experimentally validated plant PPIs data have become increasingly available in diverse plant species, the high-throughput techniques are usually expensive and labor-intensive. With the incredibly valuable plant PPIs data accumulating in public databases, it is progressively important to propose computational approaches to facilitate the identification of possible PPIs. In this article, we propose an effective framework for predicting plant PPIs by combining the position-specific scoring matrix (PSSM), local optimal-oriented pattern (LOOP), and ensemble rotation forest (ROF) model. Specifically, the plant protein sequence is firstly transformed into the PSSM, in which the protein evolutionary information is perfectly preserved. Then, the local textural descriptor LOOP is employed to extract texture variation features from PSSM. Finally, the ROF classifier is adopted to infer the potential plant PPIs. The performance of CPIELA is evaluated via cross-validation on three plant PPIs datasets: Arabidopsis thaliana, Zea mays, and Oryza sativa. The experimental results demonstrate that the CPIELA method achieved the high average prediction accuracies of 98.63%, 98.09%, and 94.02%, respectively. To further verify the high performance of CPIELA, we also compared it with the other state-of-the-art methods on three gold standard datasets. The experimental results illustrate that CPIELA is efficient and reliable for predicting plant PPIs. It is anticipated that the CPIELA approach could become a useful tool for facilitating the identification of possible plant PPIs.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Hot spot prediction in protein-protein interactions by an ensemble system
    Liu, Quanya
    Chen, Peng
    Wang, Bing
    Zhang, Jun
    Li, Jinyan
    BMC SYSTEMS BIOLOGY, 2018, 12
  • [22] HPiP: an R/Bioconductor package for predicting host-pathogen protein-protein interactions from protein sequences using ensemble machine learning approach
    Rahmatbakhsh, Matineh
    Moutaoufik, Mohamed Taha
    Gagarinova, Alla
    Babu, Mohan
    BIOINFORMATICS ADVANCES, 2022, 2 (01):
  • [23] Prediction of protein-protein interactions using evolutionary and structural relationships
    Zaki, Nazar
    WORLD CONGRESS ON ENGINEERING 2008, VOLS I-II, 2008, : 1656 - 1661
  • [24] Predictina protein-protein interactions based only on sequences information
    Shen, Juwen
    Zhang, Jian
    Luo, Xiaomin
    Zhu, Weiliang
    Yu, Kunqian
    Chen, Kaixian
    Li, Yixue
    Jiang, Hualiang
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (11) : 4337 - 4341
  • [25] Computational identification of protein-protein interactions in model plant proteomes
    Ziyun Ding
    Daisuke Kihara
    Scientific Reports, 9
  • [26] Computational identification of protein-protein interactions in model plant proteomes
    Ding, Ziyun
    Kihara, Daisuke
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [27] Computational Prediction of Protein-Protein Interactions in Sphingolipid Signaling Network
    Gungormez, Yasemen
    Olmez, Elif Ozkirimli
    Ulgen, Kutlu Ozergin
    BIYOMUT: 2009 14TH NATIONAL BIOMEDICAL ENGINEERING MEETING, 2009, : 231 - 234
  • [28] Computational Prediction of Protein-Protein Interactions in Leishmania Predicted Proteomes
    Rezende, Antonio M.
    Folador, Edson L.
    Resende, Daniela de M.
    Ruiz, Jeronimo C.
    PLOS ONE, 2012, 7 (12):
  • [29] PPIevo: Protein-protein interaction prediction from PSSM based evolutionary information
    Zahiri, Javad
    Yaghoubi, Omid
    Mohammad-Noori, Morteza
    Ebrahimpour, Reza
    Masoudi-Nejad, Ali
    GENOMICS, 2013, 102 (04) : 237 - 242
  • [30] Computational prediction of host-pathogen protein-protein interactions
    Dyer, Matthew D.
    Murali, T. M.
    Sobral, Bruno W.
    BIOINFORMATICS, 2007, 23 (13) : I159 - I166