CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information

被引:0
|
作者
Li, Li-Ping [1 ,2 ]
Zhang, Bo [1 ,2 ]
Cheng, Li [3 ]
机构
[1] Xinjiang Agr Univ, Coll Grassland & Environm Sci, Urumqi, Peoples R China
[2] Xinjiang Key Lab Grassland Resources & Ecol, Urumqi, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China
基金
美国国家科学基金会;
关键词
plant; proteinprotein interactions; machine learning; sequence; evolutionary information; PSI-BLAST; DATABASE; BIOLOGY;
D O I
10.3389/fgene.2022.857839
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identification and characterization of plant protein-protein interactions (PPIs) are critical in elucidating the functions of proteins and molecular mechanisms in a plant cell. Although experimentally validated plant PPIs data have become increasingly available in diverse plant species, the high-throughput techniques are usually expensive and labor-intensive. With the incredibly valuable plant PPIs data accumulating in public databases, it is progressively important to propose computational approaches to facilitate the identification of possible PPIs. In this article, we propose an effective framework for predicting plant PPIs by combining the position-specific scoring matrix (PSSM), local optimal-oriented pattern (LOOP), and ensemble rotation forest (ROF) model. Specifically, the plant protein sequence is firstly transformed into the PSSM, in which the protein evolutionary information is perfectly preserved. Then, the local textural descriptor LOOP is employed to extract texture variation features from PSSM. Finally, the ROF classifier is adopted to infer the potential plant PPIs. The performance of CPIELA is evaluated via cross-validation on three plant PPIs datasets: Arabidopsis thaliana, Zea mays, and Oryza sativa. The experimental results demonstrate that the CPIELA method achieved the high average prediction accuracies of 98.63%, 98.09%, and 94.02%, respectively. To further verify the high performance of CPIELA, we also compared it with the other state-of-the-art methods on three gold standard datasets. The experimental results illustrate that CPIELA is efficient and reliable for predicting plant PPIs. It is anticipated that the CPIELA approach could become a useful tool for facilitating the identification of possible plant PPIs.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhang, Shanwen
    Huang, Yu-An
    Yu, Chang-Qing
    Li, Li-Ping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 809 - 817
  • [2] A Novel Ensemble Learning-Based Computational Method to Predict Protein-Protein Interactions from Protein Primary Sequences
    Pan, Jie
    Wang, Shiwei
    Yu, Changqing
    Li, Liping
    You, Zhuhong
    Sun, Yanmei
    BIOLOGY-BASEL, 2022, 11 (05):
  • [3] Probabilistic prediction of protein-protein interactions from the protein sequences
    Chinnasamy, Arunkumar
    Mittal, Ankush
    Sung, Wing-Kin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2006, 36 (10) : 1143 - 1154
  • [4] Computational prediction of protein-protein interactions
    Skrabanek, Lucy
    Saini, Harpreet K.
    Bader, Gary D.
    Enright, Anton J.
    MOLECULAR BIOTECHNOLOGY, 2008, 38 (01) : 1 - 17
  • [5] Learning to predict protein-protein interactions from protein sequences
    Gomez, SM
    Noble, WS
    Rzhetsky, A
    BIOINFORMATICS, 2003, 19 (15) : 1875 - 1881
  • [6] Prediction of protein-protein interactions from primary sequences
    Dong, Qiwen
    Zhou, Shuigeng
    Liu, Xuan
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2010, 4 (02) : 211 - 227
  • [7] FWHT-RF: A Novel Computational Approach to Predict Plant Protein-Protein Interactions via an Ensemble Learning Method
    Pan, Jie
    Li, Li-Ping
    Yu, Chang-Qing
    You, Zhu-Hong
    Ren, Zhong-Hao
    Tang, Jing-Yu
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [8] Robust and accurate prediction of protein-protein interactions by exploiting evolutionary information
    Li, Yang
    Wang, Zheng
    Li, Li-Ping
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhan, Xin-Ke
    Wang, Yan-Bin
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [9] Prediction of protein-protein interactions from amino acid sequences with ensemble extreme learning machines and principal component analysis
    Zhu-Hong You
    Ying-Ke Lei
    Lin Zhu
    Junfeng Xia
    Bing Wang
    BMC Bioinformatics, 14
  • [10] Computational Methods for the Prediction of Protein-Protein Interactions
    Xia, Jun-Feng
    Wang, Shu-Lin
    Lei, Ying-Ke
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (09): : 1069 - 1078