Deep learning-assisted prediction of protein-protein interactions in Arabidopsis thaliana

被引:4
|
作者
Zheng, Jingyan [1 ]
Yang, Xiaodi [2 ]
Huang, Yan [1 ]
Yang, Shiping [3 ]
Wuchty, Stefan [4 ,5 ,6 ,7 ]
Zhang, Ziding [1 ]
机构
[1] China Agr Univ, Coll Biol Sci, State Key Lab Anim Biotech Breeding, Beijing 100193, Peoples R China
[2] Peking Univ First Hosp, Dept Hematol, Beijing 100034, Peoples R China
[3] China Agr Univ, Coll Biol Sci, State Key Lab Plant Physiol & Biochem, Beijing 100193, Peoples R China
[4] Univ Miami, Dept Comp Sci, Miami, FL 33146 USA
[5] Univ Miami, Dept Biol, Miami, FL 33146 USA
[6] Univ Miami, Sylvester Comprehens Canc Ctr, Miami, FL 33136 USA
[7] Univ Miami, Inst Data Sci & Comp, Miami, FL 33146 USA
来源
PLANT JOURNAL | 2023年 / 114卷 / 04期
基金
中国国家自然科学基金;
关键词
Arabidopsis thaliana; protein-protein interaction; deep learning; prediction; GO annotation; domain; MOLECULAR-INTERACTIONS; DATABASE; NETWORKS; PLATFORM; WIDE;
D O I
10.1111/tpj.16188
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Currently, the experimentally identified interactome of Arabidopsis (Arabidopsis thaliana) is still far from complete, suggesting that computational prediction methods can complement experimental techniques. Motivated by the prosperity and success of deep learning algorithms and natural language processing techniques, we introduce an integrative deep learning framework, DeepAraPPI, allowing us to predict protein-protein interactions (PPIs) of Arabidopsis utilizing sequence, domain and Gene Ontology (GO) information. Our current DeepAraPPI comprises: (i) a word2vec encoding-based Siamese recurrent convolutional neural network (RCNN) model; (ii) a Domain2vec encoding-based multiple-layer perceptron (MLP) model; and (iii) a GO2vec encoding-based MLP model. Finally, DeepAraPPI combines the prediction results of the three individual predictors through a logistic regression model. Compiling high-quality positive and negative training and test samples by applying strict filtering strategies, DeepAraPPI shows superior performance compared with existing state-of-the-art Arabidopsis PPI prediction methods. DeepAraPPI also provides better cross-species predictive ability in rice (Oryza sativa) than traditional machine learning methods, although the overall performance in cross-species prediction remains to be improved. DeepAraPPI is freely accessible at . In the meantime, we have also made the source code and data sets of DeepAraPPI available at .
引用
收藏
页码:984 / 994
页数:11
相关论文
共 50 条
  • [41] Predicting protein-protein interactions in Arabidopsis thaliana through integration of orthology, gene ontology and co-expression
    De Bodt, Stefanie
    Proost, Sebastian
    Vandepoele, Klaas
    Rouze, Pierre
    Van de Peer, Yves
    BMC GENOMICS, 2009, 10 : 288
  • [42] Computational Methods for the Prediction of Protein-Protein Interactions
    Xia, Jun-Feng
    Wang, Shu-Lin
    Lei, Ying-Ke
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (09): : 1069 - 1078
  • [43] Protein-Protein Interactions and Prediction: A Comprehensive Overview
    Sowmya, Gopichandran
    Ranganathan, Shoba
    PROTEIN AND PEPTIDE LETTERS, 2014, 21 (08): : 779 - 789
  • [44] Prediction of protein-protein interactions by docking methods
    Smith, GR
    Sternberg, MJE
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (01) : 28 - 35
  • [45] Computational Methods for the Prediction of Protein-Protein Interactions
    Guerra, Concettina
    Mina, Marco
    COMBINATORIAL IMAGE ANALYSIS, 2011, 6636 : 13 - 16
  • [46] Theoretical prediction of the protein-protein interaction between Arabidopsis thaliana COP1 and UVR8
    Wu, Min
    Eriksson, Leif A.
    Strid, Ake
    THEORETICAL CHEMISTRY ACCOUNTS, 2013, 132 (07) : 1 - 11
  • [47] Predicting protein-protein interactions through sequence-based deep learning
    Hashemifar, Somaye
    Neyshabur, Behnam
    Khan, Aly A.
    Xu, Jinbo
    BIOINFORMATICS, 2018, 34 (17) : 802 - 810
  • [48] A deep learning algorithm for predicting protein-protein interactions with nonnegative latent factorization
    Wang, Liwei
    Hu, Lun
    2021 INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SOCIAL INTELLIGENCE (ICCSI), 2021,
  • [49] Adaptive compressive learning for prediction of protein-protein interactions from primary sequence
    Zhang, Ya-Nan
    Pan, Xiao-Yong
    Huang, Yan
    Shen, Hong-Bin
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 283 (01) : 44 - 52
  • [50] Ensemble learning prediction of protein-protein interactions using proteins functional annotations
    Saha, Indrajit
    Zubek, Julian
    Klingstrom, Tomas
    Forsberg, Simon
    Wikander, Johan
    Kierczak, Marcin
    Maulik, Ujjwal
    Plewczynski, Dariusz
    MOLECULAR BIOSYSTEMS, 2014, 10 (04) : 820 - 830