Interpretable Structured Learning with Sparse Gated Sequence Encoder for Protein-Protein Interaction Prediction

Cited by: 2
Authors
Kishan, K. C. [1]
Cui, Feng [2]
Haake, Anne R. [1]
Li, Rui [1]
Affiliations
[1] Rochester Inst Technol, Golisano Coll Comp & Informat Sci, Rochester, NY 14623 USA
[2] Rochester Inst Technol, Thomas H Gosnell Sch Life Sci, Rochester, NY 14623 USA
Funding
National Institutes of Health (USA); National Science Foundation (USA)
DOI
10.1109/ICPR48806.2021.9412055
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Predicting protein-protein interactions (PPIs) by learning informative representations from amino acid sequences is a challenging yet important problem in biology. Although various deep learning models with Siamese architectures have been proposed to model PPIs from sequences, these methods are computationally expensive for a large number of PPIs due to the pairwise encoding process. Furthermore, these methods are difficult to interpret because of non-intuitive mappings from protein sequences to their sequence representations. To address these challenges, we present a novel deep framework to model and predict PPIs from sequence alone. Our model incorporates a bidirectional gated recurrent unit to learn sequence representations by leveraging contextualized and sequential information from sequences. We further employ sparse regularization to model long-range dependencies between amino acids and to select important amino acids (protein motifs), thus enhancing interpretability. In addition, the novel design of the encoding process makes our model computationally efficient and scalable to an increasing number of interactions. Experimental results on up-to-date interaction datasets demonstrate that our model achieves superior performance compared to other state-of-the-art methods. Literature-based case studies illustrate the ability of our model to provide biological insights for interpreting its predictions.
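The cost argument in the abstract can be illustrated with a small sketch. It is not the authors' model: the abstract does not describe how the encoding process is designed, so the code below only assumes one common way to avoid pairwise re-encoding, namely encoding each protein once and reusing the cached vector. The encoder (amino acid composition), the dot-product score, the sequences, and all names such as `CountingEncoder` and `score_encode_once` are hypothetical stand-ins chosen for illustration.

```python
from itertools import combinations

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


class CountingEncoder:
    """Toy stand-in for a sequence encoder; counts how often it is called."""

    def __init__(self):
        self.calls = 0

    def __call__(self, sequence):
        self.calls += 1
        # Amino acid composition: fraction of each residue type (assumption,
        # used here only as a cheap placeholder for a learned representation).
        return [sequence.count(a) / len(sequence) for a in AMINO_ACIDS]


def score(u, v):
    """Dot product as a placeholder interaction score."""
    return sum(a * b for a, b in zip(u, v))


def score_pairwise(encoder, proteins, pairs):
    """Re-encode both sequences for every candidate pair."""
    return {(a, b): score(encoder(proteins[a]), encoder(proteins[b]))
            for a, b in pairs}


def score_encode_once(encoder, proteins, pairs):
    """Encode each protein once, then reuse the cached vectors."""
    cache = {name: encoder(seq) for name, seq in proteins.items()}
    return {(a, b): score(cache[a], cache[b]) for a, b in pairs}


# Made-up sequences; 4 proteins give 6 candidate pairs.
proteins = {
    "P1": "MKTAYIAKQR",
    "P2": "GSHMLEDPVA",
    "P3": "MKKLLPTAAA",
    "P4": "ACDEFGHIKL",
}
pairs = list(combinations(proteins, 2))

pairwise_encoder = CountingEncoder()
cached_encoder = CountingEncoder()
pairwise_scores = score_pairwise(pairwise_encoder, proteins, pairs)
cached_scores = score_encode_once(cached_encoder, proteins, pairs)

print(pairwise_encoder.calls, cached_encoder.calls)  # 12 4
```

With n proteins and m candidate pairs, the pairwise route makes 2m encoder calls while the cached route makes n, and both produce the same scores here. The gap widens as the number of candidate interactions grows, which is the scalability concern the abstract raises.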
Pages: 7126 - 7133
Number of pages: 8
Related papers
50 in total
  • [41] Protein-Protein Interaction Prediction: Recent Advances
    Shatnawi, Maad
    2017 28TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2017, : 69 - 73
  • [42] Protein-Protein Interaction: Prediction, Design, and Modulation
    Zhang Chang-Sheng
    Lai Lu-Hua
    ACTA PHYSICO-CHIMICA SINICA, 2012, 28 (10) : 2363 - 2380
  • [43] Sequence-based protein-protein interaction prediction via support vector machine
    Wang, Yongcui
    Wang, Jiguang
    Yang, Zhixia
    Deng, Naiyang
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2010, 23 (05) : 1012 - 1023
  • [44] Amalgamation of 3D structure and sequence information for protein-protein interaction prediction
    Jha, Kanchan
    Saha, Sriparna
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [45] DeNovo: virus-host sequence-based protein-protein interaction prediction
    Eid, Fatma-Elzahraa
    ElHefnawi, Mahmoud
    Heath, Lenwood S.
    BIOINFORMATICS, 2016, 32 (08) : 1144 - 1150
  • [46] Prediction of protein-protein interactions using stacked auto-encoder
    Jha, Kanchan
    Saha, Sriparna
    Tanveer, M.
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2022, 33 (10)
  • [47] Protein-Protein Interaction Prediction via Structure-Based Deep Learning
    Liu, Yucong
    Liu, Yijun
    Li, Zhenhai
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2024, 92 (11) : 1287 - 1296
  • [48] SENSDeep: An Ensemble Deep Learning Method for Protein-Protein Interaction Sites Prediction
    Aybey, Engin
    Gumus, Ozgur
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2023, 15 (01) : 55 - 87
  • [49] Fast prediction of protein-protein interaction sites based on Extreme Learning Machines
    Wang, Debby A.
    Wang, Ran
    Yan, Hong
    NEUROCOMPUTING, 2014, 128 : 258 - 266
  • [50] Classification and prediction of protein-protein interaction interface using machine learning algorithm
    Das, Subhrangshu
    Chakrabarti, Saikat
    SCIENTIFIC REPORTS, 2021, 11 (01)