Interpretable Structured Learning with Sparse Gated Sequence Encoder for Protein-Protein Interaction Prediction

被引:2
|
作者
Kishan, K. C. [1 ]
Cui, Feng [2 ]
Haake, Anne R. [1 ]
Li, Rui [1 ]
机构
[1] Rochester Inst Technol, Golisano Coll Comp & Informat Sci, Rochester, NY 14623 USA
[2] Rochester Inst Technol, Thomas H Gosnell Sch Life Sci, Rochester, NY 14623 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
D O I
10.1109/ICPR48806.2021.9412055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting protein-protein interactions (PPIs) by learning informative representations from amino acid sequences is a challenging yet important problem in biology. Although various deep learning models in Siamese architecture have been proposed to model PPIs from sequences, these methods are computationally expensive for a large number of PPIs due to the pairwise encoding process. Furthermore, these methods are difficult to interpret because of non-intuitive mappings from protein sequences to their sequence representation. To address these challenges, we present a novel deep framework to model and predict PPIs from sequence alone. Our model incorporates a bidirectional gated recurrent unit to learn sequence representations by leveraging contextualized and sequential information from sequences. We further employ a sparse regularization to model long-range dependencies between amino acids and to select important amino acids (protein motifs), thus enhancing interpretability. Besides, the novel design of the encoding process makes our model computationally efficient and scalable to an increasing number of interactions. Experimental results on up-to-date interaction datasets demonstrate that our model achieves superior performance compared to other state-of-the-art methods. Literature-based case studies illustrate the ability of our model to provide biological insights to interpret the predictions.
引用
收藏
页码:7126 / 7133
页数:8
相关论文
共 50 条
  • [21] Prediction of protein-protein interaction inhibitors by chemoinformatics and machine learning methods
    Neugebauer, Alexander
    Hartmann, Rolf W.
    Klein, Christian D.
    JOURNAL OF MEDICINAL CHEMISTRY, 2007, 50 (19) : 4665 - 4668
  • [22] Learning spatial structures of proteins improves protein-protein interaction prediction
    Song, Bosheng
    Luo, Xiaoyan
    Luo, Xiaoli
    Liu, Yuansheng
    Niu, Zhangming
    Zeng, Xiangxiang
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [23] COFACTOR: improved protein function prediction by combining structure, sequence and protein-protein interaction information
    Zhang, Chengxin
    Freddolino, Peter L.
    Zhang, Yang
    NUCLEIC ACIDS RESEARCH, 2017, 45 (W1) : W291 - W299
  • [24] Prediction of protein motions from amino acid sequence and its application to protein-protein interaction
    Hirose, Shuichi
    Yokota, Kiyonobu
    Kuroda, Yutaka
    Wako, Hiroshi
    Endo, Shigeru
    Kanai, Satoru
    Noguchi, Tamotsu
    BMC STRUCTURAL BIOLOGY, 2010, 10
  • [25] Adaptive compressive learning for prediction of protein-protein interactions from primary sequence
    Zhang, Ya-Nan
    Pan, Xiao-Yong
    Huang, Yan
    Shen, Hong-Bin
    JOURNAL OF THEORETICAL BIOLOGY, 2011, 283 (01) : 44 - 52
  • [26] Ensemble learning model for Protein-Protein interaction prediction with multiple Machine learning techniques
    Lai, Zhenghui
    Li, Mengshan
    Chen, Qianyong
    Gu, Yunlong
    Wang, Nan
    Guan, Lixin
    MEASUREMENT, 2025, 242
  • [27] Learning Sequence Determinants of Protein: Protein Interaction Specificity with Sparse Graphical Models
    Kamisetty, Hetunandan
    Ghosh, Bornika
    Langmead, Christopher James
    Bailey-Kellogg, Chris
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB2014, 2014, 8394 : 129 - 143
  • [28] Learning Sequence Determinants of Protein: Protein Interaction Specificity with Sparse Graphical Models
    Kamisetty, Hetunandan
    Ghosh, Bornika
    Langmead, Christopher James
    Bailey-Kellogg, Chris
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2015, 22 (06) : 474 - 486
  • [29] Protein-Protein Interaction Prediction for Targeted Protein Degradation
    Orasch, Oliver
    Weber, Noah
    Mueller, Michael
    Amanzadi, Amir
    Gasbarri, Chiara
    Trummer, Christopher
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (13)
  • [30] Integration of protein sequence and protein-protein interaction data by hypergraph learning to identify novel protein complexes
    Xia, Simin
    Li, Dianke
    Deng, Xinru
    Liu, Zhongyang
    Zhu, Huaqing
    Liu, Yuan
    Li, Dong
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (04)