Pair-EGRET: enhancing the prediction of protein-protein interaction sites through graph attention networks and protein language models

被引:1
|
作者
Alam, Ramisa [1 ]
Mahbub, Sazan [1 ,2 ]
Bayzid, Md Shamsuzzoha [1 ]
机构
[1] Bangladesh Univ Engn & Technol, Dept Comp Sci & Engn, ECE Bldg, West Palashi, Dhaka 1205, Bangladesh
[2] Carnegie Mellon Univ, Sch Comp Sci, Computat Biol Dept, Pittsburgh, PA 15213 USA
关键词
FINGERPRINTS; SEQUENCE; SERVER;
D O I
10.1093/bioinformatics/btae588
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Proteins are responsible for most biological functions, many of which require the interaction of more than one protein molecule. However, accurately predicting protein-protein interaction (PPI) sites (the interfacial residues of a protein that interact with other protein molecules) remains a challenge. The growing demand and cost associated with the reliable identification of PPI sites using conventional experimental methods call for computational tools for automated prediction and understanding of PPIs.Results We present Pair-EGRET, an edge-aggregated graph attention network that leverages the features extracted from pretrained transformer-like models to accurately predict PPI sites. Pair-EGRET works on a k-nearest neighbor graph, representing the 3D structure of a protein, and utilizes the cross-attention mechanism for accurate identification of interfacial residues of a pair of proteins. Through an extensive evaluation study using a diverse array of experimental data, evaluation metrics, and case studies on representative protein sequences, we demonstrate that Pair-EGRET can achieve remarkable performance in predicting PPI sites. Moreover, Pair-EGRET can provide interpretable insights from the learned cross-attention matrix.Availability and implementation Pair-EGRET is freely available in open source form at the GitHub Repository https://github.com/1705004/Pair-EGRET.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Study of the protein-protein interaction networks via random graph approach
    Lee, PH
    Tsai, JJP
    Huang, CH
    Ng, KL
    Fang, JF
    ICCI 2005: FOURTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS - PROCEEDINGS, 2005, : 110 - 119
  • [42] Prediction of Protein-Protein Interaction Sites Based on Naive Bayes Classifier
    Geng, Haijiang
    Lu, Tao
    Lin, Xiao
    Liu, Yu
    Yan, Fangrong
    BIOCHEMISTRY RESEARCH INTERNATIONAL, 2015, 2015
  • [43] Prediction of Protein-Protein Interaction Sites Based on Stratified Attentional Mechanisms
    Tang, Minli
    Wu, Longxin
    Yu, Xinyu
    Chu, Zhaoqi
    Jin, Shuting
    Liu, Juan
    FRONTIERS IN GENETICS, 2021, 12
  • [44] Global alignment of protein-protein interaction networks by graph matching methods
    Zaslavskiy, Mikhail
    Bach, Francis
    Vert, Jean-Philippe
    BIOINFORMATICS, 2009, 25 (12) : I259 - I267
  • [45] A novel feature extraction scheme for prediction of protein-protein interaction sites
    Du, Xiuquan
    Jing, Anqi
    Hu, Xinying
    MOLECULAR BIOSYSTEMS, 2015, 11 (02) : 475 - 485
  • [46] Prediction of Protein-Protein Interaction Sites Using Electrostatic Desolvation Profiles
    Fiorucci, Sebastien
    Zacharias, Martin
    BIOPHYSICAL JOURNAL, 2010, 98 (09) : 1921 - 1930
  • [47] Prediction of protein-protein interaction sites using support vector machines
    Minakuchi, Y
    Satou, K
    Konagaya, A
    METMBS'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, 2003, : 22 - 28
  • [48] Prediction of protein-protein interaction sites using support vector machines
    Koike, A
    Takagi, T
    PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (02): : 165 - 173
  • [49] Pitfalls of machine learning models for protein-protein interaction networks
    Lannelongue, Loic
    Inouye, Michael
    BIOINFORMATICS, 2024, 40 (02)
  • [50] Improving protein-protein interaction prediction using protein language model and protein network features
    Hu, Jun
    Li, Zhe
    Rao, Bing
    Thafar, Maha A.
    Arif, Muhammad
    ANALYTICAL BIOCHEMISTRY, 2024, 693