Protein-peptide binding residue prediction based on protein language models and cross-attention mechanism

被引:4
|
作者
Hu, Jun [1 ,2 ]
Chen, Kai-Xin [1 ]
Rao, Bing [3 ]
Ni, Jing-Yuan [4 ]
Thafar, Maha A. [5 ]
Albaradei, Somayah [6 ]
Arif, Muhammad [7 ]
机构
[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou 310023, Peoples R China
[2] Suzhou Inst Syst Med, Ctr AI & Computat Biol, Suzhou 215123, Peoples R China
[3] Hangzhou City Univ, Sch Informat & Elect Engn, Hangzhou 310015, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, NUIST Reading Acad, Nanjing 210044, Peoples R China
[5] Taif Univ, Coll Comp & Informat Technol, Dept Comp Sci, Taif 21944, Saudi Arabia
[6] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah, Saudi Arabia
[7] Hamad Bin Khalifa Univ, Coll Sci & Engn, Doha 34110, Qatar
基金
中国国家自然科学基金;
关键词
Peptide-binding residue prediction; Protein language model; Sequence-based methods; Cross-attention mechanism; SEQUENCE-BASED PREDICTION; WEB SERVER; REGIONS; SITES;
D O I
10.1016/j.ab.2024.115637
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurate identifications of protein-peptide binding residues are essential for protein-peptide interactions and advancing drug discovery. To address this problem, extensive research efforts have been made to design more discriminative feature representations. However, extracting these explicit features usually depend on third-party tools, resulting in low computational efficacy and suffering from low predictive performance. In this study, we design an end-to-end deep learning-based method, E2EPep, for protein-peptide binding residue prediction using protein sequence only. E2EPep first employs and fine-tunes two state-of-the-art pre-trained protein language models that can extract two different high-latent feature representations from protein sequences relevant for protein structures and functions. A novel feature fusion module is then designed in E2EPep to fuse and optimize the above two feature representations of binding residues. In addition, we have also design E2EPep+, which integrates E2EPep and PepBCL models, to improve the prediction performance. Experimental results on two independent testing data sets demonstrate that E2EPep and E2EPep + could achieve the average AUC values of 0.846 and 0.842 while achieving an average Matthew's correlation coefficient value that is significantly higher than that of existing most of sequence-based methods and comparable to that of the state-of-the-art structurebased predictors. Detailed data analysis shows that the primary strength of E2EPep lies in the effectiveness of feature representation using cross-attention mechanism to fuse the embeddings generated by two fine-tuned protein language models. The standalone package of E2EPep and E2EPep + can be obtained at https://github. com/ckx259/E2EPep.git for academic use only.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Accurate Prediction for Protein-Peptide Binding Based on High-Temperature Molecular Dynamics Simulations
    Chen, Jia-Nan
    Jiang, Fan
    Wu, Yun-Dong
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2022, : 6386 - 6395
  • [22] Sequence-Based Prediction of Protein-Peptide Binding Sites Using Support Vector Machine
    Taherzadeh, Ghazaleh
    Yang, Yuedong
    Zhang, Tuo
    Liew, Alan Wee-Chung
    Zhou, Yaoqi
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2016, 37 (13) : 1223 - 1229
  • [23] Predicting protein-peptide interactions based on the peptide sequence and the protein structure
    Zou, Xiaoqin
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [24] A regularized discriminative model for the prediction of protein-peptide interactions
    Lehrach, WP
    Husmeier, D
    Williams, CKI
    BIOINFORMATICS, 2006, 22 (05) : 532 - 540
  • [25] Protein-Peptide Binding Energetics under Crowded Conditions
    Stadmiller, Samantha S.
    Aguilar, Jhoan S.
    Parnham, Stuart
    Pielak, Gary J.
    JOURNAL OF PHYSICAL CHEMISTRY B, 2020, 124 (42): : 9297 - 9309
  • [26] Perceiver CPI: a nested cross-attention network for compound-protein interaction prediction
    Nguyen, Ngoc-Quang
    Jang, Gwanghoon
    Kim, Hajung
    Kang, Jaewoo
    BIOINFORMATICS, 2023, 39 (01)
  • [27] Atomistic simulation of protein-peptide binding with rigorous kinetics
    Chong, Lillian
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2014, 247
  • [28] Prediction of Protein-Peptide Interactions with a Nearest Neighbor Algorithm
    Li, Bi-Qing
    Zhang, Yu-Hang
    Jin, Mei-Ling
    Huang, Tao
    Cai, Yu-Dong
    CURRENT BIOINFORMATICS, 2018, 13 (01) : 14 - 24
  • [29] AF2BIND: Prediction of protein-peptide and protein-ligand binding sites using AlphaFold
    Gazizov, Artem
    Sergey, Ovchinnikov
    Nicholas, Polizzi
    PROTEIN SCIENCE, 2023, 32 (12)
  • [30] Exploring Protein-Peptide Binding Specificity through Computational Peptide Screening
    Bhattacherjee, Arnab
    Wallin, Stefan
    PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (10)