Contrastive learning of protein representations with graph neural networks for structural and functional annotations

被引:0
|
作者
Luo, Jiaqi [1 ]
Luo, Yunan [2 ]
机构
[1] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing, Peoples R China
[2] Georgia Inst Technol, Sch Computat Sci & Engn, Atlanta, GA 30332 USA
关键词
Protein annotation; Protein structure and function; Deep learning; Graph neural network; Contrastive learning; Representation learning; SEQUENCE;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Although protein sequence data is growing at an ever-increasing rate, the protein universe is still sparsely annotated with functional and structural annotations. Computational approaches have become efficient solutions to infer annotations for unlabeled proteins by transferring knowledge from proteins with experimental annotations. Despite the increasing availability of protein structure data and the high coverage of high-quality predicted structures, e.g., by AlphaFold, many existing computational tools still only rely on sequence data to predict structural or functional annotations, including alignment algorithms such as BLAST and several sequence-based deep learning models. Here, we develop PenLight, a general deep learning framework for protein structural and functional annotations. PenLight uses a graph neural network (GNN) to integrate 3D protein structure data and protein language model representations. In addition, PenLight applies a contrastive learning strategy to train the GNN for learning protein representations that reflect similarities beyond sequence identity, such as semantic similarities in the function or structure space. We bench-marked PenLight on a structural classification task and a functional annotation task, where PenLight achieved higher prediction accuracy and coverage than state-of-the-art methods.
引用
下载
收藏
页码:109 / 120
页数:12
相关论文
共 50 条
  • [21] Learning Invariant Representations of Graph Neural Networks via Cluster Generalization
    Xia, Donglin
    Wang, Xiao
    Liu, Nian
    Shi, Chuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [22] Graph Convolutional Neural Networks for Learning Attribute Representations for Word Spotting
    Wolf, Fabian
    Fischer, Andreas
    Fink, Gernot A.
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 50 - 64
  • [23] Representations of Graph States with Neural Networks
    Ying Yang
    Acta Mathematica Sinica, English Series, 2023, 39 : 685 - 694
  • [24] Representations of Graph States with Neural Networks
    Yang, Ying
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2023, 39 (04) : 685 - 694
  • [25] Representations of Graph States with Neural Networks
    Ying YANG
    Acta Mathematica Sinica,English Series, 2023, (04) : 685 - 694
  • [26] Probing Negative Sampling for Contrastive Learning to Learn Graph Representations
    Chen, Shiyi
    Wang, Ziao
    Zhang, Xinni
    Zhang, Xiaofeng
    Peng, Dan
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 434 - 449
  • [27] DGCL: dual-graph neural networks contrastive learning for molecular property prediction
    Jiang, Xiuyu
    Tan, Liqin
    Zou, Qingsong
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [28] Contrastive learning enhanced by graph neural networks for Universal Multivariate Time Series Representation
    Wang, Xinghao
    Xing, Qiang
    Xiao, Huimin
    Ye, Ming
    INFORMATION SYSTEMS, 2024, 125
  • [30] Contrastive learning enhanced by graph neural networks for Universal Multivariate Time Series Representation
    College of Artificial Intelligence, Southwest University, Chongqing
    400715, China
    Inf. Syst.,