Protein 3D Graph Structure Learning for Robust Structure-Based Protein Property Prediction

被引:0
|
作者
Huang, Yufei [1 ,2 ]
Li, Siyuan [1 ,2 ]
Wu, Lirong [1 ,2 ]
Su, Jin [1 ,2 ]
Lin, Haitao [1 ,2 ]
Zhang, Odin [1 ]
Liu, Zihan [1 ,2 ]
Gao, Zhangyang [1 ,2 ]
Zheng, Jiangbin [1 ,2 ]
Li, Stan Z. [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Westlake Univ, AI Lab, Res Ctr Ind Future, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein structure-based property prediction has emerged as a promising approach for various biological tasks, such as protein function prediction and sub-cellular location estimation. The existing methods highly rely on experimental protein structure data and fail in scenarios where these data are unavailable. Predicted protein structures from AI tools (e.g., AlphaFold2) were utilized as alternatives. However, we observed that current practices, which simply employ accurately predicted structures during inference, suffer from notable degradation in prediction accuracy. While similar phenomena have been extensively studied in general fields (e.g., Computer Vision) as model robustness, their impact on protein property prediction remains unexplored. In this paper, we first investigate the reason behind the performance decrease when utilizing predicted structures, attributing it to the structure embedding bias from the perspective of structure representation learning. To study this problem, we identify a Protein 3D Graph Structure Learning Problem for Robust Protein Property Prediction (PGSL-RP3), collect benchmark datasets, and present a protein Structure embedding Alignment Optimization framework (SAO) to mitigate the problem of structure embedding bias between the predicted and experimental protein structures. Extensive experiments have shown that our framework is model-agnostic and effective in improving the property prediction of both predicted structures and experimental structures.
引用
收藏
页码:12662 / 12670
页数:9
相关论文
共 50 条
  • [1] 3D structure-based protein retention prediction for ion-exchange chromatography
    Dismer, Florian
    Hubbuch, Juergen
    JOURNAL OF CHROMATOGRAPHY A, 2010, 1217 (08) : 1343 - 1353
  • [2] Improving structure-based protein-ligand affinity prediction by graph representation learning and ensemble learning
    Guo, Jia
    PLOS ONE, 2024, 19 (01):
  • [3] Structure-based protein function prediction using graph convolutional networks
    Gligorijevic, Vladimir
    Renfrew, P. Douglas
    Kosciolek, Tomasz
    Leman, Julia Koehler
    Berenberg, Daniel
    Vatanen, Tommi
    Chandler, Chris
    Taylor, Bryn C.
    Fisk, Ian M.
    Vlamakis, Hera
    Xavier, Ramnik J.
    Knight, Rob
    Cho, Kyunghyun
    Bonneau, Richard
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [4] Structure-based protein function prediction using graph convolutional networks
    Vladimir Gligorijević
    P. Douglas Renfrew
    Tomasz Kosciolek
    Julia Koehler Leman
    Daniel Berenberg
    Tommi Vatanen
    Chris Chandler
    Bryn C. Taylor
    Ian M. Fisk
    Hera Vlamakis
    Ramnik J. Xavier
    Rob Knight
    Kyunghyun Cho
    Richard Bonneau
    Nature Communications, 12
  • [5] Structure-based prediction of protein allostery
    Greener, Joe G.
    Sternberg, Michael J. E.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2018, 50 : 1 - 8
  • [6] Protein-Protein Interaction Prediction via Structure-Based Deep Learning
    Liu, Yucong
    Liu, Yijun
    Li, Zhenhai
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2024,
  • [7] Structure-based prediction of protein function
    Honig, B
    FASEB JOURNAL, 1998, 12 (08): : A1313 - A1313
  • [8] Spatom: a graph neural network for structure-based protein-protein interaction site prediction
    Wu, Haonan
    Han, Jiyun
    Zhang, Shizhuo
    Xin, Gaojia
    Mou, Chaozhou
    Liu, Juntao
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [9] GNNGO3D: Protein Function Prediction Based on 3D Structure and Functional Hierarchy Learning
    Zhang, Liyuan
    Jiang, Yongquan
    Yang, Yan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (08) : 3867 - 3878
  • [10] Integrated structure-based protein interface prediction
    Walder, M.
    Edelstein, E.
    Carroll, M.
    Lazarev, S.
    Fajardo, J. E.
    Fiser, A.
    Viswanathan, R.
    BMC BIOINFORMATICS, 2022, 23 (01)