Protein 3D Graph Structure Learning for Robust Structure-Based Protein Property Prediction

被引:0
|
作者
Huang, Yufei [1 ,2 ]
Li, Siyuan [1 ,2 ]
Wu, Lirong [1 ,2 ]
Su, Jin [1 ,2 ]
Lin, Haitao [1 ,2 ]
Zhang, Odin [1 ]
Liu, Zihan [1 ,2 ]
Gao, Zhangyang [1 ,2 ]
Zheng, Jiangbin [1 ,2 ]
Li, Stan Z. [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Westlake Univ, AI Lab, Res Ctr Ind Future, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein structure-based property prediction has emerged as a promising approach for various biological tasks, such as protein function prediction and sub-cellular location estimation. The existing methods highly rely on experimental protein structure data and fail in scenarios where these data are unavailable. Predicted protein structures from AI tools (e.g., AlphaFold2) were utilized as alternatives. However, we observed that current practices, which simply employ accurately predicted structures during inference, suffer from notable degradation in prediction accuracy. While similar phenomena have been extensively studied in general fields (e.g., Computer Vision) as model robustness, their impact on protein property prediction remains unexplored. In this paper, we first investigate the reason behind the performance decrease when utilizing predicted structures, attributing it to the structure embedding bias from the perspective of structure representation learning. To study this problem, we identify a Protein 3D Graph Structure Learning Problem for Robust Protein Property Prediction (PGSL-RP3), collect benchmark datasets, and present a protein Structure embedding Alignment Optimization framework (SAO) to mitigate the problem of structure embedding bias between the predicted and experimental protein structures. Extensive experiments have shown that our framework is model-agnostic and effective in improving the property prediction of both predicted structures and experimental structures.
引用
收藏
页码:12662 / 12670
页数:9
相关论文
共 50 条
  • [41] Structure-based prediction of protein-protein and protein-ligand interactions on a genomic scale
    Garzon, Jose
    Hwang, Howook
    Dey, Fabian
    Murray, Diana
    Petrey, Donald
    Honig, Barry
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [42] Structure-based prediction of protein-peptide specificity in Rosetta
    King, Christopher A.
    Bradley, Philip
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2010, 78 (16) : 3437 - 3449
  • [43] Structure-Based Prediction of Protein-Folding Transition Paths
    Jacobs, William M.
    Shakhnovich, Eugene I.
    BIOPHYSICAL JOURNAL, 2016, 111 (05) : 925 - 936
  • [44] Structure-based prediction of host-pathogen protein interactions
    Mariano, Rachelle
    Wuchty, Stefan
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2017, 44 : 119 - 124
  • [45] Sequence and structure-based prediction of eukaryotic protein phosphorylation sites
    Blom, N
    Gammeltoft, S
    Brunak, S
    JOURNAL OF MOLECULAR BIOLOGY, 1999, 294 (05) : 1351 - 1362
  • [46] Structure-based prediction of the effects of a missense variant on protein stability
    Yang, Yang
    Chen, Biao
    Tan, Ge
    Vihinen, Mauno
    Shen, Bairong
    AMINO ACIDS, 2013, 44 (03) : 847 - 855
  • [47] Structure-based prediction of the effects of a missense variant on protein stability
    Yang Yang
    Biao Chen
    Ge Tan
    Mauno Vihinen
    Bairong Shen
    Amino Acids, 2013, 44 : 847 - 855
  • [48] STRUCTURE-BASED METHODS FOR COMPUTATIONAL PROTEIN FUNCTIONAL SITE PREDICTION
    Dukka, B. K. C.
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2013, 8 (11):
  • [49] MAESTROweb: a web server for structure-based protein stability prediction
    Laimer, Josef
    Hiebl-Flach, Julia
    Lengauer, Daniel
    Lackner, Peter
    BIOINFORMATICS, 2016, 32 (09) : 1414 - 1416
  • [50] Stochastic Fractal Search Algorithm for 3D Protein Structure Prediction
    Sun, Chuan
    Wei, Zi-qi
    Zhou, Chang-jun
    Wang, Bin
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER SCIENCE (AICS 2016), 2016, : 156 - 160