Hierarchical Molecular Graph Self-Supervised Learning for property prediction

被引:0
|
作者
Xuan Zang
Xianbing Zhao
Buzhou Tang
机构
[1] Harbin Institute of Technology,Department of Computer Science
[2] Pengcheng Laboratory,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Molecular graph representation learning has shown considerable strength in molecular analysis and drug discovery. Due to the difficulty of obtaining molecular property labels, pre-training models based on self-supervised learning has become increasingly popular in molecular representation learning. Notably, Graph Neural Networks (GNN) are employed as the backbones to encode implicit representations of molecules in most existing works. However, vanilla GNN encoders ignore chemical structural information and functions implied in molecular motifs, and obtaining the graph-level representation via the READOUT function hinders the interaction of graph and node representations. In this paper, we propose Hierarchical Molecular Graph Self-supervised Learning (HiMol), which introduces a pre-training framework to learn molecule representation for property prediction. First, we present a Hierarchical Molecular Graph Neural Network (HMGNN), which encodes motif structure and extracts node-motif-graph hierarchical molecular representations. Then, we introduce Multi-level Self-supervised Pre-training (MSP), in which corresponding multi-level generative and predictive tasks are designed as self-supervised signals of HiMol model. Finally, superior molecular property prediction results on both classification and regression tasks demonstrate the effectiveness of HiMol. Moreover, the visualization performance in the downstream dataset shows that the molecule representations learned by HiMol can capture chemical semantic information and properties.
引用
收藏
相关论文
共 50 条
  • [31] Learning self-supervised molecular representations for drug–drug interaction prediction
    Rogia Kpanou
    Patrick Dallaire
    Elsa Rousseau
    Jacques Corbeil
    BMC Bioinformatics, 25
  • [32] Self-Supervised Pre-Training via Multi-View Graph Information Bottleneck for Molecular Property Prediction
    Zang, Xuan
    Zhang, Junjie
    Tang, Buzhou
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (12) : 7659 - 7669
  • [33] Self-Supervised Contrastive Molecular Representation Learning with a Chemical Synthesis Knowledge Graph
    Xie, Jiancong
    Wang, Yi
    Rao, Jiahua
    Zheng, Shuangjia
    Yang, Yuedong
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (06) : 1945 - 1954
  • [34] Self-supervised Learning for Unintentional Action Prediction
    Zatsarynna, Olga
    Abu Farha, Yazan
    Gall, Juergen
    PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 429 - 444
  • [35] Self-supervised Consensus Representation Learning for Attributed Graph
    Liu, Changshu
    Wen, Liangjian
    Kang, Zhao
    Luo, Guangchun
    Tian, Ling
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2654 - 2662
  • [36] Graph Multihead Attention Pooling with Self-Supervised Learning
    Wang, Yu
    Hu, Liang
    Wu, Yang
    Gao, Wanfu
    ENTROPY, 2022, 24 (12)
  • [37] Self-supervised graph representations with generative adversarial learning
    Sun, Xuecheng
    Wang, Zonghui
    Lu, Zheming
    Lu, Ziqian
    NEUROCOMPUTING, 2024, 592
  • [38] Hierarchical Self-supervised Representation Learning for Movie Understanding
    Xiao, Fanyi
    Kundu, Kaustav
    Tighe, Joseph
    Modolo, Davide
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9717 - 9726
  • [39] SHERLock: Self-Supervised Hierarchical Event Representation Learning
    Roychowdhury, S.
    Sontakke, S. A.
    Itti, L.
    Sarkar, M.
    Aggarwal, M.
    Badjatiya, P.
    Puri, N.
    Krishnamurthy, B.
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2672 - 2678
  • [40] Self-supervised Graph Representation Learning with Variational Inference
    Liao, Zihan
    Liang, Wenxin
    Liu, Han
    Mu, Jie
    Zhang, Xianchao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT III, 2021, 12714 : 116 - 127