SMICLR: Contrastive Learning on Multiple Molecular Representations for Semisupervised and Unsupervised Representation Learning

被引:17
|
作者
Pinheiro, Gabriel A. [1 ]
Silva, Juarez L. F. [2 ]
Quiles, Marcos G. [1 ]
机构
[1] Fed Univ Sao Paulo Unifesp, Inst Sci & Technol, BR-12247014 Sao Jose Dos Campos, SP, Brazil
[2] Univ Sao Paulo, Sao Carlos Inst Chem, BR-13560970 Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
PREDICTION; NETWORKS; LANGUAGE; MODELS; SMILES;
D O I
10.1021/acs.jcim.2c00521
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Machine learning as a tool for chemical space exploration broadens horizons to work with known and unknown molecules. At its core lies molecular representation, an essential key to improve learning about structure-property relationships. Recently, contrastive frameworks have been showing impressive results for representation learning in diverse domains. Therefore, this paper proposes a contrastive framework that embraces multimodal molecular data. Specifically, our approach jointly trains a graph encoder and an encoder for the simplified molecular-input line-entry system (SMILES) string to perform the contrastive learning objective. Since SMILES is the basis of our method, i.e., we built the molecular graph from the SMILES, we call our framework as SMILES Contrastive Learning (SMICLR). When stacking a nonlinear regressor on the SMICLR's pretrained encoder and fine-tuning the entire model, we reduced the prediction error by, on average, 44% and 25% for the energetic and electronic properties of the QM9 data set, respectively, over the supervised baseline. We further improved our framework's performance when applying data augmentations in each molecular-input representation. Moreover, SMICLR demonstrated competitive representation learning results in an unsupervised setting.
引用
收藏
页码:3948 / 3960
页数:13
相关论文
共 50 条
  • [41] Contrastive Learning of Generalized Game Representations
    Trivedi, Chintan
    Liapis, Antonios
    Yannakakis, Georgios N.
    2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 119 - 126
  • [42] Probabilistic Representations for Video Contrastive Learning
    Park, Jungin
    Lee, Jiyoung
    Kim, Ig-Jae
    Sohn, Kwanghoon
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14691 - 14701
  • [43] CONSS: Contrastive Learning Method for Semisupervised Seismic Facies Classification
    Li K.
    Liu W.
    Dou Y.
    Xu Z.
    Duan H.
    Jing R.
    IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2023, 16 : 7838 - 7849
  • [44] Parametric UMAP Embeddings for Representation and Semisupervised Learning
    Sainburg, Tim
    McInnes, Leland
    Gentner, Timothy Q.
    NEURAL COMPUTATION, 2021, 33 (11) : 2881 - 2907
  • [45] Contrastive Learning Models for Sentence Representations
    Xu, Lingling
    Xie, Haoran
    Li, Zongxi
    Wang, Fu Lee
    Wang, Weiming
    Li, Qing
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (04)
  • [46] A Contrastive Objective for Learning Disentangled Representations
    Kahana, Jonathan
    Hoshen, Yedid
    COMPUTER VISION, ECCV 2022, PT XXVI, 2022, 13686 : 579 - 595
  • [47] CLAR: Contrastive Learning of Auditory Representations
    Al-Tahan, Haider
    Mohsenzadeh, Yalda
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [48] Pseudolabeling Contrastive Learning for Semisupervised Hyperspectral and LiDAR Data Classification
    Li, Zhongwei
    Wang, Yuewen
    Wang, Leiquan
    Guo, Fangming
    Yang, Yajie
    Wei, Jie
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 17099 - 17116
  • [49] Progressive Semisupervised Learning of Multiple Classifiers
    Yu, Zhiwen
    Lu, Ye
    Zhang, Jun
    You, Jane
    Wong, Hau-San
    Wang, Yide
    Han, Guoqiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (02) : 689 - 702
  • [50] Federated unsupervised representation learning
    Zhang, Fengda
    Kuang, Kun
    Chen, Long
    You, Zhaoyang
    Shen, Tao
    Xiao, Jun
    Zhang, Yin
    Wu, Chao
    Wu, Fei
    Zhuang, Yueting
    Li, Xiaolin
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2023, 24 (08) : 1181 - 1193