Spectra to structure: contrastive learning framework for library ranking and generating molecular structures for infrared spectra

被引:0
|
作者
Kanakala, Ganesh Chandan [1 ]
Sridharan, Bhuvanesh [1 ]
Priyakumar, U. Deva [1 ]
机构
[1] Int Inst Informat Technol, Ctr Computat Nat Sci & Bioinformat, Hyderabad 500 032, India
来源
DIGITAL DISCOVERY | 2024年 / 3卷 / 12期
关键词
INFORMATION; SMILES;
D O I
10.1039/d4dd00135d
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Inferring complete molecular structure from infrared (IR) spectra is a challenging task. In this work, we propose SMEN (Spectra and Molecule Encoder Network), a framework for scoring molecules against given IR spectra. The proposed framework uses contrastive optimization to obtain similar embedding for a molecule and its spectra. For this study, we consider the QM9 dataset with molecules consisting of less than 9 heavy atoms and obtain simulated spectra. Using the proposed method, we can rank the molecules using embedding similarity and obtain a Top 1 accuracy of similar to 81%, Top 3 accuracy of similar to 96%, and Top 10 accuracy of similar to 99% on the evaluation set. We extend SMEN to build a generative transformer for a direct molecule prediction from IR spectra. The proposed method can significantly help molecule library ranking tasks and aid the problem of inferring molecular structures from spectra.
引用
收藏
页码:2417 / 2423
页数:7
相关论文
共 50 条
  • [1] Deep Learning for Generating Phase-Conditioned Infrared Spectra
    Na, Gyoung S.
    ANALYTICAL CHEMISTRY, 2024, 96 (49) : 19659 - 19669
  • [2] Extracting Infrared Spectra of Protein Secondary Structures Using a Library of Protein Spectra and the Ramachandran Plot
    Coe, James V.
    Nystrom, Steven V.
    Chen, Zhaomin
    Li, Ran
    Verreault, Dominique
    Hitchcock, Charles L.
    Martin, Edward W.
    Allen, Heather C.
    JOURNAL OF PHYSICAL CHEMISTRY B, 2015, 119 (41): : 13079 - 13092
  • [3] THE INFRARED SPECTRA AND THE MOLECULAR STRUCTURE OF PYRAMIDAL MOLECULES
    NIELSEN, HH
    JOURNAL OF CHEMICAL PHYSICS, 1953, 21 (01): : 142 - 144
  • [4] Molecular structure and infrared spectra of dimethyl oxalate
    Lopes, SB
    Lapinski, L
    Fausto, R
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2002, 4 (06) : 1014 - 1020
  • [5] The molecular structure and infrared and Raman spectra of SCCCS
    Xie, YM
    Schaefer, HF
    JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 1999, 460 (1-3): : 117 - 121
  • [6] Molecular structure and infrared spectra of dimethyl fumarate
    Lopes, S
    Lapinski, L
    Fausto, R
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2002, 4 (16) : 3965 - 3974
  • [7] Machine learning molecular dynamics for the simulation of infrared spectra
    Gastegger, Michael
    Behler, Joerg
    Marquetand, Philipp
    CHEMICAL SCIENCE, 2017, 8 (10) : 6924 - 6935
  • [8] INFRARED SPECTRA AND STRUCTURE OF MOLECULAR COMPLEXES OF AROMATIC ACIDS
    Belkov, M. V.
    Brinkevich, S. D.
    Samovich, S. N.
    Skornyakov, I. V.
    Tolstorozhev, G. B.
    Shadyro, O. I.
    JOURNAL OF APPLIED SPECTROSCOPY, 2012, 78 (06) : 794 - 801
  • [9] POLYWATER - MOLECULAR STRUCTURE AS DETERMINED BY INFRARED AND RAMAN SPECTRA
    STROMBERG, RR
    CROATICA CHEMICA ACTA, 1970, 42 (02) : 382 - +
  • [10] Infrared spectra and structure of molecular complexes of aromatic acids
    M. V. Belkov
    S. D. Brinkevich
    S. N. Samovich
    I. V. Skornyakov
    G. B. Tolstorozhev
    O. I. Shadyro
    Journal of Applied Spectroscopy, 2012, 78 : 794 - 801