HSQC Spectra Simulation and Matching for Molecular Identification

被引:2
|
作者
Priessner, Martin [1 ]
Lewis, Richard J. [2 ]
Johansson, Magnus J. [1 ]
Goodman, Jonathan M. [3 ]
Janet, Jon Paul [4 ]
Tomberg, Anna [1 ]
机构
[1] AstraZeneca, BioPharmaceut R&D, Med Chem Res & Early Dev, Cardiovasc Renal & Metab CVRM, S-43183 Molndal, Sweden
[2] AstraZeneca, Dept Med Chem Res & Early Dev, Resp & Immunol, BioPharmaceut R&D, S-43183 Molndal, Sweden
[3] Univ Cambridge, Ctr Mol Informat, Yusuf Hamied Dept Chem, Cambridge CB2 1EW, England
[4] AstraZeneca, Mol AI, Discovery Sci, R&D, S-43183 Molndal, Sweden
关键词
STRUCTURAL REVISION; NATURAL-PRODUCTS; CHEMICAL-SHIFTS; VALIDATION; PREDICTION; DP4;
D O I
10.1021/acs.jcim.3c01735
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
In the pursuit of improved compound identification and database search tasks, this study explores heteronuclear single quantum coherence (HSQC) spectra simulation and matching methodologies. HSQC spectra serve as unique molecular fingerprints, enabling a valuable balance of data collection time and information richness. We conducted a comprehensive evaluation of the following four HSQC simulation techniques: ACD/Labs (ACD), MestReNova (MNova), Gaussian NMR calculations (DFT), and a graph-based neural network (ML). For the latter two techniques, we developed a reconstruction logic to combine proton and carbon 1D spectra into HSQC spectra. The methodology involved the implementation of three peak-matching strategies (minimum-sum, Euclidean-distance, and Hungarian distance) combined with three padding strategies (zero-padding, peak-truncated, and nearest-neighbor double assignment). We found that coupling these strategies with a robust simulation technique facilitates the accurate identification of correct molecules from similar analogues (regio- and stereoisomers) and allows for fast and accurate large database searches. Furthermore, we demonstrated the efficacy of the best-performing methodology by rectifying the structures of a set of previously misidentified molecules. This research indicates that effective HSQC spectral simulation and matching methodologies significantly facilitate molecular structure elucidation. Furthermore, we offer a Google Colab notebook for researchers to use our methods on their own data (https://github.com/AstraZeneca/hsqc_structure_elucidation.git).
引用
收藏
页码:3180 / 3191
页数:12
相关论文
共 50 条
  • [1] IDENTIFICATION OF MOLECULAR SPECTRA
    ZANON, I
    NUOVO CIMENTO, 1965, 38 (01): : 691 - +
  • [2] IDENTIFICATION OF MOLECULAR SPECTRA
    不详
    REVUE D OPTIQUE THEORIQUE ET INSTRUMENTALE, 1965, 44 (03): : 114 - &
  • [3] Removing unwanted signals from HSQC spectra
    Shaw, GL
    Stonehouse, J
    JOURNAL OF MAGNETIC RESONANCE SERIES B, 1996, 110 (01): : 91 - 95
  • [4] Unsymmetrical covariance processing of COSY or TOCSY and HSQC NMR data to obtain the equivalent of HSQC-COSY or HSQC-TOCSY spectra
    Blinov, KA
    Larin, NI
    Williams, AJ
    Mills, KA
    Martin, GE
    JOURNAL OF HETEROCYCLIC CHEMISTRY, 2006, 43 (01) : 163 - 166
  • [5] Getting the Most Out of HSQC and HMBC Spectra
    Reynolds, William F.
    Burns, Darcy C.
    ANNUAL REPORTS ON NMR SPECTROSCOPY, VOL 76, 2012, 76 : 1 - 21
  • [6] Improvement in protein HSQC spectra from addition of betaine
    O'Dea, Finn
    Seargeant, Aiden J.
    Hurcum, Jessica
    do Aido-Machado, Rodolpho
    Rowe, Michelle L.
    Baxter, Nicola J.
    Waltho, Jon P.
    Sayers, Jon R.
    Williamson, Mike P.
    JOURNAL OF BIOMOLECULAR NMR, 2025,
  • [7] A NOESY-HSQC simulation program, SPIRIT
    Leiming Zhu
    H. Jane Dyson
    Peter E. Wright
    Journal of Biomolecular NMR, 1998, 11 : 17 - 29
  • [8] A NOESY-HSQC simulation program, SPIRIT
    Zhu, LM
    Dyson, HJ
    Wright, PE
    JOURNAL OF BIOMOLECULAR NMR, 1998, 11 (01) : 17 - 29
  • [9] COMPOUND IDENTIFICATION BY COMPUTER MATCHING OF LOW RESOLUTION MASS SPECTRA
    KNOCK, BA
    SMITH, IC
    WRIGHT, DE
    RIDLEY, RG
    KELLY, W
    ANALYTICAL CHEMISTRY, 1970, 42 (13) : 1516 - &
  • [10] Experimental access to HSQC spectra decoupled in all frequency dimensions
    Sakhaii, Peyman
    Haase, Burkhard
    Bermel, Wolfgang
    JOURNAL OF MAGNETIC RESONANCE, 2009, 199 (02) : 192 - 198