Unique identifiers for small molecules enable rigorous labeling of their atoms

被引:25
|
作者
Dashti, Hesam [1 ]
Westler, William M. [1 ]
Markley, John L. [1 ]
Eghbalnia, Hamid R. [1 ]
机构
[1] Univ Wisconsin, Dept Biochem, Natl Magnet Resonance Facil Madison, Madison, WI 53706 USA
基金
美国国家卫生研究院;
关键词
CHEMICAL LANGUAGE; NMR-SPECTROSCOPY; DATABASE; METABOLITES; DISCOVERY; SMILES; METABOLOMICS; INFORMATION; COLLECTION; GENERATION;
D O I
10.1038/sdata.2017.73
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Rigorous characterization of small organic molecules in terms of their structural and biological properties is vital to biomedical research. The three-dimensional structure of a molecule, its 'photo ID', is inefficient for searching and matching tasks. Instead, identifiers play a key role in accessing compound data. Unique and reproducible molecule and atom identifiers are required to ensure the correct cross-referencing of properties associated with compounds archived in databases. The best approach to this requirement is the International Chemical Identifier (InChI). However, the current implementation of InChI fails to provide a complete standard for atom nomenclature, and incorrect use of the InChI standard has resulted in the proliferation of non-unique identifiers. We propose a methodology and associated software tools, named ALATIS, that overcomes these shortcomings. ALATIS is an adaptation of InChI, which operates fully within the InChI convention to provide unique and reproducible molecule and all atom identifiers. ALATIS includes an InChI extension for unique atom labeling of symmetric molecules. ALATIS forms the basis for improving reproducibility and unifying cross-referencing across databases.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Unique identifiers for small molecules enable rigorous labeling of their atoms
    Hesam Dashti
    William M. Westler
    John L. Markley
    Hamid R. Eghbalnia
    [J]. Scientific Data, 4
  • [2] Counting absolute numbers of molecules using unique molecular identifiers
    Kivioja, Teemu
    Vaharautio, Anna
    Karlsson, Kasper
    Bonke, Martin
    Enge, Martin
    Linnarsson, Sten
    Taipale, Jussi
    [J]. NATURE METHODS, 2012, 9 (01) : 72 - U183
  • [3] Counting absolute numbers of molecules using unique molecular identifiers
    Teemu Kivioja
    Anna Vähärautio
    Kasper Karlsson
    Martin Bonke
    Martin Enge
    Sten Linnarsson
    Jussi Taipale
    [J]. Nature Methods, 2012, 9 (1) : 72 - 74
  • [4] LABELING OF SMALL MOLECULES WITH FLUORESCEIN
    SIMPSON, I
    [J]. ANALYTICAL BIOCHEMISTRY, 1978, 89 (01) : 304 - 305
  • [5] LABELING OF SMALL MOLECULES WITH RADIOIODINE
    HADI, UAM
    MALCOLMELAWES, DJ
    OLDHAM, G
    [J]. INTERNATIONAL JOURNAL OF APPLIED RADIATION AND ISOTOPES, 1978, 29 (11): : 621 - 623
  • [6] TRUmiCount: correctly counting absolute numbers of molecules using unique molecular identifiers
    Pflug, Florian G.
    von Haeseler, Arndt
    [J]. BIOINFORMATICS, 2018, 34 (18) : 3137 - 3144
  • [7] Unique Atoms and the Identification of the Symmetry Elements of Molecules
    Graham, John P.
    [J]. JOURNAL OF CHEMICAL EDUCATION, 2011, 88 (07) : 1010 - 1011
  • [8] REACTIONS OF ATOMS AND SMALL MOLECULES
    DONOVAN, RJ
    HUSAIN, D
    KIRSCH, LJ
    [J]. ANNUAL REPORTS ON THE PROGRESS OF CHEMISTRY SECTION A-PHYSICAL & INORGANIC CHEMISTRY, 1972, 69 : 19 - 74
  • [9] Revisiting the Foundations of the Quantum Theory of Atoms in Molecules: Toward a Rigorous Definition of Topological Atoms
    Nasertayoob, Payam
    Shahbazian, Shant
    [J]. INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2009, 109 (04) : 726 - 732
  • [10] NEW APPROACH TO LABELING SMALL MOLECULES FOR RIA
    SLAUNWHITE, WR
    TANTCHOU, JK
    [J]. CLINICAL CHEMISTRY, 1979, 25 (06) : 1154 - 1154