The METLIN small molecule dataset for machine learning-based retention time prediction

被引:0
|
作者
Xavier Domingo-Almenara
Carlos Guijas
Elizabeth Billings
J. Rafael Montenegro-Burke
Winnie Uritboonthai
Aries E. Aisporna
Emily Chen
H. Paul Benton
Gary Siuzdak
机构
[1] The Scripps Research Institute,Scripps Center for Metabolomics
[2] The Scripps Research Institute,California Institute for Biomedical Research (Calibr)
[3] The Scripps Research Institute,Department of Integrative Structural and Computational Biology
[4] EURECAT – Technology Centre of Catalonia & Rovira i Virgili University joint unit,Centre for Omic Sciences
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning has been extensively applied in small molecule analysis to predict a wide range of molecular properties and processes including mass spectrometry fragmentation or chromatographic retention time. However, current approaches for retention time prediction lack sufficient accuracy due to limited available experimental data. Here we introduce the METLIN small molecule retention time (SMRT) dataset, an experimentally acquired reverse-phase chromatography retention time dataset covering up to 80,038 small molecules. To demonstrate the utility of this dataset, we deployed a deep learning model for retention time prediction applied to small molecule annotation. Results showed that in 70%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\%$$\end{document} of the cases, the correct molecular identity was ranked among the top 3 candidates based on their predicted retention time. We anticipate that this dataset will enable the community to apply machine learning or first principles strategies to generate better models for retention time prediction.
引用
下载
收藏
相关论文
共 50 条
  • [41] Machine Learning-Based Prediction of the Excitation Wavelength of Phosphors
    Sahu, Sunil K.
    Shrivastav, Anil
    Swamy, N. K.
    Dubey, Vikas
    Halwar, D. K.
    Kumar, M. Tanooj
    Rao, M. C.
    JOURNAL OF APPLIED SPECTROSCOPY, 2024, 91 (03) : 669 - 677
  • [42] Machine Learning-Based Link Prediction for Hotel Network
    Sevim, Yiğit
    Orman, Günce Keziban
    Yöndem, Meltem Turhan
    IAENG International Journal of Computer Science, 2022, 49 (04)
  • [43] Machine Learning-based Corporate Socia Responsibility Prediction
    Teoh, T-T
    Heng, Q. K.
    Chia, J. J.
    Shie, J. M.
    Liaw, S. W.
    Yang, M.
    Nguwi, Y-Y
    PROCEEDINGS OF THE IEEE 2019 9TH INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) ROBOTICS, AUTOMATION AND MECHATRONICS (RAM) (CIS & RAM 2019), 2019, : 501 - 505
  • [44] Machine Learning-based Pin Accessibility Prediction and Application
    Fang, Shao-Yun
    2021 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2021,
  • [45] Machine Learning-based RSSI Prediction in Factory Environments
    Webber, Julian
    Suga, Norisato
    Ano, Susumu
    Jou, Yafei
    Mehbodniya, Abolfazl
    Higashimori, Toshihide
    Yano, Kazuto
    Suzuki, Yoshinori
    PROCEEDINGS OF 2019 25TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC), 2019, : 195 - 200
  • [46] Machine learning-based approaches for disease gene prediction
    Duc-Hau Le
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2020, 19 (5-6) : 350 - 363
  • [47] Machine learning-based seawater concentration pathway prediction
    Hu, Fang
    Xu, Xingyong
    Liang, Jun
    Yang, Changguo
    Huang, Mingfang
    Su, Qiao
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 94
  • [48] A Machine Learning-based Approach for The Prediction of Electricity Consumption
    Dinh Hoa Nguyen
    Anh Tung Nguyen
    2019 12TH ASIAN CONTROL CONFERENCE (ASCC), 2019, : 1301 - 1306
  • [49] Machine Learning-Based Prediction of Antiferromagnetic Skyrmion Formation
    Saini, Shipra
    Shukla, Alok Kumar
    Nehete, Hemkant
    Bindal, Namita
    Kaushik, Brajesh Kumar
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2024, 71 (04) : 2774 - 2780
  • [50] Machine learning-based prediction models for postpartum hemorrhage
    Venkatesh, Kartik K.
    Strauss, Robert
    Grotegut, Chad
    Heine, Phillips
    Stamilio, David M.
    Menard, Kathryn
    Jelovsek, Eric
    AMERICAN JOURNAL OF OBSTETRICS AND GYNECOLOGY, 2020, 222 (01) : S175 - S176