Machine-learning assisted molecular formula assignment to high-resolution mass spectrometry data of dissolved organic matter

被引:4
|
作者
Pan, Qiong [1 ]
Hu, Wenya [1 ]
He, Ding [2 ,3 ]
He, Chen [1 ]
Zhang, Linzhou [1 ]
Shi, Quan [1 ]
机构
[1] China Univ Petr, Petr Mol Engn Ctr PMEC, State Key Lab Heavy Oil Proc, Beijing 102249, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Ocean Sci, Hong Kong 999077, Peoples R China
[3] Hong Kong Univ Sci & Technol, Southern Marine Sci & Engn Guangdong Lab Guangzhou, Hong Kong Branch, Hong Kong 999077, Peoples R China
基金
中国国家自然科学基金;
关键词
FT-ICR MS; Orbitrap MS; Molecular formula assignment; Dissolved organic matter; SOLID-PHASE EXTRACTION; FULVIC-ACIDS; IONIZATION; SPECTRA; RIVER; BAY; CLASSIFICATION; FRAGMENTATION; VISUALIZATION; ACCURACY;
D O I
10.1016/j.talanta.2023.124484
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
High-resolution mass spectrometry (HRMS) provides molecular compositional information of dissolved organic matter (DOM) through isotopic assignment from the molecular mass. However, due to the inevitable deviation of molecular mass measurement and the limitation of resolving power, multiple possible solutions frequently occur for a given molecular mass. Lowering the mass deviation threshold and adding assignment restriction rules are often applied to exclude the incorrect solutions, which generally involves time-consuming manual postprocessing of mass data. To improve the result accuracy in an automated manner, we developed a molecular formula assignment algorithm based on machine-learning technology. The method integrated a logistic regression model using manually corrected isotopic composition and the peak features of HRMS data (m/z, signal-tonoise ratio, isotope type, and number, etc.) as training data. The developed model can evaluate the correctness of a candidate formula for the given mass peak based on the peak features. The method was verified by various DOM samples FT-ICR MS data (direct infusion negative mode electrospray), achieving a similar to 90% accuracy (compared to the traditional approach) for formula assignment. The method was applied to a series of NOM samples and showed a significant improvement in formula assignment compared with the mass matching method.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Validation and Evaluation of High-Resolution Orbitrap Mass Spectrometry on Molecular Characterization of Dissolved Organic Matter
    Pan, Qiong
    Zhuo, Xiaocun
    He, Chen
    Zhang, Yahe
    Shi, Quan
    [J]. ACS OMEGA, 2020, 5 (10): : 5372 - 5379
  • [2] Fundamentals of molecular formula assignment to ultrahigh resolution mass data of natural organic matter
    Koch, Boris P.
    Dittmar, Thorsten
    Witt, Matthias
    Kattner, Gerhard
    [J]. ANALYTICAL CHEMISTRY, 2007, 79 (04) : 1758 - 1763
  • [3] A non-targeted high-resolution mass spectrometry data analysis of dissolved organic matter in wastewater treatment
    Verkh, Yaroslav
    Rozman, Marko
    Petrovic, Mira
    [J]. CHEMOSPHERE, 2018, 200 : 397 - 404
  • [4] Quantifying Stochastic Processes in Shaping Dissolved Organic Matter Pool with High-Resolution Mass Spectrometry
    She, Zhixiang
    Wang, Jin
    Wang, Shu
    He, Chen
    Jiang, Zhengfeng
    Pan, Xin
    Shi, Quan
    Yue, Zhengbo
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2023, 57 (43) : 16361 - 16371
  • [5] Molecular dissolved organic matter removal by cotton-based adsorbents and characterization using high-resolution mass spectrometry
    Rakruam, Pharkphum
    Thuptimdang, Pumis
    Siripattanakul-Ratpukdi, Sumana
    Phungsai, Phanwatt
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 754
  • [6] High-Resolution Liquid Chromatography Tandem Mass Spectrometry Enables Large Scale Molecular Characterization of Dissolved Organic Matter
    Petras, Daniel
    Koester, Irina
    Da Silva, Ricardo
    Stephens, Brandon M.
    Haas, Andreas F.
    Nelson, Craig E.
    Kelly, Linda W.
    Aluwihare, Lihini I.
    Dorrestein, Pieter C.
    [J]. FRONTIERS IN MARINE SCIENCE, 2017, 4
  • [7] Molecular Characterization of Dissolved Organic Matter through a Desalination Process by High Resolution Mass Spectrometry
    Cortes-Francisco, Nuria
    Caixach, Josep
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2013, 47 (17) : 9619 - 9627
  • [8] Regular Tetrahedron Model for the Assessment of High-Resolution Mass Spectrometry Data of Four-Way Fractionated Dissolved Organic Matter
    Qiu, Junjie
    Lu, Fan
    Li, Xiao
    Zhang, Hua
    Xu, Bin
    He, Pin-Jing
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2024, 58 (26) : 11685 - 11694
  • [9] Fully Automated Unconstrained Analysis of High-Resolution Mass Spectrometry Data with Machine Learning
    Boiko, Daniil A.
    Kozlov, Konstantin S.
    V. Burykina, Julia
    Ilyushenkova, Valentina V.
    Ananikov, Valentine P.
    [J]. JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2022, 144 (32) : 14590 - 14606
  • [10] An Interpretable Machine-learning Framework for Modeling High-resolution Spectroscopic Data*
    Gully-Santiago, Michael
    Morley, Caroline V.
    [J]. ASTROPHYSICAL JOURNAL, 2022, 941 (02):