Simulated LC-MS Data Set for Assessing the Metabolomics Data Processing Pipeline Implemented into MVAPACK

Cited by: 0
Authors
Jurich, Christopher P. [1]
Jeppesen, Micah J. [1, 2]
Sakallioglu, Isin T. [1]
Leite, Aline De Lima [1, 2]
Yesselman, Joseph D. [1, 2]
Powers, Robert [1, 2]
Affiliations
[1] Univ Nebraska Lincoln, Dept Chem, Lincoln, NE 68588 USA
[2] Univ Nebraska Lincoln, Nebraska Ctr Integrated Biomol Commun, Lincoln, NE 68588 USA
Funding
National Science Foundation (USA); National Institutes of Health (USA)
Keywords
D-CYCLOSERINE; NMR; STRATEGIES; TOOL;
DOI
10.1021/acs.analchem.3c04979
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Metabolomics commonly relies on using one-dimensional (1D) ¹H NMR spectroscopy or liquid chromatography-mass spectrometry (LC-MS) to derive scientific insights from large collections of biological samples. NMR and MS approaches to metabolomics require, among other issues, a data processing pipeline. Quantitative assessment of the performance of these software platforms is challenged by a lack of standardized data sets with "known" outcomes. To resolve this issue, we created a novel simulated LC-MS data set with known peak locations and intensities, defined metabolite differences between groups (i.e., fold change > 2, coefficient of variation ≤ 25%), and different amounts of added Gaussian noise (0, 5, or 10%) and missing features (0, 10, or 20%). This data set was developed to improve benchmarking of existing LC-MS metabolomics software and to validate the updated version of our MVAPACK software, which added gas chromatography-MS and LC-MS functionality to its existing 1D and two-dimensional NMR data processing capabilities. We also included two experimental LC-MS data sets acquired from a standard mixture and Mycobacterium smegmatis cell lysates since a simulated data set alone may not capture all the unique characteristics and variability of real spectra needed to assess software performance properly. Our simulated and experimental LC-MS data sets were processed with the MS-DIAL and XCMSOnline software packages and our MVAPACK toolkit to showcase the utility of our data sets to benchmark MVAPACK against community standards. Our results demonstrate the enhanced objectivity and clarity of software assessment that can be achieved when both simulated and experimental data are employed since distinctly different software performances were observed with the simulated and experimental LC-MS data sets. We also demonstrate that the performance of MVAPACK is equivalent to or exceeds existing LC-MS software programs while providing a single platform for processing and analyzing both NMR and MS data sets.
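The simulation design summarized in the abstract (known group differences, added Gaussian noise, and a controlled fraction of missing values) can be illustrated with a minimal Python sketch. This is not the authors' generator and does not use MVAPACK, MS-DIAL, or XCMSOnline; the function names `simulate_feature_table` and `score_detection`, the intensity range, and every default value are hypothetical. It also assumes that "missing features" means a fraction of individual intensity values set to missing, and that the coefficient of variation describes within-group spread; the paper may define both differently.

```python
import random


def simulate_feature_table(n_features=200, n_per_group=10, frac_changed=0.1,
                           fold_change=2.5, cv=0.20, noise_frac=0.05,
                           missing_frac=0.10, seed=0):
    """Build a two-group feature table with a known ground truth.

    Returns (table, changed), where table[i][j] is the intensity of
    feature i in sample j (None if missing) and `changed` is the set of
    feature indices whose group-2 mean was multiplied by `fold_change`.
    All parameters are illustrative assumptions, not values from the paper
    beyond the ranges quoted in the abstract.
    """
    rng = random.Random(seed)
    n_changed = int(round(n_features * frac_changed))
    changed = set(rng.sample(range(n_features), n_changed))

    table = []
    for i in range(n_features):
        base = rng.uniform(1e4, 1e6)  # hypothetical baseline intensity
        row = []
        for group in (0, 1):
            mean = base * fold_change if (group == 1 and i in changed) else base
            for _ in range(n_per_group):
                value = rng.gauss(mean, cv * mean)           # within-group spread
                value += rng.gauss(0.0, noise_frac * mean)   # added Gaussian noise
                row.append(max(value, 0.0))                  # intensities stay non-negative
        table.append(row)

    # Remove a fixed fraction of intensity values at random.
    cells = [(i, j) for i in range(n_features) for j in range(2 * n_per_group)]
    n_missing = int(round(missing_frac * len(cells)))
    for i, j in rng.sample(cells, n_missing):
        table[i][j] = None
    return table, changed


def score_detection(reported, truth):
    """Precision and recall of a tool's reported features against the truth."""
    reported = set(reported)
    true_positives = len(reported & truth)
    precision = true_positives / len(reported) if reported else 0.0
    recall = true_positives / len(truth) if truth else 0.0
    return precision, recall
```

Because the set of altered features is known in advance, the features reported by any processing pipeline can be passed to `score_detection` to obtain precision and recall at each noise and missing-value level, which is the kind of objective comparison that a data set with "known" outcomes makes possible.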
Pages: 12943-12956
Page count: 14