Simulated LC-MS Data Set for Assessing the Metabolomics Data Processing Pipeline Implemented into MVAPACK

被引:0
|
作者
Jurich, Christopher P. [1 ]
Jeppesen, Micah J. [1 ,2 ]
Sakallioglu, Isin T. [1 ]
Leite, Aline De Lima [1 ,2 ]
Yesselman, Joseph D. [1 ,2 ]
Powers, Robert [1 ,2 ]
机构
[1] Univ Nebraska Lincoln, Dept Chem, Lincoln, NE 68588 USA
[2] Univ Nebraska Lincoln, Nebraska Ctr Integrated Biomol Commun, Lincoln, NE 68588 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D-CYCLOSERINE; NMR; STRATEGIES; TOOL;
D O I
10.1021/acs.analchem.3c04979
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Metabolomics commonly relies on using one-dimensional (1D) H-1 NMR spectroscopy or liquid chromatography-mass spectrometry (LC-MS) to derive scientific insights from large collections of biological samples. NMR and MS approaches to metabolomics require, among other issues, a data processing pipeline. Quantitative assessment of the performance of these software platforms is challenged by a lack of standardized data sets with "known" outcomes. To resolve this issue, we created a novel simulated LC-MS data set with known peak locations and intensities, defined metabolite differences between groups (i.e., fold change > 2, coefficient of variation <= 25%), and different amounts of added Gaussian noise (0, 5, or 10%) and missing features (0, 10, or 20%). This data set was developed to improve benchmarking of existing LC-MS metabolomics software and to validate the updated version of our MVAPACK software, which added gas chromatography-MS and LC-MS functionality to its existing 1D and two-dimensional NMR data processing capabilities. We also included two experimental LC-MS data sets acquired from a standard mixture andMycobacterium smegmatiscell lysates since a simulated data set alone may not capture all the unique characteristics and variability of real spectra needed to assess software performance properly. Our simulated and experimental LC-MS data sets were processed with the MS-DIAL and XCMSOnline software packages and our MVAPACK toolkit to showcase the utility of our data sets to benchmark MVAPACK against community standards. Our results demonstrate the enhanced objectivity and clarity of software assessment that can be achieved when both simulated and experimental data are employed since distinctly different software performances were observed with the simulated and experimental LC-MS data sets. We also demonstrate that the performance of MVAPACK is equivalent to or exceeds existing LC-MS software programs while providing a single platform for processing and analyzing both NMR and MS data sets.
引用
收藏
页码:12943 / 12956
页数:14
相关论文
共 50 条
  • [41] Automated Annotation of Untargeted All-Ion Fragmentation LC-MS Metabolomics Data with MetaboAnnotatoR
    Graca, Goncalo
    Cai, Yuheng
    Lau, Chung-Ho E.
    Vorkas, Panagiotis A.
    Lewis, Matthew R.
    Want, Elizabeth J.
    Herrington, David
    Ebbels, Timothy M. D.
    ANALYTICAL CHEMISTRY, 2022, 94 (08) : 3446 - 3455
  • [42] PiMP my metabolome: an integrated, web-based tool for LC-MS metabolomics data
    Gloaguen, Yoann
    Morton, Fraser
    Daly, Ronan
    Gurden, Ross
    Rogers, Simon
    Wandy, Joe
    Wilson, David
    Barrett, Michael
    Burgess, Karl
    BIOINFORMATICS, 2017, 33 (24) : 4007 - 4009
  • [43] Heterogeneous multimeric metabolite ion species observed in LC-MS based metabolomics data sets
    El Abiead, Yasin
    Bueschl, Christoph
    Panzenboeck, Lisa
    Wang, Mingxun
    Doppler, Maria
    Seidl, Bernhard
    Zanghellini, Juergen
    Dorrestein, Pieter C.
    Koellensperger, Gunda
    ANALYTICA CHIMICA ACTA, 2022, 1229
  • [44] Assessing specialized metabolite diversity of Alnus species by a digitized LC-MS/MS data analysis workflow
    Bin Kang, Kyo
    Woo, Sunmin
    Ernst, Madeleine
    van der Hooft, Justin J. J.
    Nothias, Louis-Felix
    da Silva, Ricardo R.
    Dorrestein, Pieter C.
    Sung, Sang Hyun
    Lee, Mina
    PHYTOCHEMISTRY, 2020, 173
  • [45] Correction: Model-driven data curation pipeline for LC–MS-based untargeted metabolomics
    Gabriel Riquelme
    Emmanuel Ezequiel Bortolotto
    Matías Dombald
    María Eugenia Monge
    Metabolomics, 19
  • [46] Organization of GC/MS and LC/MS metabolomics data into chemical libraries
    Corey D DeHaven
    Anne M Evans
    Hongping Dai
    Kay A Lawton
    Journal of Cheminformatics, 2
  • [47] Organization of GC/MS and LC/MS metabolomics data into chemical libraries
    DeHaven, Corey D.
    Evans, Anne M.
    Dai, Hongping
    Lawton, Kay A.
    JOURNAL OF CHEMINFORMATICS, 2010, 2
  • [48] Improved quality control processing of peptide-centric LC-MS proteomics data
    Matzke, Melissa M.
    Waters, Katrina M.
    Metz, Thomas O.
    Jacobs, Jon M.
    Sims, Amy C.
    Baric, Ralph S.
    Pounds, Joel G.
    Webb-Robertson, Bobbie-Jo M.
    BIOINFORMATICS, 2011, 27 (20) : 2866 - 2872
  • [49] Granger causality in integrated GC-MS and LC-MS metabolomics data reveals the interface of primary and secondary metabolism
    Doerfler, Hannes
    Lyon, David
    Naegele, Thomas
    Sun, Xiaoliang
    Fragner, Lena
    Hadacek, Franz
    Egelhofer, Volker
    Weckwerth, Wolfram
    METABOLOMICS, 2013, 9 (03) : 564 - 574
  • [50] Enabling Efficient and Confident Annotation of LC-MS Metabolomics Data through MS1 Spectrum and Time Prediction
    Broeckling, Corey D.
    Ganna, Andrea
    Layer, Mark
    Brown, Kevin
    Sutton, Ben
    Ingelsson, Erik
    Peers, Graham
    Prenni, Jessica E.
    ANALYTICAL CHEMISTRY, 2016, 88 (18) : 9226 - 9234