Simulated LC-MS Data Set for Assessing the Metabolomics Data Processing Pipeline Implemented into MVAPACK

被引:0
|
作者
Jurich, Christopher P. [1 ]
Jeppesen, Micah J. [1 ,2 ]
Sakallioglu, Isin T. [1 ]
Leite, Aline De Lima [1 ,2 ]
Yesselman, Joseph D. [1 ,2 ]
Powers, Robert [1 ,2 ]
机构
[1] Univ Nebraska Lincoln, Dept Chem, Lincoln, NE 68588 USA
[2] Univ Nebraska Lincoln, Nebraska Ctr Integrated Biomol Commun, Lincoln, NE 68588 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D-CYCLOSERINE; NMR; STRATEGIES; TOOL;
D O I
10.1021/acs.analchem.3c04979
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Metabolomics commonly relies on using one-dimensional (1D) H-1 NMR spectroscopy or liquid chromatography-mass spectrometry (LC-MS) to derive scientific insights from large collections of biological samples. NMR and MS approaches to metabolomics require, among other issues, a data processing pipeline. Quantitative assessment of the performance of these software platforms is challenged by a lack of standardized data sets with "known" outcomes. To resolve this issue, we created a novel simulated LC-MS data set with known peak locations and intensities, defined metabolite differences between groups (i.e., fold change > 2, coefficient of variation <= 25%), and different amounts of added Gaussian noise (0, 5, or 10%) and missing features (0, 10, or 20%). This data set was developed to improve benchmarking of existing LC-MS metabolomics software and to validate the updated version of our MVAPACK software, which added gas chromatography-MS and LC-MS functionality to its existing 1D and two-dimensional NMR data processing capabilities. We also included two experimental LC-MS data sets acquired from a standard mixture andMycobacterium smegmatiscell lysates since a simulated data set alone may not capture all the unique characteristics and variability of real spectra needed to assess software performance properly. Our simulated and experimental LC-MS data sets were processed with the MS-DIAL and XCMSOnline software packages and our MVAPACK toolkit to showcase the utility of our data sets to benchmark MVAPACK against community standards. Our results demonstrate the enhanced objectivity and clarity of software assessment that can be achieved when both simulated and experimental data are employed since distinctly different software performances were observed with the simulated and experimental LC-MS data sets. We also demonstrate that the performance of MVAPACK is equivalent to or exceeds existing LC-MS software programs while providing a single platform for processing and analyzing both NMR and MS data sets.
引用
收藏
页码:12943 / 12956
页数:14
相关论文
共 50 条
  • [1] MassCascade: Visual Programming for LC-MS Data Processing in Metabolomics
    Beisken, Stephan
    Earll, Mark
    Portwood, David
    Seymour, Mark
    Steinbeck, Christoph
    MOLECULAR INFORMATICS, 2014, 33 (04) : 307 - 310
  • [2] Trackable and scalable LC-MS metabolomics data processing using asari
    Li, Shuzhao
    Siddiqa, Amnah
    Thapa, Maheshwor
    Chi, Yuanye
    Zheng, Shujian
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [3] Trackable and scalable LC-MS metabolomics data processing using asari
    Shuzhao Li
    Amnah Siddiqa
    Maheshwor Thapa
    Yuanye Chi
    Shujian Zheng
    Nature Communications, 14
  • [4] Strategy for Optimizing LC-MS Data Processing in Metabolomics: A Design of Experiments Approach
    Eliasson, Mattias
    Rannar, Stefan
    Madsen, Rasmus
    Donten, Magdalena A.
    Marsden-Edwards, Emma
    Moritz, Thomas
    Shockcor, John P.
    Johansson, Erik
    Trygg, Johan
    ANALYTICAL CHEMISTRY, 2012, 84 (15) : 6869 - 6876
  • [5] Algorithms and tools for the preprocessing of LC-MS metabolomics data
    Castillo, Sandra
    Gopalacharyulu, Peddinti
    Yetukuri, Laxman
    Oresic, Matej
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2011, 108 (01) : 23 - 32
  • [6] Filtering procedures for untargeted LC-MS metabolomics data
    Schiffman, Courtney
    Petrick, Lauren
    Perttula, Kelsi
    Yano, Yukiko
    Carlsson, Henrik
    Whitehead, Todd
    Metayer, Catherine
    Hayes, Josie
    Rappaport, Stephen
    Dudoit, Sandrine
    BMC BIOINFORMATICS, 2019, 20 (1)
  • [7] Filtering procedures for untargeted LC-MS metabolomics data
    Courtney Schiffman
    Lauren Petrick
    Kelsi Perttula
    Yukiko Yano
    Henrik Carlsson
    Todd Whitehead
    Catherine Metayer
    Josie Hayes
    Stephen Rappaport
    Sandrine Dudoit
    BMC Bioinformatics, 20
  • [8] A Python']Python-Based Pipeline for Preprocessing LC-MS Data for Untargeted Metabolomics Workflows
    Riquelme, Gabriel
    Zabalegui, Nicolas
    Marchi, Pablo
    Jones, Christina M.
    Monge, Maria Eugenia
    METABOLITES, 2020, 10 (10) : 1 - 14
  • [9] With Guide of Spike-in Experiment for Optimizing Workflow of LC-MS data Processing in Metabolomics
    Yan, Bing-peng
    Cao, Chun-mei
    Hou, Jin-jun
    Bi, Qi-rui
    Yang, Min
    Qi, Peng
    Shi, Xiao-jian
    Wang, Jian-wei
    Wu, Wan-ying
    Guo, De-an
    NATURAL PRODUCT COMMUNICATIONS, 2017, 12 (08) : 1295 - 1300
  • [10] An approach for feature selection with data modelling in LC-MS metabolomics
    Plyushchenko, Ivan
    Shakhmatov, Dmitry
    Bolotnik, Timofey
    Baygildiev, Timur
    Nesterenko, Pavel N.
    Rodin, Igor
    ANALYTICAL METHODS, 2020, 12 (28) : 3582 - 3591