xMSanalyzer: automated pipeline for improved feature detection and downstream analysis of large-scale, non-targeted metabolomics data

Cited by: 274
Authors
Uppal, Karan [1, 6]
Soltow, Quinlyn A. [2]
Strobel, Frederick H. [3]
Pittard, W. Stephen [1]
Gernert, Kim M. [1]
Yu, Tianwei [4]
Jones, Dean P. [2, 5]
Affiliations
[1] Emory Univ, Sch Med, BimCore, Atlanta, GA USA
[2] Emory Univ, Dept Med, Div Pulm Allergy & Crit Care, Atlanta, GA 30322 USA
[3] Emory Univ, Mass Spectrometry Ctr, Atlanta, GA 30322 USA
[4] Emory Univ, Rollins Sch Publ Hlth, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[5] Emory Univ, Clin Biomarkers Lab, Atlanta, GA 30322 USA
[6] Georgia Inst Technol, Sch Biol, Atlanta, GA 30332 USA
Source
BMC BIOINFORMATICS | 2013 / Vol. 14
Funding
National Institutes of Health (USA);
Keywords
OPEN-SOURCE SOFTWARE; MASS; ALIGNMENT; ALGORITHMS; FRAMEWORK; OPENMS; MZMINE; SUITE;
DOI
10.1186/1471-2105-14-15
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: Detection of low abundance metabolites is important for de novo mapping of metabolic pathways related to diet, microbiome or environmental exposures. Multiple algorithms are available to extract m/z features from liquid chromatography-mass spectral data in a conservative manner, which tends to preclude detection of low abundance chemicals and chemicals found in small subsets of samples. The present study provides software to enhance such algorithms for feature detection, quality assessment, and annotation. Results: xMSanalyzer is a set of utilities for automated processing of metabolomics data. The utilities can be classified into four main modules to: 1) improve feature detection for replicate analyses by systematic re-extraction with multiple parameter settings and data merger to optimize the balance between sensitivity and reliability, 2) evaluate sample quality and feature consistency, 3) detect feature overlap between datasets, and 4) characterize high-resolution m/z matches to small molecule metabolites and biological pathways using multiple chemical databases. The package was tested with plasma samples and shown to more than double the number of features extracted while improving quantitative reliability of detection. MS/MS analysis of a random subset of peaks that were exclusively detected using xMSanalyzer confirmed that the optimization scheme improves detection of real metabolites. Conclusions: xMSanalyzer is a package of utilities for data extraction, quality control assessment, detection of overlapping and unique metabolites in multiple datasets, and batch annotation of metabolites. The program was designed to integrate with existing packages such as apLCMS and XCMS, but the framework can also be used to enhance data extraction for other LC/MS data software.
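The data-merger idea in module 1 (combining features extracted under several parameter settings) can be illustrated with a minimal Python sketch. This is not the xMSanalyzer implementation, which is distributed as an R package; the `Feature` class, the function names, and the tolerance values (10 ppm for m/z, 30 s for retention time) are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """One extracted LC/MS feature (hypothetical structure)."""
    mz: float         # mass-to-charge ratio
    rt: float         # retention time in seconds
    intensity: float  # measured intensity


def same_feature(a, b, ppm_tol=10.0, rt_tol=30.0):
    """Treat two features as the same if m/z and retention time both agree
    within the given tolerances (illustrative defaults, not from the paper)."""
    ppm_diff = abs(a.mz - b.mz) / a.mz * 1e6
    return ppm_diff <= ppm_tol and abs(a.rt - b.rt) <= rt_tol


def merge_feature_lists(lists, ppm_tol=10.0, rt_tol=30.0):
    """Union of several feature lists, keeping the first occurrence of each
    feature and skipping later matches within tolerance."""
    merged = []
    for features in lists:
        for f in features:
            if not any(same_feature(m, f, ppm_tol, rt_tol) for m in merged):
                merged.append(f)
    return merged


# Two made-up extractions of the same sample under different settings.
run_a = [Feature(100.0000, 60.0, 1e5), Feature(200.0000, 120.0, 5e4)]
run_b = [Feature(100.0005, 62.0, 9e4), Feature(300.0000, 200.0, 1e3)]

merged = merge_feature_lists([run_a, run_b])
```

In this toy case the feature near m/z 100 appears in both runs and is kept once, while the features unique to each run are both retained. A real merger would also need a rule for choosing among matched features, for example by consistency across replicates.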
Pages: 12