An Untargeted Metabolomics Workflow that Scales to Thousands of Samples for Population-Based Studies

被引:1
|
作者
Stancliffe, Ethan [1 ,2 ]
Schwaiger-Haber, Michaela [1 ,2 ]
Sindelar, Miriam [1 ,2 ]
Murphy, Matthew J. [1 ,2 ]
Soerensen, Mette [3 ]
Patti, Gary J. [1 ,2 ,4 ]
机构
[1] Washington Univ, Dept Chem, Dept Med, St Louis, MO 63130 USA
[2] Washington Univ, Ctr Metabol & Isotope Tracing, St Louis, MO 63130 USA
[3] Univ Southern Denmark, Dept Publ Hlth, Epidemiol Biostat & Biodemog, DK-5230 Odense, Denmark
[4] Washington Univ, Siteman Canc Ctr, St Louis, MO 63130 USA
基金
美国国家卫生研究院;
关键词
SPECTROMETRY DATA; ANNOTATION; ALIGNMENT; STRATEGY; DATABASE; XCMS; MILK; MS;
D O I
10.1021/acs.analchem.2c0127017370Anal
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The success of precision medicine relies upon collecting data from many individuals at the population level. Although advancing technologies have made such large-scale studies increasingly feasible in some disciplines such as genomics, the standard workflows currently implemented in untargeted metabolomics were developed for small sample numbers and are limited by the processing of liquid chromatography/mass spectrometry data. Here we present an untargeted metabolomics workflow that is designed to support large-scale projects with thousands of biospecimens. Our strategy is to first evaluate a reference sample created by pooling aliquots of biospecimens from the cohort. The reference sample captures the chemical complexity of the biological matrix in a small number of analytical runs, which can subsequently be processed with conventional software such as XCMS. Although this generates thousands of so-called features, most do not correspond to unique compounds from the samples and can be filtered with established informatics tools. The features remaining represent a comprehensive set of biologically relevant reference chemicals that can then be extracted from the entire cohort's raw data on the basis of m/z values and retention times by using Skyline. To demonstrate applicability to large cohorts, we evaluated >2000 human plasma samples with our workflow. We focused our analysis on 360 identified compounds, but we also profiled >3000 unknowns from the plasma samples. As part of our workflow, we tested 14 different computational approaches for batch correction and found that a random forest-based approach outperformed the others. The corrected data revealed distinct profiles that were associated with the geographic location of participants.
引用
收藏
页码:17370 / 17378
页数:9
相关论文
共 50 条
  • [41] Pulmonary function testing in population-based studies
    Manolio, TA
    Weinmann, GG
    Buist, AS
    Furberg, CD
    Pinsky, JL
    Hurd, SH
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 1997, 156 (03) : 1004 - 1010
  • [42] Population-based glaucoma prevalence studies in Asians
    Cho, Hyun-Kyung
    Kee, Changwon
    SURVEY OF OPHTHALMOLOGY, 2014, 59 (04) : 434 - 447
  • [43] Brain Correlates of Allostatic Load in Population-Based and Clinical Samples
    Frangou, Sophia
    NEUROPSYCHOPHARMACOLOGY, 2017, 42 : S80 - S80
  • [44] Identification of twin pairs from large population-based samples
    Webbink, Dinand
    Roeleveld, Jaap
    Visscher, Peter M.
    TWIN RESEARCH AND HUMAN GENETICS, 2006, 9 (04) : 496 - 500
  • [45] THE GROWTH OF XXX FEMALES - POPULATION-BASED STUDIES
    RATCLIFFE, SG
    PAN, H
    MCKIE, M
    ANNALS OF HUMAN BIOLOGY, 1994, 21 (01) : 57 - 66
  • [46] Measures of frailty in population-based studies: an overview
    Kim Bouillon
    Mika Kivimaki
    Mark Hamer
    Severine Sabia
    Eleonor I Fransson
    Archana Singh-Manoux
    Catharine R Gale
    G David Batty
    BMC Geriatrics, 13
  • [47] The value of population-based studies in the genomic era
    Wang, Jie Jin
    Wong, Tien Yin
    OPHTHALMIC EPIDEMIOLOGY, 2007, 14 (01) : 1 - 2
  • [48] ADHD research: the power of population-based studies
    Sciberras, Emma
    LANCET PSYCHIATRY, 2014, 1 (04): : 248 - 249
  • [49] Haplotype analysis in population-based association studies
    Mander, A. P.
    STATA JOURNAL, 2001, 1 (01): : 58 - 75
  • [50] Population-based studies of adverse drug effects
    Ray, WA
    NEW ENGLAND JOURNAL OF MEDICINE, 2003, 349 (17): : 1592 - 1594