Gaussian binning: a new kernel-based method for processing NMR spectroscopic data for metabolomics

被引:51
|
作者
Anderson, Paul E. [1 ]
Reo, Nicholas V. [2 ]
DelRaso, Nicholas J. [3 ]
Doom, Travis E. [1 ]
Raymer, Michael L. [1 ]
机构
[1] Wright State Univ, Dept Comp Sci & Engn, Dayton, OH 45435 USA
[2] Wright State Univ, Boonshoft Sch Med, Dept Biochem & Mol Biol, Dayton, OH 45429 USA
[3] USAF, Wright Patterson AFB, Human Performance Wing 711, Wright Patterson AFB, OH 45433 USA
关键词
Gaussian; binning; pattern recognition; quantification; nuclear magnetic resonance;
D O I
10.1007/s11306-008-0117-3
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
In many metabolomics studies, NMR spectra are divided into bins of fixed width. This spectral quantification technique, known as uniform binning, is used to reduce the number of variables for pattern recognition techniques and to mitigate effects from variations in peak positions; however, shifts in peaks near the boundaries can cause dramatic quantitative changes in adjacent bins due to non-overlapping boundaries. Here we describe a new Gaussian binning method that incorporates overlapping bins to minimize these effects. A Gaussian kernel weights the signal contribution relative to distance from bin center, and the overlap between bins is controlled by the kernel standard deviation. Sensitivity to peak shift was assessed for a series of test spectra where the offset frequency was incremented in 0.5 Hz steps. For a 4 Hz shift within a bin width of 24 Hz, the error for uniform binning increased by 150%, while the error for Gaussian binning increased by 50%. Further, using a urinary metabolomics data set (from a toxicity study) and principal component analysis (PCA), we showed that the information content in the quantified features was equivalent for Gaussian and uniform binning methods. The separation between groups in the PCA scores plot, measured by the J(2) quality metric, is as good or better for Gaussian binning versus uniform binning. The Gaussian method is shown to be robust in regards to peak shift, while still retaining the information needed by classification and multivariate statistical techniques for NMR-metabolomics data.
引用
收藏
页码:261 / 272
页数:12
相关论文
共 50 条
  • [1] Gaussian binning: a new kernel-based method for processing NMR spectroscopic data for metabolomics
    Paul E. Anderson
    Nicholas V. Reo
    Nicholas J. DelRaso
    Travis E. Doom
    Michael L. Raymer
    Metabolomics, 2008, 4 : 261 - 272
  • [2] Adaptive Binning Method for NMR Spectroscopic Metabonomics Data Preprocessing
    Dong Ji-Yang
    Xu Le
    Xu Jing-Jing
    Chen Zhong
    CHEMICAL JOURNAL OF CHINESE UNIVERSITIES-CHINESE, 2009, 30 (06): : 1101 - 1108
  • [3] Binning of kernel-based projection pursuit indices in XGobi
    Klinke, S
    Cook, D
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1997, 25 (03) : 363 - 369
  • [4] Binning of kernel-based projection pursuit indices in XGobi
    Humboldt-Univ of Berlin, Berlin, Germany
    Comput Stat Data Anal, 3 (363-369):
  • [5] A new reproducing kernel-based nonlinear dimension reduction method for survival data
    Cui, Wenquan
    Xu, Jianjun
    Wu, Yuehua
    SCANDINAVIAN JOURNAL OF STATISTICS, 2023, 50 (03) : 1365 - 1390
  • [6] Boosting as a kernel-based method
    Aravkin, Aleksandr Y.
    Bottegal, Giulio
    Pillonetto, Gianluigi
    MACHINE LEARNING, 2019, 108 (11) : 1951 - 1974
  • [7] Kernel-based ensemble gaussian mixture filtering for orbit determination with sparse data
    Yun, Sehyun
    Zanetti, Renato
    Jones, Brandon A.
    ADVANCES IN SPACE RESEARCH, 2022, 69 (12) : 4179 - 4197
  • [8] Boosting as a kernel-based method
    Aleksandr Y. Aravkin
    Giulio Bottegal
    Gianluigi Pillonetto
    Machine Learning, 2019, 108 : 1951 - 1974
  • [9] Adaptive binning: An improved binning method for metabolomics data using the undecimated wavelet transform
    Davis, Richard A.
    Charlton, Adrian J.
    Godward, John
    Jones, Stephen A.
    Harrison, Mark
    Wilson, Julie C.
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2007, 85 (01) : 144 - 154
  • [10] New Variable Scaling Method for NMR-based Metabolomics Data Analysis
    Dong Ji-Yang
    Li Wei
    Deng Ling-Li
    Xu Jing-Jing
    Griffin, Julian L.
    Chen Zhong
    CHEMICAL JOURNAL OF CHINESE UNIVERSITIES-CHINESE, 2011, 32 (02): : 262 - 268