Variable Selection in Visible and Near-Infrared Spectral Analysis for Noninvasive Determination of Soluble Solids Content of ‘Ya’ Pear

Authors
Jiangbo Li
Wenqian Huang
Liping Chen
Shuxiang Fan
Baohua Zhang
Zhiming Guo
Chunjiang Zhao
Affiliations
[1] Beijing Academy of Agriculture and Forestry Sciences, Beijing Research Center of Intelligent Equipment for Agriculture
[2] China Agricultural University, College of Engineering
Source
Food Analytical Methods | 2014 / Vol. 7
Keywords
Near infrared spectroscopy; Monte Carlo–uninformative variable elimination; Successive projections algorithm; Variable selection; Soluble solids content; ‘Ya’ pear
DOI
Not available
Abstract
Informative variable (wavelength) selection plays an important role in the quantitative analysis of near-infrared (NIR) spectra, because modern spectroscopic instruments usually have high resolution and the resulting spectral data sets may contain thousands of variables and hundreds or thousands of samples. In this study, a new combination of Monte Carlo–uninformative variable elimination (MC-UVE) and the successive projections algorithm (SPA), termed MC-UVE-SPA, was proposed to select the most effective variables. MC-UVE was first used to eliminate the uninformative variables in the raw spectral data. SPA was then applied to select the variables with the least collinearity. A case study was carried out on NIR spectroscopy for the nondestructive determination of soluble solids content (SSC) in ‘Ya’ pear. A total of 160 samples were prepared and divided into calibration (n = 120) and prediction (n = 40) sets. Three calibration algorithms were used to build models: two linear methods, partial least squares (PLS) regression and multiple linear regression (MLR), and one nonlinear method, least-squares support vector machine (LS-SVM). Each was applied to the variables selected by SPA, UVE, MC-UVE, UVE-SPA, and MC-UVE-SPA. The results indicated that the linear models (PLS and MLR) were more effective than the nonlinear LS-SVM model in predicting the SSC of ‘Ya’ pear. Among the linear models, the different variable selection methods gave similar results, with root mean square error of prediction (RMSEP) values ranging from 0.2437 to 0.2830. However, combining MC-UVE with SPA helped to obtain a more parsimonious and efficient model for predicting SSC in ‘Ya’ pear. The MC-UVE-SPA-MLR model, built on the 22 effective variables selected by MC-UVE-SPA, was the best of all the developed models in balancing model accuracy against model complexity. Its coefficient of determination (r²), RMSEP, and residual predictive deviation (RPD) were 0.9271, 0.2522, and 3.7037, respectively.
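The two-stage selection described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and several parts are assumptions made for illustration: ridge regression stands in for the PLS model normally used inside MC-UVE, the stability cut-off is a fixed "top 20" rather than a validated threshold, SPA starts from a single fixed column instead of comparing candidate chains by validation error, and the data are synthetic rather than pear spectra. The function names (`mc_uve_stability`, `spa`, `mlr_fit_predict`, `prediction_metrics`) are hypothetical, and the metric definitions are common conventions that the abstract itself does not spell out.

```python
import numpy as np

def mc_uve_stability(X, y, n_runs=100, frac=0.8, ridge=1e-3, seed=0):
    """Stability (mean / std) of each regression coefficient over random
    sub-samples. Ridge regression is a stand-in for PLS (an assumption)."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    coefs = np.empty((n_runs, p))
    for i in range(n_runs):
        idx = rng.choice(n, size=int(frac * n), replace=False)
        Xs = X[idx] - X[idx].mean(axis=0)
        ys = y[idx] - y[idx].mean()
        coefs[i] = np.linalg.solve(Xs.T @ Xs + ridge * np.eye(p), Xs.T @ ys)
    return coefs.mean(axis=0) / coefs.std(axis=0)

def spa(X, n_select, start=0):
    """Successive projections: project every column onto the orthogonal
    complement of the last pick, then pick the column with the largest norm."""
    P = X - X.mean(axis=0)
    selected = [start]
    for _ in range(n_select - 1):
        v = P[:, selected[-1]]
        P = P - np.outer(v, v @ P) / (v @ v)
        norms = np.linalg.norm(P, axis=0)
        norms[selected] = -1.0  # never re-pick a selected column
        selected.append(int(np.argmax(norms)))
    return selected

def mlr_fit_predict(X_cal, y_cal, X_new):
    """Ordinary least squares with an intercept, fitted on the calibration set."""
    A = np.column_stack([np.ones(len(X_cal)), X_cal])
    beta, *_ = np.linalg.lstsq(A, y_cal, rcond=None)
    return np.column_stack([np.ones(len(X_new)), X_new]) @ beta

def prediction_metrics(y_true, y_pred):
    """Assumed definitions: r2 = 1 - SSE/SST, RPD = SD(y_true) / RMSEP."""
    resid = y_true - y_pred
    rmsep = np.sqrt(np.mean(resid ** 2))
    rpd = np.std(y_true) / rmsep
    r2 = 1.0 - np.sum(resid ** 2) / np.sum((y_true - y_true.mean()) ** 2)
    return r2, rmsep, rpd

# Synthetic stand-in for spectra: 160 samples, 60 collinear "wavelengths".
rng = np.random.default_rng(42)
scores = rng.normal(size=(160, 3))
loadings = rng.normal(size=(3, 60))
X = scores @ loadings + 0.02 * rng.normal(size=(160, 60))
y = 12.0 + scores @ np.array([0.8, -0.5, 0.3]) + 0.05 * rng.normal(size=160)
X_cal, y_cal, X_test, y_test = X[:120], y[:120], X[120:], y[120:]

stability = mc_uve_stability(X_cal, y_cal)
kept = np.argsort(-np.abs(stability))[:20]       # stage 1: top 20 by |stability|
picked = kept[spa(X_cal[:, kept], n_select=5)]   # stage 2: 5 least-collinear
y_hat = mlr_fit_predict(X_cal[:, picked], y_cal, X_test[:, picked])
r2, rmsep, rpd = prediction_metrics(y_test, y_hat)
print(f"{len(picked)} variables: r2={r2:.3f}, RMSEP={rmsep:.3f}, RPD={rpd:.2f}")
```

Under these assumed definitions, r² and RPD are tied by r² = 1 − 1/RPD². The reported values fit that relation (1 − 1/3.7037² ≈ 0.9271), which is consistent with, though does not prove, that similar definitions were used in the study.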
Pages: 1891-1902
Page count: 11
Related papers
  • [1] Variable Selection in Visible and Near-Infrared Spectral Analysis for Noninvasive Determination of Soluble Solids Content of 'Ya' Pear
    Li, Jiangbo
    Huang, Wenqian
    Chen, Liping
    Fan, Shuxiang
    Zhang, Baohua
    Guo, Zhiming
    Zhao, Chunjiang
    FOOD ANALYTICAL METHODS, 2014, 7 (09) : 1891 - 1902
  • [2] Determination of the Soluble Solids Content in Korla Fragrant Pears Based on Visible and Near-Infrared Spectroscopy Combined With Model Analysis and Variable Selection
    Yang, Xuhai
    Zhu, Lichun
    Huang, Xiao
    Zhang, Qian
    Li, Sheng
    Chen, Qiling
    Wang, Zhendong
    Li, Jingbin
    FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [3] Near-Infrared Hyperspectral Imaging Combined with CARS Algorithm to Quantitatively Determine Soluble Solids Content in "Ya" Pear
    Li Jiang-bo
    Peng Yan-kun
    Chen Li-ping
    Huang Wen-qian
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2014, 34 (05) : 1264 - 1269
  • [4] Variable selection for the determination of the soluble solid content of potatoes with surface impurities in the visible/near-infrared range
    Han, Minjie
    Wang, Xiangyou
    Xu, Yingchao
    Cui, Yingjun
    Wang, Liang
    Lv, Danyang
    Cui, Lixia
    BIOSYSTEMS ENGINEERING, 2021, 209 : 170 - 179
  • [5] Nondestructive online measurement of pineapple maturity and soluble solids content using visible and near-infrared spectral analysis
    Semyalo, Dennis
    Kwon, Ohtae
    Wakholi, Collins
    Min, Hyun Jung
    Cho, Byoung-Kwan
    POSTHARVEST BIOLOGY AND TECHNOLOGY, 2024, 209
  • [6] Improving moisture and soluble solids content prediction in pear fruit using near-infrared spectroscopy with variable selection and model updating approach
    Mishra, Puneet
    Woltering, Ernst
    Brouwer, Bastiaan
    Echtelt, Esther Hogeveen-van
    POSTHARVEST BIOLOGY AND TECHNOLOGY, 2021, 171
  • [7] Visible/near-infrared Spectroscopy and Hyperspectral Imaging Facilitate the Rapid Determination of Soluble Solids Content in Fruits
    Zhao, Yiying
    Zhou, Lei
    Wang, Wei
    Zhang, Xiaobin
    Gu, Qing
    Zhu, Yihang
    Chen, Rongqin
    Zhang, Chu
    FOOD ENGINEERING REVIEWS, 2024, 16 (03) : 470 - 496
  • [8] A variable importance criterion for variable selection in near-infrared spectral analysis
    Jin Zhang
    Xiaoyu Cui
    Wensheng Cai
    Xueguang Shao
    Science China (Chemistry), 2019, 62 (02) : 271 - 279
  • [9] A variable importance criterion for variable selection in near-infrared spectral analysis
    Jin Zhang
    Xiaoyu Cui
    Wensheng Cai
    Xueguang Shao
    Science China Chemistry, 2019, 62 : 271 - 279
  • [10] A variable importance criterion for variable selection in near-infrared spectral analysis
    Zhang, Jin
    Cui, Xiaoyu
    Cai, Wensheng
    Shao, Xueguang
    SCIENCE CHINA-CHEMISTRY, 2019, 62 (02) : 271 - 279