Variable Selection in Visible and Near-Infrared Spectral Analysis for Noninvasive Determination of Soluble Solids Content of ‘Ya’ Pear

被引:0
|
作者
Jiangbo Li
Wenqian Huang
Liping Chen
Shuxiang Fan
Baohua Zhang
Zhiming Guo
Chunjiang Zhao
机构
[1] Beijing Academy of Agriculture and Forestry Sciences,Beijing Research Center of Intelligent Equipment for Agriculture
[2] China Agricultural University,College of Engineering
来源
Food Analytical Methods | 2014年 / 7卷
关键词
Near infrared spectroscopy; Monte Carlo–uninformative variable elimination; Successive projections algorithm; Variable selection; Soluble solids content; ‘Ya’ pear;
D O I
暂无
中图分类号
学科分类号
摘要
Informative variable selection or wavelength selection plays an important role in the quantitative analysis of near-infrared (NIR) spectra because the modern spectroscopy instrumentations usually have a high resolution and the obtained spectral data sets may have thousands of variables and hundreds or thousands of samples. In this study, a new combination of Monte Carlo–uninformative variable elimination (MC-UVE) and successive projections algorithm (SPA; MC-UVE-SPA) was proposed to select the most effective variables. MC-UVE was firstly used to eliminate the uninformative variables in the raw spectra data. Then, SPA was applied to determine the variables with the least collinearity. A case study was done based on the NIR spectroscopy for the non-destructive determination of soluble solids content (SSC) in ‘Ya’ pear. A total of 160 samples were prepared for the calibration (n = 120) and prediction (n = 40) sets. Three calibration algorithms including linear regressions of partial least square regression (PLS) and multiple linear regression (MLR), and nonlinear regression of least-square support vector machine (LS-SVM) were used for model establishment by using the selected variables by SPA, UVE, MC-UVE, UVE-SPA, and MC-UVE-SPA, respectively. The results indicated that linear models such as PLS and MLR were more effective than nonlinear model such as LS-SVM in the prediction of SSC of ‘Ya’ pear. In terms of linear models, different variable selection methods can obtain a similar result with the RMSEP values range from 0.2437 to 0.2830. However, combination of MC-UVE and SPA was helpful for obtaining a more parsimonious and efficient model for predicting the SSC values in ‘Ya’ pear. Twenty-two effective variables selected by MC-UVE-SPA achieved the optimal linear MC-UVE-SPA-MLR model compared with other all developed models by balancing between model accuracy and model complexity. The coefficients of determination (r2), root mean square error of prediction, and residual predictive deviation by MC-UVE-SPA-MLR were 0.9271, 0.2522, and 3.7037, respectively.
引用
收藏
页码:1891 / 1902
页数:11
相关论文
共 50 条
  • [31] Near-infrared (NIR) spectrometric technique for nondestructive determination of soluble solids content in processing tomatoes
    Peiris, KHS
    Dull, GG
    Leffler, RG
    Kays, SJ
    JOURNAL OF THE AMERICAN SOCIETY FOR HORTICULTURAL SCIENCE, 1998, 123 (06) : 1089 - 1093
  • [32] Near-infrared transmittance spectroscopy for nondestructive determination of soluble solids content and pH in tomato juice
    Xie, Lijuan
    Ying, Yibin
    Lin, Hongjian
    Zhou, Ying
    Niu, Xiaoying
    Jiang, Xuesong
    OPTICS FOR NATURAL RESOURCES, AGRICULTURE, AND FOODS II, 2007, 6761
  • [33] Advancing Loquat Total Soluble Solids Content Determination by Near-Infrared Spectroscopy and Explainable AI
    Luo, Yizhi
    Jin, Qingting
    Lu, Huazhong
    Li, Peng
    Qiu, Guangjun
    Qi, Haijun
    Li, Bin
    Zhou, Xingxing
    AGRICULTURE-BASEL, 2025, 15 (03):
  • [34] Determination of Soluble Solids Content in Cuiguan Pear by Vis/NIR Diffuse Transmission Spectroscopy and Variable Selection Methods
    Xu, Wenli
    Sun, Tong
    Wu, Wenqiang
    Hu, Tian
    Hu, Tao
    Liu, Muhua
    KNOWLEDGE ENGINEERING AND MANAGEMENT , ISKE 2013, 2014, 278 : 269 - 276
  • [35] Near-infrared transmittance measuring technique for soluble solids content of watermelon
    Zhejiang University, China
    Nongye Jixie Xuebao, 2007, 5 (111-113):
  • [36] Robustness of Global Model of Soluble Solids in Gongli Pear Based on Near-Infrared Spectroscopy
    Liu Yan-de
    Liao Jun
    Li Bin
    Jiang Xiao-gang
    Zhu Ming-wang
    Yao Jin-liang
    Wang Qiu
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2022, 42 (09) : 2781 - 2787
  • [37] Enhancing Transferability of Near-Infrared Spectral Models for Soluble Solids Content Prediction across Different Fruits
    Guo, Cheng
    Zhang, Jin
    Cai, Wensheng
    Shao, Xueguang
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [38] Determination of watermelon soluble solids content based on visible/near infrared spectroscopy with convolutional neural network
    Wang, Guantian
    Jiang, Xiaogang
    Li, Xiong
    Liu, Yande
    Rao, Yu
    Zhang, Yu
    Xin, Manyu
    INFRARED PHYSICS & TECHNOLOGY, 2023, 133
  • [39] Near-Infrared Model and Its Robustness as Affected by Fruit Origin for 'Dangshan' Pear Soluble Solids Content and pH Measurement
    Cheng, Tao
    Guo, Sen
    Pan, Zhenggao
    Fan, Shuxiang
    Ju, Shucun
    Xin, Zhenghua
    Zhou, Xin-Gen
    Jiang, Fei
    Zhang, Dongyan
    AGRICULTURE-BASEL, 2022, 12 (10):
  • [40] Postharvest Dry Matter and Soluble Solids Content Prediction in d'Anjou and Bartlett Pear Using Near-infrared Spectroscopy
    Goke, Alex
    Serra, Sara
    Musacchi, Stefano
    HORTSCIENCE, 2018, 53 (05) : 669 - 680