Chemometric pre-processing can negatively affect the performance of near-infrared spectroscopy models for fruit quality prediction

被引:60
|
作者
Mishra, Puneet [1 ]
Rutledge, Douglas N. [2 ,3 ]
Roger, Jean-Michel [4 ,5 ]
Wali, Khan [6 ]
Khan, Haris Ahmad [6 ]
机构
[1] Wageningen Food & Biobased Res, Bornse Weilanden 9,POB 17, NL-6700 AA Wageningen, Netherlands
[2] Univ Paris Saclay, UMR SayFood, AgroParisTech, INRAE, F-75005 Paris, France
[3] Charles Sturt Univ, Natl Wine & Grape Ind Ctr, Wagga Wagga, NSW, Australia
[4] Univ Montpellier, Inst Agro, INRAE, ITAP, Montpellier, France
[5] ChemHouse Res Grp, Montpellier, France
[6] Wageningen Univ & Res, Farm Technol Grp, Wageningen, Netherlands
关键词
Artificial intelligence; Neural network; Fruit quality;
D O I
10.1016/j.talanta.2021.122303
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Chemometrics pre-processing of spectral data is widely performed to enhance the predictive performance of near-infrared (NIR) models related to fresh fruit quality. Pre-processing approaches in the domain of NIR data analysis are used to remove the scattering effects, thus, enhancing the absorption components related to the chemical properties. However, in the case of fresh fruit, both the scattering and absorption properties are of key interest as they jointly explain the physicochemical state of a fruit. Therefore, pre-processing data that reduces the scattering information in the spectra may lead to poorly performing models. The objectives of this study are to test two hypotheses to explore the effect of pre-processing on NIR spectra of fresh fruit. The first hypothesis is that the pre-processing of NIR spectra with scatter correction techniques can reduce the predictive performance of models as the scatter correction can reduce the useful scattering information correlated to the property of interest. The second hypothesis is that the Deep Learning (DL) can model the raw absorbance data (mix of scattering and absorption) much more efficiently than the Partial Least Squares (PLS) regression analysis. To test the hypotheses, a real NIR data set related to dry matter (DM) prediction in mango fruit was used. The dataset consisted of a total of 11,420 NIR spectra and reference DM measurements for model training and independent testing. The chemometric pre-processing methods explored were standard normal variate (SNV), variable sorting for normalization (VSN), Savitzky-Golay based 2nd derivative and their combinations. Further two modelling approaches i.e., PLS regression and DL were used to evaluate the effect of pre-processing. The results showed that the best root mean squared error of prediction (RMSEP) for both the PLS and DL models were obtained with the raw absorbance data. The spectral pre-processing in general decreased the performance of both the PLS and DL models. Further, the DL model attained the lowest RMSEP of 0.76%, which was 13% lower compared to the PLS regression on the raw absorbance data. Pre-processing approaches should be carefully used while analysing the NIR data related to fresh fruit.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] SPORT pre-processing can improve near-infrared quality prediction models for fresh fruits and agro-materials
    Mishra, Puneet
    Roger, Jean Michel
    Rutledge, Douglas N.
    Woltering, Ernst
    [J]. POSTHARVEST BIOLOGY AND TECHNOLOGY, 2020, 168
  • [2] Study on Near-Infrared Spectroscopy Signal Pre-processing and Model Building
    Zhang Xiaochao
    Zhang Yinqiao
    Wang Hui
    [J]. INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL : ICACC 2009 - PROCEEDINGS, 2009, : 618 - 622
  • [3] Near-Infrared Data Pre-processing for Glucose Level Prediction in Blood
    Abd Rahim, Intan Maisarah
    Rahim, Herlina Abdul
    Ghazali, Rashidah
    [J]. 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON SYSTEM ENGINEERING AND TECHNOLOGY (ICSET), 2020, : 73 - 78
  • [4] Parallel pre-processing through orthogonalization (PORTO) and its application to near-infrared spectroscopy
    Mishra, Puneet
    Roger, Jean Michel
    Marini, Federico
    Biancolillo, Alessandra
    Rutledge, Douglas N.
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 212
  • [5] Avocado dehydration negatively affects the performance of visible and near-infrared spectroscopy models for dry matter prediction
    Mishra, Puneet
    Paillart, Maxence
    Meesters, Lydia
    Woltering, Ernst
    Chauhan, Aneesh
    [J]. POSTHARVEST BIOLOGY AND TECHNOLOGY, 2022, 183
  • [6] Review of the most common pre-processing techniques for near-infrared spectra
    Rinnan, Asmund
    van den Berg, Frans
    Engelsen, Soren Balling
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2009, 28 (10) : 1201 - 1222
  • [7] Characterisation of heavy oils using near-infrared spectroscopy: Optimisation of pre-processing methods and variable selection
    Laxalde, Jeremy
    Ruckebusch, Cyril
    Devos, Olivier
    Caillol, Noemie
    Wahl, Francois
    Duponchel, Ludovic
    [J]. ANALYTICA CHIMICA ACTA, 2011, 705 (1-2) : 227 - 234
  • [8] The influence of data pre-processing in the pattern recognition of excipients near-infrared spectra
    Candolfi, A
    De Maesschalck, R
    Jouan-Rimbaud, D
    Hailey, PA
    Massart, DL
    [J]. JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 1999, 21 (01) : 115 - 132
  • [9] Cancer Discrimination Using Fourier Transform Near-Infrared Spectroscopy with Chemometric Models
    Chen, Hui
    Lin, Zan
    Tan, Chao
    [J]. JOURNAL OF CHEMISTRY, 2015, 2015
  • [10] Research on the Pre-Processing Methods of Wheat Hardness Prediction Model Based on Visible-Near Infrared Spectroscopy
    Hui Guang-yan
    Sun Lai-jun
    Wang Jia-nan
    Wang Le-kai
    Dai Chang-jun
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2016, 36 (07) : 2111 - 2116