Data Preprocessing Methods of FT-NIR Spectral Data for the Classification Cooking Oil

被引:2
|
作者
Ruah, Mas Ezatul Nadia Mohd [1 ,2 ]
Rasaruddin, Nor Fazila [1 ,2 ]
Fong, Sim Siong [3 ]
Jaafar, Mohd Zuli [1 ,2 ]
机构
[1] Univ Teknol MARA, Fac Appl Sci, Shah Alam 40450, Selangor, Malaysia
[2] Univ Teknol MARA, Kuala Pilah 72000, Malaysia
[3] Univ Malaysia Sarawak, Kota Samarahan 94300, Malaysia
关键词
Fourier Transform Near Infrared (FT-MR); Savitzky; -; Golay; Standard Nonnal Vanate (SNV); multivariate analysis; Classification; NEAR-INFRARED SPECTRA; EDIBLE OILS; VALIDATION; SELECTION;
D O I
10.1063/1.4903688
中图分类号
O59 [应用物理学];
学科分类号
摘要
This recent work describes the data pre-processing method of FT-NIR spectroscopy datasets of cooking oil and its quality parameters with chemometrics method. Pre-processing of near-infrared (MR) spectral data has become an integral part of chemometrics modelling. Hence, this work is dedicated to investigate the utility and effectiveness of preprocessing algorithms namely row scaling, column scaling and single scaling process with Standard Normal Variate (SNV). The combinations of these scaling methods have impact on exploratory analysis and classification via Principle Component Analysis plot (PCA). The samples were divided into palm oil and non-palm cooking oil. The classification model was build using FT-NIR cooking oil spectra datasets in absorbance mode at the range of 4000cm-1- 14000cm-1. Savitzky Golay derivative was applied before developing the classification model. Then, the data was separated into two sets which were training set and test set by using Duplex method. The number of each class was kept equal to 2/3 of the class that has the minimum number of sample. Then, the sample was employed I-statistic as variable selection method in order to select which variable is significant towards the classification models. The evaluation of data pre-processing were looking at value of modified silhouette width (mSW), PCA and also Percentage Correctly Classified (%CC). The results show that different data processing strategies resulting to substantial amount of model performances quality. The effects of several data pre-processing i.e. row scaling, column standardisation and single scaling process with Standard Normal Variate indicated by mSW and %CC. At two PCs model, all five classifier gave high %CC except Quadratic Distance Analysis.
引用
收藏
页码:890 / 897
页数:8
相关论文
共 50 条
  • [21] Use of FT-NIR spectroscopy for determination of free fatty acids in the sunflowerseed oil
    Semeniuk, Izabela
    Krzysiak-Warzala, Barbara
    Koren-Szwarc, Beata
    Grzywocz, Alicja
    PRZEMYSL CHEMICZNY, 2016, 95 (11): : 2359 - 2362
  • [22] Data fusion of FT-NIR spectroscopy and Vis/NIR hyperspectral imaging to predict quality parameters of yellow flesh "Jintao" kiwifruit
    Cevoli, Chiara
    Iaccheri, Eleonora
    Fabbri, Angelo
    Ragni, Luigi
    BIOSYSTEMS ENGINEERING, 2024, 237 : 157 - 169
  • [23] Large spot size diffuse reflectance FT-NIR spectral sensor for inhomogeneous samples
    Sadek, Mohamed
    Labib, Shady R.
    Mortada, Bassem
    Medhat, Mostafa
    Zeinah, Tarek
    Shebl, Ahmed
    Fadeel, Ahmed
    Gad, Mina
    Abuelnaga, Hussien
    George, Botros
    Sabry, Yasser M.
    Saadany, Bassam
    OPTICAL MEASUREMENT SYSTEMS FOR INDUSTRIAL INSPECTION XII, 2021, 11782
  • [24] Impact of Boolean factorization as preprocessing methods for classification of Boolean data
    Belohlavek, Radim
    Outrata, Jan
    Trnecka, Martin
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2014, 72 (1-2) : 3 - 22
  • [25] Impact of Boolean factorization as preprocessing methods for classification of Boolean data
    Radim Belohlavek
    Jan Outrata
    Martin Trnecka
    Annals of Mathematics and Artificial Intelligence, 2014, 72 : 3 - 22
  • [26] Machine Learning for Seed Quality Classification: An Advanced Approach Using Merger Data from FT-NIR Spectroscopy and X-ray Imaging
    de Medeiros, Andre Dantas
    da Silva, Laercio Junio
    Ribeiro, Joao Paulo Oliveira
    Ferreira, Kamylla Calzolari
    Rosas, Jorge Tadeu Fim
    Santos, Abraao Almeida
    da Silva, Clissia Barboza
    SENSORS, 2020, 20 (15)
  • [27] High-precision identification of the actual storage periods of edible oil by FT-NIR spectroscopy combined with chemometric methods
    He, Yingchao
    Jiang, Hui
    Chen, Quansheng
    ANALYTICAL METHODS, 2020, 12 (29) : 3722 - 3728
  • [28] Classification and Preprocessing in the Stock Data
    Juszczuk, Przemyslaw
    Kozak, Jan
    BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2017, 2017, 303 : 269 - 281
  • [29] Assessment of sesame oil fatty acid and sterol composition with FT-NIR spectroscopy and chemometrics
    Ozdemir, Ibrahim Sani
    Karaoglu, Oznur
    Dag, Cagdas
    Bekiroglu, Somer
    TURKISH JOURNAL OF AGRICULTURE AND FORESTRY, 2018, 42 (06) : 444 - 452
  • [30] Rapid differentiation of Ghana cocoa beans by FT-NIR spectroscopy coupled with multivariate classification
    Teye, Ernest
    Huang, Xingyi
    Dai, Huang
    Chen, Quansheng
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2013, 114 : 183 - 189