Data Preprocessing Methods of FT-NIR Spectral Data for the Classification Cooking Oil

被引:2
|
作者
Ruah, Mas Ezatul Nadia Mohd [1 ,2 ]
Rasaruddin, Nor Fazila [1 ,2 ]
Fong, Sim Siong [3 ]
Jaafar, Mohd Zuli [1 ,2 ]
机构
[1] Univ Teknol MARA, Fac Appl Sci, Shah Alam 40450, Selangor, Malaysia
[2] Univ Teknol MARA, Kuala Pilah 72000, Malaysia
[3] Univ Malaysia Sarawak, Kota Samarahan 94300, Malaysia
关键词
Fourier Transform Near Infrared (FT-MR); Savitzky; -; Golay; Standard Nonnal Vanate (SNV); multivariate analysis; Classification; NEAR-INFRARED SPECTRA; EDIBLE OILS; VALIDATION; SELECTION;
D O I
10.1063/1.4903688
中图分类号
O59 [应用物理学];
学科分类号
摘要
This recent work describes the data pre-processing method of FT-NIR spectroscopy datasets of cooking oil and its quality parameters with chemometrics method. Pre-processing of near-infrared (MR) spectral data has become an integral part of chemometrics modelling. Hence, this work is dedicated to investigate the utility and effectiveness of preprocessing algorithms namely row scaling, column scaling and single scaling process with Standard Normal Variate (SNV). The combinations of these scaling methods have impact on exploratory analysis and classification via Principle Component Analysis plot (PCA). The samples were divided into palm oil and non-palm cooking oil. The classification model was build using FT-NIR cooking oil spectra datasets in absorbance mode at the range of 4000cm-1- 14000cm-1. Savitzky Golay derivative was applied before developing the classification model. Then, the data was separated into two sets which were training set and test set by using Duplex method. The number of each class was kept equal to 2/3 of the class that has the minimum number of sample. Then, the sample was employed I-statistic as variable selection method in order to select which variable is significant towards the classification models. The evaluation of data pre-processing were looking at value of modified silhouette width (mSW), PCA and also Percentage Correctly Classified (%CC). The results show that different data processing strategies resulting to substantial amount of model performances quality. The effects of several data pre-processing i.e. row scaling, column standardisation and single scaling process with Standard Normal Variate indicated by mSW and %CC. At two PCs model, all five classifier gave high %CC except Quadratic Distance Analysis.
引用
收藏
页码:890 / 897
页数:8
相关论文
共 50 条
  • [41] Machine Learning Methods Based Preprocessing to Improve Categorical Data Classification
    Ruiz-Chavez, Zoila
    Salvador-Meneses, Jaime
    Garcia-Rodriguez, Jose
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 297 - 304
  • [42] Artificial neural networks in classification of NIR spectral data: Selection of the input
    Wu, W
    Massart, DL
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1996, 35 (01) : 127 - 135
  • [43] Classification Method for Viability Screening of Naturally Aged Watermelon Seeds Using FT-NIR Spectroscopy
    Yasmin, Jannat
    Ahmed, Mohammed Raju
    Lohumi, Santosh
    Wakholi, Collins
    Kim, Moon S.
    Cho, Byoung-Kwan
    SENSORS, 2019, 19 (05)
  • [44] Classification of waste wood categories according to the best reuse using FT-NIR spectroscopy and chemometrics
    Mancini, Manuela
    Rinnan, Asmund
    ANALYTICA CHIMICA ACTA, 2023, 1275
  • [45] Monitoring the Composting Process of Olive Oil Industry Waste: Benchtop FT-NIR vs. Miniaturized NIR Spectrometer
    P. Rueda, Marta
    Dominguez-Vidal, Ana
    Aranda, Victor
    Ayora-Canada, Maria Jose
    AGRONOMY-BASEL, 2024, 14 (12):
  • [46] Tracing Geographical Origins of Teas Based on FT-NIR Spectroscopy: Introduction of Model Updating and Imbalanced Data Handling Approaches
    Hong, Xue-Zhen
    Fu, Xian-Shu
    Wang, Zheng-Liang
    Zhang, Li
    Yu, Xiao-Ping
    Ye, Zi-Hong
    JOURNAL OF ANALYTICAL METHODS IN CHEMISTRY, 2019, 2019
  • [47] Data fusion of FT-NIR and ATR-FTIR spectra for accurate authentication of geographical indications for Gastrodia elata Blume
    Zheng, Chuanmao
    Li, Jieqing
    Liu, Honggao
    Wang, Yuanzhong
    FOOD BIOSCIENCE, 2023, 56
  • [48] USING PARTIAL LEAST-SQUARES REGRESSION AND MULTIPLICATIVE SCATTER CORRECTION FOR FT-NIR DATA EVALUATION OF WHEAT FLOURS
    SORVANIEMI, J
    KINNUNEN, A
    TSADOS, A
    MALKKI, Y
    FOOD SCIENCE AND TECHNOLOGY-LEBENSMITTEL-WISSENSCHAFT & TECHNOLOGIE, 1993, 26 (03): : 251 - 258
  • [49] MC: a Unsupervised Data Preprocessing for Classification
    Hu, Enliang
    Chen, Songcan
    Yin, Xuesong
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL I, PROCEEDINGS, 2008, : 259 - 263
  • [50] An individualized preprocessing for medical data classification
    AlMuhaideb, Sarab
    Menai, Mohamed El Bachir
    4TH SYMPOSIUM ON DATA MINING APPLICATIONS (SDMA2016), 2016, 82 : 35 - 42