Synchronously Predicting Tea Polyphenol and Epigallocatechin Gallate in Tea Leaves Using Fourier Transform-Near-Infrared Spectroscopy and Machine Learning

被引:10
|
作者
Ye, Sitan [1 ]
Weng, Haiyong [2 ,3 ]
Xiang, Lirong [4 ]
Jia, Liangquan [5 ]
Xu, Jinchai [2 ,3 ]
机构
[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, England
[2] Fujian Agr & Forestry Univ, Coll Mech & Elect Engn, Fujian Key Lab Agr Informat Sensoring Technol, Fuzhou 350100, Peoples R China
[3] Fujian Agr & Forestry Univ, Haixia Inst Sci & Technol, Sch Future Technol, Fuzhou 350002, Peoples R China
[4] North Carolina State Univ, Dept Biol & Agr Engn, Raleigh, NC 27606 USA
[5] Huzhou Univ, Sch Informat Engn, Huzhou 313000, Peoples R China
来源
MOLECULES | 2023年 / 28卷 / 14期
关键词
tea polyphenol; EGCG; Fourier Transform-near-infrared spectroscopy; machine learning; rapid prediction; SUPPORT VECTOR REGRESSION; GREEN TEA; SPECTRA; OPTIMIZATION; COMBINATION; PLSR; NIR;
D O I
10.3390/molecules28145379
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Tea polyphenol and epigallocatechin gallate (EGCG) were considered as key components of tea. The rapid prediction of these two components can be beneficial for tea quality control and product development for tea producers, breeders and consumers. This study aimed to develop reliable models for tea polyphenols and EGCG content prediction during the breeding process using Fourier Transform-near infrared (FT-NIR) spectroscopy combined with machine learning algorithms. Various spectral preprocessing methods including Savitzky-Golay smoothing (SG), standard normal variate (SNV), vector normalization (VN), multiplicative scatter correction (MSC) and first derivative (FD) were applied to improve the quality of the collected spectra. Partial least squares regression (PLSR) and least squares support vector regression (LS-SVR) were introduced to establish models for tea polyphenol and EGCG content prediction based on different preprocessed spectral data. Variable selection algorithms, including competitive adaptive reweighted sampling (CARS) and random forest (RF), were further utilized to identify key spectral bands to improve the efficiency of the models. The results demonstrate that the optimal model for tea polyphenols calibration was the LS-SVR with Rp = 0.975 and RPD = 4.540 based on SG-smoothed full spectra. For EGCG detection, the best model was the LS-SVR with Rp = 0.936 and RPD = 2.841 using full original spectra as model inputs. The application of variable selection algorithms further improved the predictive performance of the models. The LS-SVR model for tea polyphenols prediction with Rp = 0.978 and RPD = 4.833 used 30 CARS-selected variables, while the LS-SVR model build on 27 RF-selected variables achieved the best predictive ability with Rp = 0.944 and RPD = 3.049, respectively, for EGCG prediction. The results demonstrate a potential of FT-NIR spectroscopy combined with machine learning for the rapid screening of genotypes with high tea polyphenol and EGCG content in tea leaves.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Fourier transform near infrared spectroscopy as a tool for predicting antioxidant activity of propolis
    Calegari, Matheus Augusto
    Ayres, Bruno Bresolin
    dos Santos Tonial, Larissa Macedo
    de Alencar, Severino Matias
    Cadorin Oldoni, Tatiane Luiza
    JOURNAL OF KING SAUD UNIVERSITY SCIENCE, 2020, 32 (01) : 784 - 790
  • [22] DETERMINATION OF CATECHINS AND CAFFEINE CONTENT IN TEA (CAMELLIA SINENSIS L.) LEAVES AT DIFFERENT POSITIONS BY FOURIER-TRANSFORM INFRARED SPECTROSCOPY
    Zhou, R. Q.
    Li, X. L.
    He, Y.
    Jin, J. J.
    TRANSACTIONS OF THE ASABE, 2018, 61 (04) : 1221 - 1230
  • [23] A tea classification method based on near infrared spectroscopy (NIRS) and transfer learning
    Liu, Long
    Wang, Bin
    Xu, Xiaoxuan
    Xu, Jing
    INFRARED PHYSICS & TECHNOLOGY, 2025, 145
  • [24] Development and Validation of Near-Infrared Methods for the Quantitation of Caffeine, Epigallocatechin-3-gallate, and Moisture in Green Tea Production
    Zhang, Shengsheng
    Zuo, Yamin
    Wu, Qing
    Wang, Jiao
    Ban, Lin
    Yang, Huili
    Bai, Zhiwen
    JOURNAL OF ANALYTICAL METHODS IN CHEMISTRY, 2021, 2021
  • [25] Study on Detection of Talcum Powder in Green Tea Based on Fourier Transform Infrared (FTIR) Transmission Spectroscopy
    Li Xiao-li
    Zhang Yu-ying
    He Yong
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2017, 37 (04) : 1081 - 1085
  • [26] Optimization of Informative Spectral Variables for the Quantification of EGCG in Green Tea Using Fourier Transform Near-Infrared (FT-NIR) Spectroscopy and Multivariate Calibration
    Guo, Zhiming
    Chen, Quansheng
    Chen, Liping
    Huang, Wenqian
    Zhang, Chi
    Zhao, Chunjiang
    APPLIED SPECTROSCOPY, 2011, 65 (09) : 1062 - 1067
  • [27] A Study on Origin Traceability of White Tea (White Peony) Based on Near-Infrared Spectroscopy and Machine Learning Algorithms
    Zhang, Lingzhi
    Dai, Haomin
    Zhang, Jialin
    Zheng, Zhiqiang
    Song, Bo
    Chen, Jiaya
    Lin, Gang
    Chen, Linhai
    Sun, Weijiang
    Huang, Yan
    FOODS, 2023, 12 (03)
  • [28] Determination of Chlorogenic Acid, Rutin, Scopoletin and Total Polyphenol in Tobacco by Fourier Transform Near Infrared Spectroscopy
    Leng Hong-qiong
    Guo Ya-dong
    Liu Wei
    Zhang Tao
    Deng Liang
    Shen Zhi-qiang
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2013, 33 (07) : 1801 - 1804
  • [29] Quantitative analysis of palm carotene using Fourier transform infrared and near infrared spectroscopy
    Moh, M.H.
    Man, Y.B.Che
    Badlishah, B.S.
    Jinap, S.
    Saad, M.S.
    Abdullah, W.J.W.
    JAOCS, Journal of the American Oil Chemists' Society, 1999, 76 (02): : 249 - 254
  • [30] Quantitative analysis of palm carotene using Fourier transform infrared and near infrared spectroscopy
    Moh, MH
    Man, YBC
    Badlishah, BS
    Jinap, S
    Saad, MS
    Abdullah, WJW
    JOURNAL OF THE AMERICAN OIL CHEMISTS SOCIETY, 1999, 76 (02) : 249 - 254