Synchronously Predicting Tea Polyphenol and Epigallocatechin Gallate in Tea Leaves Using Fourier Transform-Near-Infrared Spectroscopy and Machine Learning

被引:10
|
作者
Ye, Sitan [1 ]
Weng, Haiyong [2 ,3 ]
Xiang, Lirong [4 ]
Jia, Liangquan [5 ]
Xu, Jinchai [2 ,3 ]
机构
[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, England
[2] Fujian Agr & Forestry Univ, Coll Mech & Elect Engn, Fujian Key Lab Agr Informat Sensoring Technol, Fuzhou 350100, Peoples R China
[3] Fujian Agr & Forestry Univ, Haixia Inst Sci & Technol, Sch Future Technol, Fuzhou 350002, Peoples R China
[4] North Carolina State Univ, Dept Biol & Agr Engn, Raleigh, NC 27606 USA
[5] Huzhou Univ, Sch Informat Engn, Huzhou 313000, Peoples R China
来源
MOLECULES | 2023年 / 28卷 / 14期
关键词
tea polyphenol; EGCG; Fourier Transform-near-infrared spectroscopy; machine learning; rapid prediction; SUPPORT VECTOR REGRESSION; GREEN TEA; SPECTRA; OPTIMIZATION; COMBINATION; PLSR; NIR;
D O I
10.3390/molecules28145379
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Tea polyphenol and epigallocatechin gallate (EGCG) were considered as key components of tea. The rapid prediction of these two components can be beneficial for tea quality control and product development for tea producers, breeders and consumers. This study aimed to develop reliable models for tea polyphenols and EGCG content prediction during the breeding process using Fourier Transform-near infrared (FT-NIR) spectroscopy combined with machine learning algorithms. Various spectral preprocessing methods including Savitzky-Golay smoothing (SG), standard normal variate (SNV), vector normalization (VN), multiplicative scatter correction (MSC) and first derivative (FD) were applied to improve the quality of the collected spectra. Partial least squares regression (PLSR) and least squares support vector regression (LS-SVR) were introduced to establish models for tea polyphenol and EGCG content prediction based on different preprocessed spectral data. Variable selection algorithms, including competitive adaptive reweighted sampling (CARS) and random forest (RF), were further utilized to identify key spectral bands to improve the efficiency of the models. The results demonstrate that the optimal model for tea polyphenols calibration was the LS-SVR with Rp = 0.975 and RPD = 4.540 based on SG-smoothed full spectra. For EGCG detection, the best model was the LS-SVR with Rp = 0.936 and RPD = 2.841 using full original spectra as model inputs. The application of variable selection algorithms further improved the predictive performance of the models. The LS-SVR model for tea polyphenols prediction with Rp = 0.978 and RPD = 4.833 used 30 CARS-selected variables, while the LS-SVR model build on 27 RF-selected variables achieved the best predictive ability with Rp = 0.944 and RPD = 3.049, respectively, for EGCG prediction. The results demonstrate a potential of FT-NIR spectroscopy combined with machine learning for the rapid screening of genotypes with high tea polyphenol and EGCG content in tea leaves.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Identification of Xihu Longjing Tea by PLS Model Using Near-Infrared Spectroscopy
    Zhou Jian
    Cheng Hao
    He Wei
    Wang Li-yuan
    Wu Di
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2009, 29 (05) : 1251 - 1254
  • [42] Detection of volatile organic compounds in adulterated tea using Fourier transform infrared spectroscopy and Proton-transfer-reaction mass spectrometry
    Yang, Chongshan
    Duan, Dandan
    Dong, Chunwang
    Li, Chuanxia
    Li, Guanglin
    Zhou, Yunhai
    Gu, Yifan
    Liu, Yachao
    Zhao, Chunjiang
    Dong, Daming
    FOOD CHEMISTRY, 2023, 423
  • [43] Fourier transform infrared spectroscopy and machine learning for Porphyromonas gingivalis detection in oral bacteria
    Kaneda, Tomomi
    Watanabe, Masahiro
    Honda, Hidehiko
    Yamamoto, Masato
    Inagaki, Takae
    Hironaka, Shouji
    ANALYTICAL SCIENCES, 2024, 40 (04) : 691 - 699
  • [44] Fourier transform infrared spectroscopy and machine learning for Porphyromonas gingivalis detection in oral bacteria
    Tomomi Kaneda
    Masahiro Watanabe
    Hidehiko Honda
    Masato Yamamoto
    Takae Inagaki
    Shouji Hironaka
    Analytical Sciences, 2024, 40 : 691 - 699
  • [45] A machine learning model for the classification of illicit drug substances with Fourier transform infrared spectroscopy
    Chang, Kah Haw
    Chua, Hui Na
    MICROCHEMICAL JOURNAL, 2025, 212
  • [46] TeaNet: Deep learning on Near-Infrared Spectroscopy (NIR) data for the assurance of tea quality
    Yang, Jingru
    Wang, Jin
    Lu, Guodong
    Fei, Shaomei
    Yan, Ting
    Zhang, Cheng
    Lu, Xiaohui
    Yu, Zhiyong
    Li, Wencui
    Tang, Xiaolin
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 190
  • [47] Identification and quantitation of oxygenates in gasoline ampules using Fourier transform near-infrared and Fourier transform Raman spectroscopy
    Choquette, SJ
    Chesler, SN
    Duewer, DL
    Wang, SW
    OHaver, TC
    ANALYTICAL CHEMISTRY, 1996, 68 (20) : 3525 - 3533
  • [48] Rapid detection of serological biomarkers in gallbladder carcinoma using fourier transform infrared spectroscopy combined with machine learning
    Dou, Jingrui
    Dawuti, Wubulitalifu
    Li, Jintian
    Zhao, Hui
    Zhou, Run
    Zhou, Jing
    Lin, Renyong
    Lu, Guodong
    TALANTA, 2023, 259
  • [49] Application of near-infrared reflectance spectroscopy to the simultaneous prediction of alkaloids and phenolic substances in green tea leaves
    Schulz, H
    Engelhardt, UH
    Wegent, A
    Drews, HH
    Lapczynski, S
    JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 1999, 47 (12) : 5064 - 5067
  • [50] Optical Determination of Lead Chrome Green in Green Tea by Fourier Transform Infrared (FT-IR) Transmission Spectroscopy
    Li, Xiaoli
    Xu, Kaiwen
    Zhang, Yuying
    Sun, Chanjun
    He, Yong
    PLOS ONE, 2017, 12 (01):