Synchronously Predicting Tea Polyphenol and Epigallocatechin Gallate in Tea Leaves Using Fourier Transform-Near-Infrared Spectroscopy and Machine Learning

被引:10
|
作者
Ye, Sitan [1 ]
Weng, Haiyong [2 ,3 ]
Xiang, Lirong [4 ]
Jia, Liangquan [5 ]
Xu, Jinchai [2 ,3 ]
机构
[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, England
[2] Fujian Agr & Forestry Univ, Coll Mech & Elect Engn, Fujian Key Lab Agr Informat Sensoring Technol, Fuzhou 350100, Peoples R China
[3] Fujian Agr & Forestry Univ, Haixia Inst Sci & Technol, Sch Future Technol, Fuzhou 350002, Peoples R China
[4] North Carolina State Univ, Dept Biol & Agr Engn, Raleigh, NC 27606 USA
[5] Huzhou Univ, Sch Informat Engn, Huzhou 313000, Peoples R China
来源
MOLECULES | 2023年 / 28卷 / 14期
关键词
tea polyphenol; EGCG; Fourier Transform-near-infrared spectroscopy; machine learning; rapid prediction; SUPPORT VECTOR REGRESSION; GREEN TEA; SPECTRA; OPTIMIZATION; COMBINATION; PLSR; NIR;
D O I
10.3390/molecules28145379
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Tea polyphenol and epigallocatechin gallate (EGCG) were considered as key components of tea. The rapid prediction of these two components can be beneficial for tea quality control and product development for tea producers, breeders and consumers. This study aimed to develop reliable models for tea polyphenols and EGCG content prediction during the breeding process using Fourier Transform-near infrared (FT-NIR) spectroscopy combined with machine learning algorithms. Various spectral preprocessing methods including Savitzky-Golay smoothing (SG), standard normal variate (SNV), vector normalization (VN), multiplicative scatter correction (MSC) and first derivative (FD) were applied to improve the quality of the collected spectra. Partial least squares regression (PLSR) and least squares support vector regression (LS-SVR) were introduced to establish models for tea polyphenol and EGCG content prediction based on different preprocessed spectral data. Variable selection algorithms, including competitive adaptive reweighted sampling (CARS) and random forest (RF), were further utilized to identify key spectral bands to improve the efficiency of the models. The results demonstrate that the optimal model for tea polyphenols calibration was the LS-SVR with Rp = 0.975 and RPD = 4.540 based on SG-smoothed full spectra. For EGCG detection, the best model was the LS-SVR with Rp = 0.936 and RPD = 2.841 using full original spectra as model inputs. The application of variable selection algorithms further improved the predictive performance of the models. The LS-SVR model for tea polyphenols prediction with Rp = 0.978 and RPD = 4.833 used 30 CARS-selected variables, while the LS-SVR model build on 27 RF-selected variables achieved the best predictive ability with Rp = 0.944 and RPD = 3.049, respectively, for EGCG prediction. The results demonstrate a potential of FT-NIR spectroscopy combined with machine learning for the rapid screening of genotypes with high tea polyphenol and EGCG content in tea leaves.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Non-destructive determination of four tea polyphenols in fresh tea using visible and near-infrared spectroscopy
    Luo, Wei
    Tian, Peng
    Fan, Guozhu
    Dong, Wentao
    Zhang, Hailiang
    Liu, Xuemei
    INFRARED PHYSICS & TECHNOLOGY, 2022, 123
  • [32] Selective Extraction of (-)Epigallocatechin Gallate from Green Tea Leaves Using Two-Stage Infusion Coupled with Membrane Separation
    Kumar, Abhishek
    Thakur, Barun Kumar
    De, Sirshendu
    FOOD AND BIOPROCESS TECHNOLOGY, 2012, 5 (06) : 2568 - 2577
  • [33] Selective Extraction of (−)Epigallocatechin Gallate from Green Tea Leaves Using Two-Stage Infusion Coupled with Membrane Separation
    Abhishek Kumar
    Barun Kumar Thakur
    Sirshendu De
    Food and Bioprocess Technology, 2012, 5 : 2568 - 2577
  • [34] Measurement of four main catechins content in green tea based on visible and near-infrared spectroscopy using optimized machine learning algorithm
    Luo, Wei
    Li, Wenyoujia
    Liu, Shuling
    Li, Qicheng
    Huang, Haihua
    Zhang, Hailiang
    JOURNAL OF FOOD COMPOSITION AND ANALYSIS, 2025, 138
  • [35] Comparison of machine learning models for classifying edible oils using Fourier-transform infrared spectroscopy
    Lim, Hyeona
    Lee, Seon Yeong
    Kim, Jin Young
    Shin, Yeon Ju
    Jang, Yerin
    Kim, Hyeonjin
    Kim, Byung Hee
    Ahn, Sangdoo
    BULLETIN OF THE KOREAN CHEMICAL SOCIETY, 2025, 46 (02) : 131 - 137
  • [36] Chemometric Models for the Quantitative Descriptive Sensory Properties of Green Tea (Camellia sinensis L.) Using Fourier Transform Near Infrared (FT-NIR) Spectroscopy
    Jiang, Hui
    Chen, Quansheng
    FOOD ANALYTICAL METHODS, 2015, 8 (04) : 954 - 962
  • [37] Chemometric Models for the Quantitative Descriptive Sensory Properties of Green Tea (Camellia sinensis L.) Using Fourier Transform Near Infrared (FT-NIR) Spectroscopy
    Hui Jiang
    Quansheng Chen
    Food Analytical Methods, 2015, 8 : 954 - 962
  • [38] The future of fish age estimation: deep machine learning coupled with Fourier transform near-infrared spectroscopy of otoliths
    Benson, Irina M.
    Helser, Thomas E.
    Marchetti, Giovanni
    Barnett, Beverly K.
    CANADIAN JOURNAL OF FISHERIES AND AQUATIC SCIENCES, 2023, 80 (09) : 1482 - 1494
  • [39] Near-infrared spectroscopy and machine learning-based technique to predict quality-related parameters in instant tea
    Bai, Xiaoli
    Zhang, Lei
    Kang, Chaoyan
    Quan, Bingyan
    Zheng, Yu
    Zhang, Xianglong
    Song, Jia
    Xia, Ting
    Wang, Min
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [40] Near-infrared spectroscopy and machine learning-based technique to predict quality-related parameters in instant tea
    Xiaoli Bai
    Lei Zhang
    Chaoyan Kang
    Bingyan Quan
    Yu Zheng
    Xianglong Zhang
    Jia Song
    Ting Xia
    Min Wang
    Scientific Reports, 12