Synchronously Predicting Tea Polyphenol and Epigallocatechin Gallate in Tea Leaves Using Fourier Transform-Near-Infrared Spectroscopy and Machine Learning

被引:10
|
作者
Ye, Sitan [1 ]
Weng, Haiyong [2 ,3 ]
Xiang, Lirong [4 ]
Jia, Liangquan [5 ]
Xu, Jinchai [2 ,3 ]
机构
[1] Newcastle Univ, Sch Engn, Newcastle Upon Tyne NE1 7RU, England
[2] Fujian Agr & Forestry Univ, Coll Mech & Elect Engn, Fujian Key Lab Agr Informat Sensoring Technol, Fuzhou 350100, Peoples R China
[3] Fujian Agr & Forestry Univ, Haixia Inst Sci & Technol, Sch Future Technol, Fuzhou 350002, Peoples R China
[4] North Carolina State Univ, Dept Biol & Agr Engn, Raleigh, NC 27606 USA
[5] Huzhou Univ, Sch Informat Engn, Huzhou 313000, Peoples R China
来源
MOLECULES | 2023年 / 28卷 / 14期
关键词
tea polyphenol; EGCG; Fourier Transform-near-infrared spectroscopy; machine learning; rapid prediction; SUPPORT VECTOR REGRESSION; GREEN TEA; SPECTRA; OPTIMIZATION; COMBINATION; PLSR; NIR;
D O I
10.3390/molecules28145379
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Tea polyphenol and epigallocatechin gallate (EGCG) were considered as key components of tea. The rapid prediction of these two components can be beneficial for tea quality control and product development for tea producers, breeders and consumers. This study aimed to develop reliable models for tea polyphenols and EGCG content prediction during the breeding process using Fourier Transform-near infrared (FT-NIR) spectroscopy combined with machine learning algorithms. Various spectral preprocessing methods including Savitzky-Golay smoothing (SG), standard normal variate (SNV), vector normalization (VN), multiplicative scatter correction (MSC) and first derivative (FD) were applied to improve the quality of the collected spectra. Partial least squares regression (PLSR) and least squares support vector regression (LS-SVR) were introduced to establish models for tea polyphenol and EGCG content prediction based on different preprocessed spectral data. Variable selection algorithms, including competitive adaptive reweighted sampling (CARS) and random forest (RF), were further utilized to identify key spectral bands to improve the efficiency of the models. The results demonstrate that the optimal model for tea polyphenols calibration was the LS-SVR with Rp = 0.975 and RPD = 4.540 based on SG-smoothed full spectra. For EGCG detection, the best model was the LS-SVR with Rp = 0.936 and RPD = 2.841 using full original spectra as model inputs. The application of variable selection algorithms further improved the predictive performance of the models. The LS-SVR model for tea polyphenols prediction with Rp = 0.978 and RPD = 4.833 used 30 CARS-selected variables, while the LS-SVR model build on 27 RF-selected variables achieved the best predictive ability with Rp = 0.944 and RPD = 3.049, respectively, for EGCG prediction. The results demonstrate a potential of FT-NIR spectroscopy combined with machine learning for the rapid screening of genotypes with high tea polyphenol and EGCG content in tea leaves.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Extraction of Epigallocatechin Gallate and Epicatechin Gallate from Tea Leaves Using -Cyclodextrin
    Cui, Lu
    Liu, Yuxuan
    Liu, Ting
    Yuan, Yahong
    Yue, Tianli
    Cai, Rui
    Wang, Zhouli
    JOURNAL OF FOOD SCIENCE, 2017, 82 (02) : 394 - 400
  • [2] DETERMINATION OF β-CAROTENE AND LUTEIN IN GREEN TEA USING FOURIER TRANSFORM INFRARED SPECTROSCOPY
    He, Y.
    Zhao, Y. Y.
    Zhang, C.
    Sun, C. J.
    Li, X. L.
    TRANSACTIONS OF THE ASABE, 2019, 62 (01) : 75 - 81
  • [3] Identification of Pu'er tea by Fourier transform infrared spectroscopy
    Zhou Xiang-Ping
    Liu Gang
    Shi You-Ming
    Dong Qin
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2008, 28 (03) : 594 - 596
  • [4] Identification of Pu'er tea by Fourier transform infrared spectroscopy
    Zhou, Xiang-Ping
    Liu, Gang
    Shi, You-Ming
    Dong, Qin
    1600, Science Press, 18,Shuangqing Street,Haidian, Beijing, 100085, China (28):
  • [5] Studies on ANN models of determination of tea polyphenol and amylose in tea by near-infrared spectroscopy
    Luo, YF
    Guo, ZF
    Zhu, ZY
    Wang, CP
    Jiang, HY
    Han, BY
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2005, 25 (08) : 1230 - 1233
  • [6] Predicting the Age and Type of Tuocha Tea by Fourier Transform Infrared Spectroscopy and Chemometric Data Analysis
    Xu, Lu
    Deng, De-Hua
    Cai, Chen-Bo
    JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2011, 59 (19) : 10461 - 10469
  • [7] Prediction of Polyphenol Content in Tea Leaves Using NIR Spectroscopy
    Chanda, Somdeb
    De, Ashmita
    Tudu, Bipan
    Bandyopadhyay, Rajib
    Hazarika, Ajanto Kumar
    Sabhapondit, Santanu
    Baruah, B. D.
    Tamuly, Pradip
    Bhattachryya, Nabarun
    2016 INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL POWER AND INSTRUMENTATION (ICICPI), 2016, : 51 - 55
  • [8] Prediction of Japanese green tea ranking by Fourier transform near-infrared reflectance spectroscopy
    Ikeda, Tatsuhiko
    Kanaya, Shigehiko
    Yonetani, Tsutomu
    Kobayashi, Akio
    Fukusaki, Eiichiro
    JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2007, 55 (24) : 9908 - 9912
  • [9] Rapid Determination of Chlorophyll and Pheophytin in Green Tea Using Fourier Transform Infrared Spectroscopy
    Li, Xiaoli
    Zhou, Ruiqing
    Xu, Kaiwen
    Xu, Jie
    Jin, Juanjuan
    Fang, Hui
    He, Yong
    MOLECULES, 2018, 23 (05):
  • [10] Application of Fourier transform near-infrared spectroscopy to optimization of green tea steaming process conditions
    Ono, Daiki
    Bamba, Takeshi
    Oku, Yuichi
    Yonetani, Tsutomu
    Fukusaki, Eiichiro
    JOURNAL OF BIOSCIENCE AND BIOENGINEERING, 2011, 112 (03) : 247 - 251