UPLC-Q-TOF-MS/MS combined with machine learning methods for screening quality indicators of Hypericum perforatum L.

被引:4
|
作者
Zhang, Zhiyong [1 ]
Ying, Zehua [1 ]
He, Mulan [1 ]
Zhang, Yijing [1 ]
Nie, Wennan [1 ]
Tang, Zhenhao [1 ]
Liu, Wengang [3 ]
Chen, Jingchao [3 ]
Ye, Jianming [3 ]
Li, Wenlong [1 ,2 ]
机构
[1] Tianjin Univ Tradit Chinese Med, Coll Pharmaceut Engn Tradit Chinese Med, Tianjin 301617, Peoples R China
[2] Haihe Lab Modern Chinese Med, Tianjin 301617, Peoples R China
[3] Chengdu Kanghong Pharmaceut Co Ltd, Chengdu 610036, Peoples R China
关键词
Hypericum perforatum L; UPLC-Q-TOF-MS/MS; Machine learning; Quality indicators;
D O I
10.1016/j.jpba.2024.116313
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Hypericum perforatum L. (HPL), also known as St. John's wort, is one of the extensively researched domestically and internationally as a medicinal plant. In this study, non-targeted metabolomics combined with machine learning methods were used to identify reasonable quality indicators for the holistic quality control of HPL. First, the high-resolution MS data from different samples of HPL were collected, and visualized the chemical compounds through the MS molecular network. A total of 122 compounds were identified. Then, the orthogonal partial least squares-discriminant analysis (OPLS-DA) model was established for comparing the differences in metabolite expression between flower, leaf, and branches. A total of 46 differential metabolites were screened out. Subsequently, analyzing the pharmacological activities of these differential metabolites based on proteinprotein interaction (PPI) network. A total of 25 compounds associated with 473 gene targets were retrieved. Among them, 13 highly active compounds were selected as potential quality markers, and five compounds were ultimately selected as quality control markers for HPL. Finally, three different classifiers (support vector machine (SVM), random forest (RF), and K-nearest neighbor (KNN)) were used to validate whether the selected quality control markers are qualified. When the feature count is set to 122 and 46, the RF model demonstrates optimal performance. As the number of variables decreases, the performance of the RF model degrades. The KNN model and the SVM model also exhibit a decrease in performance but still manage to satisfy the intended requirements. The strategy can be applied to the quality control of HPL and can provide a reference for the quality control of other herbal medicines.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Chemical profiling and quantitation of bioactive compounds in Platycladi Cacumen by UPLC-Q-TOF-MS/MS and UPLC-DAD
    Zhuang, Bo
    Bi, Zhi-Ming
    Wang, Zi-Yuan
    Duan, Li
    Lai, Chang-Jiang-Sheng
    Liu, E-Hu
    JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2018, 154 : 207 - 215
  • [32] UPLC-Q-TOF-MS/MS和UPLC-QQQ-MS/MS分析洋甘菊的化学成分
    邬旻珊
    肖观林
    谢灿辉
    胥爱丽
    分析测试学报, 2024, 43 (11) : 1803 - 1812
  • [33] UPLC-Q-TOF-MS/MS-guided dereplication of Pulsatilla chinensis to identify triterpenoid saponins
    Jin, Miao-miao
    Zhang, Wen-dan
    Jiang, Hong-hong
    Du, Ying-feng
    Guo, Wei
    Cao, Liang
    Xu, Hui-jun
    PHYTOCHEMICAL ANALYSIS, 2018, 29 (05) : 516 - 527
  • [34] Determination of Chemical Constituents in Flowers and Leaves of Hemp by GC-MS and UPLC-Q-TOF-MS
    Sun, Jing
    Zhou, Jing
    Luo, Jiajia
    Xu, Yunhui
    Hua, Moli
    Chinese Journal of Pharmaceuticals, 2022, 53 (10) : 1465 - 1481
  • [35] Comparison of metabolomics of Dendrobium officinale in different habitats by UPLC-Q-TOF-MS
    Yang, Juan
    Han, Xu
    Wang, Hao-Yu
    Yang, Jian
    Kuang, Yi
    Ji, Kai-Yue
    Yang, Yi
    Pang, Kai
    Yang, Sheng-Xiang
    Qin, Jian-Chun
    Wang, Hong-Peng
    BIOCHEMICAL SYSTEMATICS AND ECOLOGY, 2020, 89
  • [36] Analysis of Triacylglycerols in Sumac (Rhus typhina L.) Seed Oil from Different Origins by UPLC-Q-TOF-MS
    Zhang, Tao
    Jiang, Zhongrong
    Tao, Guanjun
    Liu, Ruijie
    Chang, Ming
    Jin, Qingzhe
    Wang, Xingguo
    FOOD ANALYTICAL METHODS, 2022, 15 (01) : 26 - 33
  • [37] Analysis of Triacylglycerols in Sumac (Rhus typhina L.) Seed Oil from Different Origins by UPLC-Q-TOF-MS
    Tao Zhang
    Zhongrong Jiang
    Guanjun Tao
    Ruijie Liu
    Ming Chang
    Qingzhe Jin
    Xingguo Wang
    Food Analytical Methods, 2022, 15 : 26 - 33
  • [38] 基于UPLC-Q-TOF-MS/MS分析藜芦中化学成分
    贾晓华
    王维
    高梦园
    欧阳慧子
    何俊
    天津中医药大学学报, 2021, 40 (01) : 104 - 111
  • [39] Rapid Identification of the Chemical Components of Ilex rotunda Thunb Using UPLC-Q-TOF-MS/MS
    Chen, Xinxin
    Li, Junmao
    Yuan, Renyikun
    Zhuo, Youqiong
    Chen, Yangling
    Zhang, Chenwei
    Chen, Ming
    Gao, Hong-Wei
    Liu, Zhenjie
    Feng, Yulin
    Wang, Zheng-Tao
    Yang, Shilin
    JOURNAL OF CHEMISTRY, 2021, 2021
  • [40] 基于UPLC-Q-TOF-MS/MS的丹参饮体内外成分鉴定
    张思彤
    胡先润
    刘文康
    雷金春
    程雪梅
    吴晓俊
    陈万生
    李曼琳
    王长虹
    中国实验方剂学杂志, 2025, 31 (09) : 175 - 183