Predicting health effects of food compounds via ensemble machine learning

被引:0
|
作者
Mei, Suyu [1 ]
机构
[1] Shenyang Normal Univ, Software Coll, Shenyang 110034, Peoples R China
关键词
Ensemble learning; food ingredient bioactivities; MACCS fingerprints; machine learning; multi-label learning; natural products; transfer learning; NATURAL-PRODUCTS;
D O I
10.1111/ijfs.16992
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
Identifying natural or synthetic compounds in foods and assaying their bioactivities have significantly contributed to promoting human health. In this work, we propose a machine learning framework to predict 101 classes of health effects of food compounds at a large scale. In this framework, random undersampling boosting (RUSBoost) is used as base learners to tackle the problem of skewed class distributions and MACCSKeys similarity spectra are proposed as a feature engineering strategy to represent chemical molecules including food compounds, natural products and drugs. Computational results show that RUSBoost learners encouragingly reduce model biases, and that the knowledge learnt from food compounds is well transferable to natural products (0.8406-0.9040 recall rates for antibacterial, antivirals, pesticide and anticancer effects) and drugs (0.789-0.9690 recall rates for antibacterial, antiviral, antineoplastic and analgesic effects). and. Dozens of novel effects have been validated in recent literature. These pieces of evidence show that the proposed framework could help us to find lead compounds from food as potential pharmaceuticals and repurpose drugs for anticancer, antiviral or antibacterial therapies. Finally, we use the proposed framework to predict beneficial and risky health effects of food flavour compounds as case studies for recipe composing. Predicting 101 classes of health effects of food compounds at a large scale via random undersampling boosting (RUSBoost). Convincingly demonstrating knowledge transferability between food compounds, natural products and drugs via independent test and literature validation. Revealing associations and drug repurposing between carcinogenesis, inflammation and viral infection. image
引用
收藏
页码:2547 / 2557
页数:11
相关论文
共 50 条
  • [1] Ensemble machine learning framework for predicting maternal health risk during pregnancy
    Khadidos, Alaa O.
    Saleem, Farrukh
    Selvarajan, Shitharth
    Ullah, Zahid
    Khadidos, Adil O.
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01):
  • [2] Machine Learning Ensemble Modelling for Predicting Unemployment Duration
    Gabrikova, Barbora
    Svabova, Lucia
    Kramarova, Katarina
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (18):
  • [3] Ensemble Voting Schemes that Improve Machine Learning Models for Predicting the Effects of Protein Mutations
    Gunderson, Sarah
    Jagodzinski, Filip
    [J]. ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2018, : 211 - 219
  • [4] Ensemble-Based Machine Learning for Predicting Sudden Human Fall Using Health Data
    Saxena, Utkarsh
    Moulik, Soumen
    Nayak, Soumya Ranjan
    Hanne, Thomas
    Roy, Diptendu Sinha
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [5] Predicting earnings management through machine learning ensemble classifiers
    Hammami, Ahmad
    Zadeh, Mohammad Hendijani
    [J]. JOURNAL OF FORECASTING, 2022, 41 (08) : 1639 - 1660
  • [6] Ensemble Machine Learning Classification Models for Predicting Pavement Condition
    Chung, Frederick
    Doyle, Andy
    Robinson, Ernay
    Paik, Yejee
    Li, Mingshu
    Baek, Minsoo
    Moore, Brian
    Ashuri, Baabak
    [J]. TRANSPORTATION RESEARCH RECORD, 2024,
  • [7] Predicting Miscarriage and Stillbirth Using Weighted Ensemble Machine Learning
    Lokhande, Anagha
    Gimovsky, Alexis
    Sarkar, Indra
    [J]. OBSTETRICS AND GYNECOLOGY, 2023, 141 : 28S - 28S
  • [8] Predicting the Olea pollen concentration with a machine learning algorithm ensemble
    José María Cordero
    J. Rojo
    A. Montserrat Gutiérrez-Bustillo
    Adolfo Narros
    Rafael Borge
    [J]. International Journal of Biometeorology, 2021, 65 : 541 - 554
  • [9] Predicting the Olea pollen concentration with a machine learning algorithm ensemble
    Cordero, Jose Maria
    Rojo, J.
    Gutierrez-Bustillo, A. Montserrat
    Narros, Adolfo
    Borge, Rafael
    [J]. INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 2021, 65 (04) : 541 - 554
  • [10] Predicting Anti-inflammatory Peptides by Ensemble Machine Learning and Deep Learning
    Guan, Jiahui
    Yao, Lantian
    Chung, Chia-Ru
    Xie, Peilin
    Zhang, Yilun
    Deng, Junyang
    Chiang, Ying-Chih
    Lee, Tzong-Yi
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2023, 63 (24) : 7886 - 7898