Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis

Cited by: 7
Authors
Cote, Melina [1, 2]
Osseni, Mazid Abiodoun [3, 4]
Brassard, Didier [1, 2]
Carbonneau, Elise [1, 2]
Robitaille, Julie [1, 2]
Vohl, Marie-Claude [1, 2]
Lemieux, Simone [1, 2]
Laviolette, Francois [1, 3, 4]
Lamarche, Benoit [1, 2]
Affiliations
[1] Univ Laval, Ctr Nutr Sante & Societe NUTRISS, Inst Nutr Aliments Fonct Univ Laval INAF, Quebec City, PQ, Canada
[2] Univ Laval, Ecole Nutr, Quebec City, PQ, Canada
[3] Univ Laval, Ctr Rech Endonnees Mass CRDM, Quebec City, PQ, Canada
[4] Univ Laval, Grp Rech Apprentissage Automat Univ Laval GRAA, Quebec City, PQ, Canada
Source
FRONTIERS IN NUTRITION | 2022 / Vol. 9
Keywords
artificial intelligence; machine learning; statistical models; nutrition; prediction; dietary behaviour; CONVENTIONAL REGRESSION; ADHERENCE; VALIDITY; CANADA;
DOI
10.3389/fnut.2022.740898
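Chinese Library Classification (CLC) number
R15 [Nutritional hygiene, food hygiene]; TS201 [Basic science];
Discipline classification code
100403;
Abstract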
Machine learning (ML) algorithms may help to better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) consumption. The study used a large array of features (2,452 features from 525 variables) encompassing individual and environmental information related to dietary habits and food choices in a sample of 1,147 French-speaking adult men and women. Adequate VF consumption, defined as 5 servings/d or more, was measured by averaging data from three web-based 24 h recalls and used as the outcome to predict. Nine classification ML algorithms were compared with two traditional statistical predictive models, logistic regression and penalized regression (Lasso). The performance of the ML algorithms was tested after adjustments, including normalizing the data, and in a series of sensitivity analyses, such as using VF consumption obtained from a web-based food frequency questionnaire (wFFQ) and applying a feature selection algorithm in an attempt to reduce overfitting. Logistic regression and Lasso predicted adequate VF consumption with accuracies of 0.64 (95% confidence interval [CI]: 0.58-0.70) and 0.64 (95% CI: 0.60-0.68), respectively. Among the ML algorithms tested, the most accurate in predicting adequate VF consumption were the support vector machine (SVM) with either a radial basis kernel or a sigmoid kernel, both with an accuracy of 0.65 (95% CI: 0.59-0.71). The least accurate ML algorithm was the SVM with a linear kernel, with an accuracy of 0.55 (95% CI: 0.49-0.61). Using dietary intake data from the wFFQ and applying a feature selection algorithm had little to no impact on the performance of the algorithms.
In summary, ML algorithms and traditional statistical models predicted adequate VF consumption with similar accuracies among adults. These results suggest that additional research is needed to further explore the true potential of ML in predicting dietary behaviors that are determined by complex interactions among several individual, social and environmental factors.
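The outcome definition and the accuracy figures reported in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the example intakes, and the choice of a normal-approximation (Wald) interval are assumptions made here for illustration, since the abstract does not state how the confidence intervals were computed.

```python
import math
from statistics import mean

# Threshold taken from the abstract: adequate VF consumption is 5 servings/d or more.
ADEQUATE_VF_THRESHOLD = 5.0


def adequate_vf(recall_servings):
    """Return 1 if the mean VF intake across the recalls is >= 5 servings/d, else 0.

    `recall_servings` is a hypothetical list of daily VF servings,
    one value per 24 h recall (three recalls in the study).
    """
    return int(mean(recall_servings) >= ADEQUATE_VF_THRESHOLD)


def accuracy_with_ci(y_true, y_pred, z=1.96):
    """Return (accuracy, lower, upper) using a Wald interval.

    Assumption: a normal approximation with z = 1.96 for a 95% interval.
    The study may have used a different interval method.
    """
    n = len(y_true)
    if n == 0 or n != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and of equal length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    acc = correct / n
    half_width = z * math.sqrt(acc * (1.0 - acc) / n)
    # Clamp to [0, 1] because the Wald interval can overshoot near the bounds.
    return acc, max(0.0, acc - half_width), min(1.0, acc + half_width)


if __name__ == "__main__":
    # Hypothetical participants, each with three 24 h recalls (servings/d).
    recalls = [[4.0, 5.0, 6.0], [3.0, 4.0, 5.0], [7.5, 6.0, 5.5], [2.0, 1.5, 3.0]]
    y_true = [adequate_vf(r) for r in recalls]
    # Hypothetical classifier output for the same participants.
    y_pred = [1, 1, 1, 0]
    acc, lower, upper = accuracy_with_ci(y_true, y_pred)
    print(f"accuracy={acc:.2f} (95% CI: {lower:.2f}-{upper:.2f})")
```

With samples this small the Wald interval is very wide and is clamped at the bounds; a Wilson or bootstrap interval would be a reasonable alternative on a realistically sized test set.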
Pages: 11
Related papers
23 in total
  • [1] A NEW MACHINE LEARNING-BASED MODEL IS MORE ACCURATE THAN TRADITIONAL MODELS IN PREDICTING SURVIVAL OF PATIENTS WITH ESOPHAGEAL ADENOCARCINOMA
    Mohapatra, Sonmoon
    Das, Amit
    Ngamruengphong, Saowanee
    [J]. GASTROENTEROLOGY, 2021, 160 (06) : S378 - S378
  • [2] Machine Learning Models for Predicting Water Quality of Treated Fruit and Vegetable Wastewater
    Mundi, Gurvinder
    Zytner, Richard G.
    Warriner, Keith
    Bonakdari, Hossein
    Gharabaghi, Bahram
    [J]. WATER, 2021, 13 (18)
  • [3] Comparative analysis of machine learning algorithms and statistical models for predicting crown width of Larix olgensis
    Siyu Qiu
    Ruiting Liang
    Yifu Wang
    Mi Luo
    Yujun Sun
    [J]. Earth Science Informatics, 2022, 15 : 2415 - 2429
  • [4] Comparative analysis of machine learning algorithms and statistical models for predicting crown width of Larix olgensis
    Qiu, Siyu
    Liang, Ruiting
    Wang, Yifu
    Luo, Mi
    Sun, Yujun
    [J]. EARTH SCIENCE INFORMATICS, 2022, 15 (04) : 2415 - 2429
  • [5] Machine Learning Based Predictive Models Are More Accurate Than TNM Staging in Predicting Survival in Patients With Pancreatic Cancer
    Das, Amit
    Ngamruengphong, Saowanee
    [J]. AMERICAN JOURNAL OF GASTROENTEROLOGY, 2019, 114 : S48 - S48
  • [6] Is Predicting Software Security Bugs using Deep Learning Better than the Traditional Machine Learning Algorithms?
    Clemente, Caesar Jude
    Jaafar, Fehmi
    Malik, Yasir
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2018), 2018, : 95 - 102
  • [7] MACHINE LEARNING-BASED PREDICTIVE MODELS ARE MORE ACCURATE THAN TNM STAGE IN PREDICTING SURVIVAL IN PATIENTS WITH GASTRIC CANCER
    Das, Amit
    Mohapatra, Sonmoon
    Ngamruengphong, Saowanee
    [J]. GASTROENTEROLOGY, 2020, 158 (06) : S782 - S783
  • [8] Machine Learning Models are More Accurate Than Regression-based Models for Predicting Functional Impairment Risk in Acute Ischemic Stroke.
    Alaka, Shakiru A.
    Brobbey, Anita
    Menon, Bijoy K.
    Williamson, Tyler
    Goyal, Mayank
    Demchuk, Andrew M.
    Hill, Michael D.
    Sajobi, Tolulope
    [J]. STROKE, 2019, 50
  • [9] A NEW MACHINE LEARNING-BASED PREDICTIVE MODEL IS MORE ACCURATE THAN TRADITIONAL ANALYTIC MODELS IN PREDICTING 1-YEAR SURVIVAL OF PATIENTS WITH ESOPHAGEAL SQUAMOUS CELL CARCINOMA
    Das, Amit
    Mohapatra, Sonmoon
    Ngamruengphong, Saowanee
    [J]. GASTROENTEROLOGY, 2021, 160 (06) : S601 - S601
  • [10] Predicting incident dementia in cerebral small vessel disease: comparison of machine learning and traditional statistical models
    Li, Rui
    Harshfield, Eric L.
    Bell, Steven
    Burkhart, Michael
    Tuladhar, Anil M.
    Hilal, Saima
    Tozer, Daniel J.
    Chappell, Francesca M.
    Makin, Stephen D. J.
    Lo, Jessica W.
    Wardlaw, Joanna M.
    de Leeuw, Frank-Erik
    Chen, Christopher
    Kourtzi, Zoe
    Markus, Hugh S.
    [J]. CEREBRAL CIRCULATION - COGNITION AND BEHAVIOR, 2023, 5