Perspective: Big Data and Machine Learning Could Help Advance Nutritional Epidemiology

被引:43
|
作者
Morgenstern, Jason D. [1 ]
Rosella, Laura C. [2 ,3 ]
Costa, Andrew P. [1 ]
de Souza, Russell J. [1 ,4 ]
Anderson, Laura N. [1 ]
机构
[1] McMaster Univ, Dept Hlth Res Methods Evidencg & Impact, Hamilton, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[3] Vector Inst, Toronto, ON, Canada
[4] Hamilton Hlth Sci, Populat Hlth Res Inst, Hamilton, ON, Canada
基金
加拿大健康研究院;
关键词
machine learning; big data; nutritional epidemiology; artificial intelligence; nutritional sciences; diet; nutrition; precision nutrition; PROPENSITY SCORE ESTIMATION; MEASUREMENT ERROR; CARDIOVASCULAR-DISEASE; LOGISTIC-REGRESSION; CONFOUNDER CONTROL; NEGATIVE CONTROLS; 24-HOUR RECALL; SELECTION; DIET; RISK;
D O I
10.1093/advances/nmaa183
中图分类号
R15 [营养卫生、食品卫生]; TS201 [基础科学];
学科分类号
100403 ;
摘要
The field of nutritional epidemiology faces challenges posed by measurement error, diet as a complex exposure, and residual confounding. The objective of this perspective article is to highlight how developments in big data and machine learning can help address these challenges. New methods of collecting 24-h dietary recalls and recording diet could enable larger samples and more repeated measures to increase statistical power and measurement precision. In addition, use of machine learning to automatically classify pictures of food could become a useful complimentary method to help improve precision and validity of dietary measurements. Diet is complex due to thousands of different foods that are consumed in varying proportions, fluctuating quantities over time, and differing combinations. Current dietary pattern methods may not integrate sufficient dietary variation, and most traditional modeling approaches have limited incorporation of interactions and nonlinearity. Machine learning could help better model diet as a complex exposure with nonadditive and nonlinear associations. Last, novel big data sources could help avoid unmeasured confounding by offering more covariates, including both omics and features derived from unstructured data with machine learning methods. These opportunities notwithstanding, application of big data and machine learning must be approached cautiously to ensure quality of dietary measurements, avoid overfitting, and confirm accurate interpretations. Greater use of machine learning and big data would also require substantial investments in training, collaborations, and computing infrastructure. Overall, we propose that judicious application of big data and machine learning in nutrition science could offer new means of dietary measurement, more tools to model the complexity of diet and its relations with diseases, and additional potential ways of addressing confounding.
引用
收藏
页码:621 / 631
页数:11
相关论文
共 50 条
  • [21] Machine Translation from Big Data Perspective
    Myrzakhmetov, Bagdat
    Yessenbayev, Zhandos
    Makazhanov, Aibek
    2017 11TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT 2017), 2017, : 35 - 39
  • [22] Machine learning on big data for future computing
    Jeong, Young-Sik
    Hassan, Houcine
    Sangaiah, Arun Kumar
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (06): : 2925 - 2929
  • [23] Machine learning on big data for future computing
    Young-Sik Jeong
    Houcine Hassan
    Arun Kumar Sangaiah
    The Journal of Supercomputing, 2019, 75 : 2925 - 2929
  • [24] Machine Learning With Big Data: Challenges and Approaches
    L'Heureux, Alexandra
    Grolinger, Katarina
    Elyamany, Hany F.
    Capretz, Miriam A. M.
    IEEE ACCESS, 2017, 5 : 7776 - 7797
  • [25] Machine Learning Meets Big Spatial Data
    Sabek, Ibrahim
    Mokbel, Mohamed F.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (12): : 1982 - 1985
  • [26] Machine Learning Challenges in Big Data Era
    Veganzones-Bodon, Miguel
    DYNA, 2019, 94 (05): : 478 - 479
  • [27] Machine Learning for Astronomical Big Data Processing
    Xu, Long
    Yan, Yihua
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [28] Machine learning in 'big data': handle with care
    Loring, Zak
    Mehrotra, Suchit
    Piccini, Jonathan P.
    EUROPACE, 2019, 21 (09): : 1284 - 1285
  • [29] Machine Learning and Computational Intelligence in Big Data
    Anagnostopoulos, Christos
    Kolomvatsos, Kostas
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2015, 6 (06) : 873 - 874
  • [30] Machine learning on big data: Opportunities and challenges
    Zhou, Lina
    Pan, Shimei
    Wang, Jianwu
    Vasilakos, Athanasios V.
    NEUROCOMPUTING, 2017, 237 : 350 - 361