Perspective: Big Data and Machine Learning Could Help Advance Nutritional Epidemiology

被引:43
|
作者
Morgenstern, Jason D. [1 ]
Rosella, Laura C. [2 ,3 ]
Costa, Andrew P. [1 ]
de Souza, Russell J. [1 ,4 ]
Anderson, Laura N. [1 ]
机构
[1] McMaster Univ, Dept Hlth Res Methods Evidencg & Impact, Hamilton, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[3] Vector Inst, Toronto, ON, Canada
[4] Hamilton Hlth Sci, Populat Hlth Res Inst, Hamilton, ON, Canada
基金
加拿大健康研究院;
关键词
machine learning; big data; nutritional epidemiology; artificial intelligence; nutritional sciences; diet; nutrition; precision nutrition; PROPENSITY SCORE ESTIMATION; MEASUREMENT ERROR; CARDIOVASCULAR-DISEASE; LOGISTIC-REGRESSION; CONFOUNDER CONTROL; NEGATIVE CONTROLS; 24-HOUR RECALL; SELECTION; DIET; RISK;
D O I
10.1093/advances/nmaa183
中图分类号
R15 [营养卫生、食品卫生]; TS201 [基础科学];
学科分类号
100403 ;
摘要
The field of nutritional epidemiology faces challenges posed by measurement error, diet as a complex exposure, and residual confounding. The objective of this perspective article is to highlight how developments in big data and machine learning can help address these challenges. New methods of collecting 24-h dietary recalls and recording diet could enable larger samples and more repeated measures to increase statistical power and measurement precision. In addition, use of machine learning to automatically classify pictures of food could become a useful complimentary method to help improve precision and validity of dietary measurements. Diet is complex due to thousands of different foods that are consumed in varying proportions, fluctuating quantities over time, and differing combinations. Current dietary pattern methods may not integrate sufficient dietary variation, and most traditional modeling approaches have limited incorporation of interactions and nonlinearity. Machine learning could help better model diet as a complex exposure with nonadditive and nonlinear associations. Last, novel big data sources could help avoid unmeasured confounding by offering more covariates, including both omics and features derived from unstructured data with machine learning methods. These opportunities notwithstanding, application of big data and machine learning must be approached cautiously to ensure quality of dietary measurements, avoid overfitting, and confirm accurate interpretations. Greater use of machine learning and big data would also require substantial investments in training, collaborations, and computing infrastructure. Overall, we propose that judicious application of big data and machine learning in nutrition science could offer new means of dietary measurement, more tools to model the complexity of diet and its relations with diseases, and additional potential ways of addressing confounding.
引用
收藏
页码:621 / 631
页数:11
相关论文
共 50 条
  • [31] Big Data and Machine Learning Framework in Healthcare
    Dogaru, Delia Ioana
    Dumitrache, Ioan
    2019 E-HEALTH AND BIOENGINEERING CONFERENCE (EHB), 2019,
  • [32] Green Computing for Big Data and Machine Learning
    Barua, Hrishav Bakul
    Mondal, Kartick Chandra
    Khatua, Sunirmal
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 348 - 351
  • [33] PivotalR: A Package for Machine Learning on Big Data
    Qian, Hai
    R JOURNAL, 2014, 6 (01): : 57 - 67
  • [34] Machine learning for Big Data analytics in plants
    Ma, Chuang
    Zhang, Hao Helen
    Wang, Xiangfeng
    TRENDS IN PLANT SCIENCE, 2014, 19 (12) : 798 - 808
  • [35] Big data and machine learning for materials science
    Rodrigues J.F., Jr.
    Florea L.
    de Oliveira M.C.F.
    Diamond D.
    Oliveira O.N., Jr.
    Discover Materials, 1 (1):
  • [36] Big data algorithms beyond machine learning
    Mnich M.
    KI - Kunstliche Intelligenz, 2018, 32 (01): : 9 - 17
  • [37] A survey of machine learning for big data processing
    Junfei Qiu
    Qihui Wu
    Guoru Ding
    Yuhua Xu
    Shuo Feng
    EURASIP Journal on Advances in Signal Processing, 2016
  • [38] Editorial: Big data and machine learning in sociology
    Leitgoeb, Heinz
    Prandner, Dimitri
    Wolbring, Tobias
    FRONTIERS IN SOCIOLOGY, 2023, 8
  • [39] Big Data and Machine Learning in Health Care
    Beam, Andrew L.
    Kohane, Isaac S.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2018, 319 (13): : 1317 - 1318
  • [40] A Survey of Machine Learning Methods for Big Data
    Ruiz, Zoila
    Salvador, Jaime
    Garcia-Rodriguez, Jose
    BIOMEDICAL APPLICATIONS BASED ON NATURAL AND ARTIFICIAL COMPUTING, PT II, 2017, 10338 : 259 - 267