Perspective: Big Data and Machine Learning Could Help Advance Nutritional Epidemiology

被引:43
|
作者
Morgenstern, Jason D. [1 ]
Rosella, Laura C. [2 ,3 ]
Costa, Andrew P. [1 ]
de Souza, Russell J. [1 ,4 ]
Anderson, Laura N. [1 ]
机构
[1] McMaster Univ, Dept Hlth Res Methods Evidencg & Impact, Hamilton, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[3] Vector Inst, Toronto, ON, Canada
[4] Hamilton Hlth Sci, Populat Hlth Res Inst, Hamilton, ON, Canada
基金
加拿大健康研究院;
关键词
machine learning; big data; nutritional epidemiology; artificial intelligence; nutritional sciences; diet; nutrition; precision nutrition; PROPENSITY SCORE ESTIMATION; MEASUREMENT ERROR; CARDIOVASCULAR-DISEASE; LOGISTIC-REGRESSION; CONFOUNDER CONTROL; NEGATIVE CONTROLS; 24-HOUR RECALL; SELECTION; DIET; RISK;
D O I
10.1093/advances/nmaa183
中图分类号
R15 [营养卫生、食品卫生]; TS201 [基础科学];
学科分类号
100403 ;
摘要
The field of nutritional epidemiology faces challenges posed by measurement error, diet as a complex exposure, and residual confounding. The objective of this perspective article is to highlight how developments in big data and machine learning can help address these challenges. New methods of collecting 24-h dietary recalls and recording diet could enable larger samples and more repeated measures to increase statistical power and measurement precision. In addition, use of machine learning to automatically classify pictures of food could become a useful complimentary method to help improve precision and validity of dietary measurements. Diet is complex due to thousands of different foods that are consumed in varying proportions, fluctuating quantities over time, and differing combinations. Current dietary pattern methods may not integrate sufficient dietary variation, and most traditional modeling approaches have limited incorporation of interactions and nonlinearity. Machine learning could help better model diet as a complex exposure with nonadditive and nonlinear associations. Last, novel big data sources could help avoid unmeasured confounding by offering more covariates, including both omics and features derived from unstructured data with machine learning methods. These opportunities notwithstanding, application of big data and machine learning must be approached cautiously to ensure quality of dietary measurements, avoid overfitting, and confirm accurate interpretations. Greater use of machine learning and big data would also require substantial investments in training, collaborations, and computing infrastructure. Overall, we propose that judicious application of big data and machine learning in nutrition science could offer new means of dietary measurement, more tools to model the complexity of diet and its relations with diseases, and additional potential ways of addressing confounding.
引用
收藏
页码:621 / 631
页数:11
相关论文
共 50 条
  • [41] Machine Learning Technologies for Big Data Analytics
    Gandomi, Amir H.
    Chen, Fang
    Abualigah, Laith
    ELECTRONICS, 2022, 11 (03)
  • [42] Machine Learning Meets Big Spatial Data
    Sabek, Ibrahim
    Mokbel, Mohamed F.
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1782 - 1785
  • [43] Machine Learning Research in Big Data Environment
    Jiang, Shi
    2018 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL & ELECTRONICS ENGINEERING AND COMPUTER SCIENCE (ICEEECS 2018), 2018, : 227 - 231
  • [44] Efficient Machine Learning for Big Data: A Review
    Al-Jarrah, Omar Y.
    Yoo, Paul D.
    Muhaidat, Sami
    Karagiannidis, George K.
    Taha, Kamal
    BIG DATA RESEARCH, 2015, 2 (03) : 87 - 93
  • [45] Automated Trading with Machine Learning on Big Data
    Ruta, Dymitr
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 824 - 830
  • [46] A survey of machine learning for big data processing
    Qiu, Junfei
    Wu, Qihui
    Ding, Guoru
    Xu, Yuhua
    Feng, Shuo
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2016,
  • [47] Big Data, Predictive Analytics and Machine Learning
    Ongsulee, Pariwat
    Chotchaung, Veena
    Bamrungsi, Eak
    Rodcheewit, Thanaporn
    2018 16TH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING (ICT&KE), 2018, : 37 - 42
  • [48] Big Data, Machine Learning, and Molecular Imaging
    Morris, Michael
    Saboury, Babak
    Siegel, Eliot
    JOURNAL OF NUCLEAR MEDICINE, 2018, 59
  • [49] Big data and machine learning for crop protection
    Ip, Ryan H. L.
    Ang, Li-Minn
    Seng, Kah Phooi
    Broster, J. C.
    Pratley, J. E.
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 151 : 376 - 383
  • [50] Data Analytics and Machine Learning: Navigating the Big Data Landscape
    Sloboda, Brian W.
    INTERNATIONAL STATISTICAL REVIEW, 2024,