Large-scale and high-resolution analysis of food purchases and health outcomes

被引:0
|
作者
Luca Maria Aiello
Rossano Schifanella
Daniele Quercia
Lucia Del Prete
机构
[1] Nokia Bell Labs,Centre for Urban Science and Progress (CUSP)
[2] University of Turin,undefined
[3] King’s College,undefined
[4] Tesco Labs,undefined
来源
关键词
Nutrition; Diabetes; Hypertension; Cholesterol; Metabolic syndrome; Digital purchase records; Grocery; Loyalty card; London;
D O I
暂无
中图分类号
学科分类号
摘要
To complement traditional dietary surveys, which are costly and of limited scale, researchers have resorted to digital data to infer the impact of eating habits on people’s health. However, online studies are limited in resolution: they are carried out at country or regional level and do not capture precisely the composition of the food consumed. We study the association between food consumption (derived from the loyalty cards of the main grocery retailer in London) and health outcomes (derived from publicly-available medical prescription records of all general practitioners in the city). The scale and granularity of our analysis is unprecedented: we analyze 1.6B food item purchases and 1.1B medical prescriptions for the entire city of London over the course of one year. By studying food consumption down to the level of nutrients, we show that nutrient diversity and amount of calories are the two strongest predictors of the prevalence of three diseases related to what is called the “metabolic syndrome”: hypertension, high cholesterol, and diabetes. This syndrome is a cluster of symptoms generally associated with obesity, is common across the rich world, and affects one in four adults in the UK. Our linear regression models achieve an R2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$R^{2}$\end{document} of 0.6 when estimating the prevalence of diabetes in nearly 1000 census areas in London, and a classifier can identify (un)healthy areas with up to 91% accuracy. Interestingly, healthy areas are not necessarily well-off (income matters less than what one would expect) and have distinctive features: they tend to systematically eat less carbohydrates and sugar, diversify nutrients, and avoid large quantities. More generally, our study shows that analytics of digital records of grocery purchases can be used as a cheap and scalable tool for health surveillance and, upon these records, different stakeholders from governments to insurance companies to food companies could implement effective prevention strategies.
引用
收藏
相关论文
共 50 条
  • [1] Large-scale and high-resolution analysis of food purchases and health outcomes
    Aiello, Luca Maria
    Schifanella, Rossano
    Quercia, Daniele
    Del Prete, Lucia
    [J]. EPJ DATA SCIENCE, 2019, 8 (1)
  • [2] A large-scale, high-resolution microscope
    Nina Vogt
    [J]. Nature Methods, 2019, 16 : 806 - 806
  • [3] A large-scale, high-resolution microscope
    Vogt, Nina
    [J]. NATURE METHODS, 2019, 16 (09) : 806 - 806
  • [4] High-resolution modelling of a large-scale river plume
    Vasiliy Vlasenko
    Nataliya Stashchuk
    Robert McEwan
    [J]. Ocean Dynamics, 2013, 63 : 1307 - 1320
  • [5] On the Importance of High-Resolution in Large-Scale Ocean Models
    Chassignet, Eric P.
    Xu, Xiaobiao
    [J]. ADVANCES IN ATMOSPHERIC SCIENCES, 2021, 38 (10) : 1621 - 1634
  • [6] High-resolution modelling of a large-scale river plume
    Vlasenko, Vasiliy
    Stashchuk, Nataliya
    McEwan, Robert
    [J]. OCEAN DYNAMICS, 2013, 63 (11-12) : 1307 - 1320
  • [7] On the Importance of High-Resolution in Large-Scale Ocean Models
    Eric P.CHASSIGNET
    Xiaobiao XU
    [J]. Advances in Atmospheric Sciences, 2021, 38 (10) : 1621 - 1634
  • [8] On the Importance of High-Resolution in Large-Scale Ocean Models
    Eric P. Chassignet
    Xiaobiao Xu
    [J]. Advances in Atmospheric Sciences, 2021, 38 : 1621 - 1634
  • [9] Multi-Resolution Design for Large-Scale and High-Resolution Monitoring
    Chen, Kuan-Wen
    Lin, Chih-Wei
    Chiu, Tzu-Hsuan
    Chen, Mike Yen-Yang
    Hung, Yi-Ping
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (06) : 1256 - 1268
  • [10] Large-scale and High-resolution Flood Risk Model for Japan
    Assteerawatt, Anongnart
    Tsaknias, Dimosthenis
    Azemar, Frederic
    Ghosh, Sourima
    Hilberts, Arno
    [J]. 3RD EUROPEAN CONFERENCE ON FLOOD RISK MANAGEMENT (FLOODRISK 2016), 2016, 7