Web Data Mining: Validity of Data from Google Earth for Food Retail Evaluation

被引:0
|
作者
Mariana Carvalho de Menezes
Vanderlei Pascoal de Matos
Maria de Fátima de Pina
Bruna Vieira de Lima Costa
Larissa Loures Mendes
Milene Cristine Pessoa
Paulo Roberto Borges de Souza-Junior
Amélia Augusta de Lima Friche
Waleska Teixeira Caiaffa
Letícia de Oliveira Cardoso
机构
[1] Fiocruz-RJ,National School of Public Health
[2] Fiocruz-RJ,Instituto de Comunicação e Informação Científica e Tecnológica em Saúde
[3] Universidade Federal de Minas Gerais,Department of Nutrition
[4] Universidade Federal de Minas Gerais. Observatório de Saúde Urbana,Faculdade de Medicina
来源
Journal of Urban Health | 2021年 / 98卷
关键词
Food environment; Food retail; Validation study; Geocoding services; Google Earth; Urban health;
D O I
暂无
中图分类号
学科分类号
摘要
To overcome the challenge of obtaining accurate data on community food retail, we developed an innovative tool to automatically capture food retail data from Google Earth (GE). The proposed method is relevant to non-commercial use or scholarly purposes. We aimed to test the validity of web sources data for the assessment of community food retail environment by comparison to ground-truth observations (gold standard). A secondary aim was to test whether validity differs by type of food outlet and socioeconomic status (SES). The study area included a sample of 300 census tracts stratified by SES in two of the largest cities in Brazil, Rio de Janeiro and Belo Horizonte. The GE web service was used to develop a tool for automatic acquisition of food retail data through the generation of a regular grid of points. To test its validity, this data was compared with the ground-truth data. Compared to the 856 outlets identified in 285 census tracts by the ground-truth method, the GE interface identified 731 outlets. In both cities, the GE interface scored moderate to excellent compared to the ground-truth data across all of the validity measures: sensitivity, specificity, positive predictive value, negative predictive value and accuracy (ranging from 66.3 to 100%). The validity did not differ by SES strata. Supermarkets, convenience stores and restaurants yielded better results than other store types. To our knowledge, this research is the first to investigate using GE as a tool to capture community food retail data. Our results suggest that the GE interface could be used to measure the community food environment. Validity was satisfactory for different SES areas and types of outlets.
引用
收藏
页码:285 / 295
页数:10
相关论文
共 50 条
  • [1] Web Data Mining: Validity of Data from Google Earth for Food Retail Evaluation
    de Menezes, Mariana Carvalho
    de Matos, Vanderlei Pascoal
    de Pina, Maria de Fatima
    de Lima Costa, Bruna Vieira
    Mendes, Larissa Loures
    Pessoa, Milene Cristine
    de Souza-Junior, Paulo Roberto Borges
    de Lima Friche, Amelia Augusta
    Caiaffa, Waleska Teixeira
    de Oliveira Cardoso, Leticia
    [J]. JOURNAL OF URBAN HEALTH-BULLETIN OF THE NEW YORK ACADEMY OF MEDICINE, 2021, 98 (02): : 285 - 295
  • [2] Mining the web to add semantics to retail data mining
    Ghani, R
    [J]. WEB MINING: FROM WEB TO SEMANTIC WEB, 2004, 3209 : 43 - 56
  • [3] Validity of Secondary Retail Food Outlet Data A Systematic Review
    Fleischhacker, Sheila E.
    Evenson, Kelly R.
    Sharkey, Joseph
    Pitts, Stephanie B. Jilcott
    Rodriguez, Daniel A.
    [J]. AMERICAN JOURNAL OF PREVENTIVE MEDICINE, 2013, 45 (04) : 462 - 473
  • [4] Data Mining From Web Search Queries: A Comparison of Google Trends and Baidu Index
    Vaughan, Liwen
    Chen, Yue
    [J]. JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2015, 66 (01) : 13 - 22
  • [5] From temporal data mining and web mining to temporal web mining
    Samia, M
    Conrad, S
    [J]. DATABASES AND INFORMATION SYSTEMS, 2005, 118 : 91 - 102
  • [6] Data-mining behavioural data from the web
    Balogh, Zoltan
    [J]. PROCEEDINGS OF 2016 10TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT & APPLICATIONS (SKIMA), 2016, : 122 - 127
  • [7] Web + Data Mining = Web Mining
    Kilian Stoffel
    [J]. HMD Praxis der Wirtschaftsinformatik, 2009, 46 (4) : 6 - 20
  • [8] Data mining in retail credit
    Georgios Sarantopoulos
    [J]. Operational Research, 2003, 3 (2) : 99 - 122
  • [9] FROM ERP TO DATA MINING ON THE WEB
    Ghenea, Serban
    [J]. UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2011, 73 (04): : 89 - 98
  • [10] Applications of data warehousing and data mining in the retail industry
    Li, HG
    [J]. 2005 International Conference on Services Systems and Services Management, Vols 1 and 2, Proceedings, 2005, : 1047 - 1050