INTEGRATING MULTIPLE BUILT ENVIRONMENT DATA SOURCES

被引:1
|
作者
Won, Jung Yeon [1 ]
Elliott, Michael R. [1 ]
Sanchez-Vaznaugh, Emma V. [2 ]
Sanchez, Brisa N. [3 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] San Francisco State Univ, Dept Hlth Educ, San Francisco, CA 94132 USA
[3] Drexel Univ, Dept Epidemiol & Biostat, Philadelphia, PA 19104 USA
来源
ANNALS OF APPLIED STATISTICS | 2023年 / 17卷 / 02期
关键词
Built-environment; count exposure; data integration; measurement error; Dirichlet process mixture model; commercial business lists; BODY-MASS INDEX; MEASUREMENT ERROR; BAYESIAN-APPROACH; POPULATION-SIZE; FOOD; MODELS; MIXTURES; CHILDREN; POISSON; NUMBER;
D O I
10.1214/22-AOAS1692
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Studies examining the contribution of the built environment to health often rely on commercial data sources to derive exposure measures, such as the number of specific food outlets in study participants' neighborhoods. Data on the location of community amenities (e.g., food outlets) can be col-lected from multiple sources. However, these commercial listings are known to have ascertainment errors and thus provide conflicting claims about the number and location of amenities. We propose a method that integrates expo-sure measures from different databases, while accounting for ascertainment errors, and obtains unbiased health effects of latent exposure. We frame the problem of conflicting exposure measures as a problem of two contingency tables with partially known margins, with the entries of the tables modeled using a multinomial distribution. Available estimates of source quality were embedded in a joint model for observed exposure counts, latent exposures, and health outcomes. Simulations show that our modeling framework yields substantially improved inferences regarding the health effects. We used the proposed method to estimate the association between children's body mass index (BMI) and the concentration of food outlets near their schools when both the NETS and Reference USA databases are available.
引用
收藏
页码:1722 / 1739
页数:18
相关论文
共 50 条
  • [1] Integrating Multiple Data Sources for Stock Prediction
    Wu, Di
    Fung, Gabriel Pui Cheong
    Yu, Jeffrey Xu
    Liu, Zheng
    [J]. WEB INFORMATION SYSTEMS ENGINEERING - WISE 2008, PROCEEDINGS, 2008, 5175 : 77 - +
  • [2] Integrating Multiple Data Sources to Enhance Sentiment Prediction
    Heredia, Brian
    Khoshgoftaar, Taghi M.
    Prusa, Joseph D.
    Crawford, Michael
    [J]. 2016 IEEE 2ND INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (IEEE CIC), 2016, : 285 - 291
  • [3] In Silico Gene Prioritization by Integrating Multiple Data Sources
    Chen, Yixuan
    Wang, Wenhui
    Zhou, Yingyao
    Shields, Robert
    Chanda, Sumit K.
    Elston, Robert C.
    Li, Jing
    [J]. PLOS ONE, 2011, 6 (06):
  • [4] Identifying disease genes by integrating multiple data sources
    Chen, Bolin
    Wang, Jianxin
    Li, Min
    Wu, Fang-Xiang
    [J]. BMC MEDICAL GENOMICS, 2014, 7
  • [5] Accessible Routes Integrating Data from Multiple Sources
    Luaces, Miguel R.
    Fisteus, Jesus A.
    Sanchez-Fernandez, Luis
    Munoz-Organero, Mario
    Balado, Jesus
    Diaz-Vilarino, Lucia
    Lorenzo, Henrique
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (01)
  • [6] Integrating Multiple Data Sources in a Cardiology Imaging Laboratory
    Godinho, Tiago Marques
    Almeida, Eduardo
    Bastido Silva, Luis A.
    Costa, Carlos
    [J]. 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM), 2016, : 596 - 601
  • [7] Exploring Disease Similarity by Integrating Multiple Data Sources
    Deng, Lei
    Ye, Danyi
    Zhao, Junmin
    Zhang, Jingpu
    [J]. PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 853 - 858
  • [8] Identifying disease genes by integrating multiple data sources
    Bolin Chen
    Jianxin Wang
    Min Li
    Fang-Xiang Wu
    [J]. BMC Medical Genomics, 7
  • [9] PROBABILISTIC METHOD FOR INTEGRATING MULTIPLE SOURCES OF RANGE DATA
    BOVE, VM
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1990, 7 (12): : 2193 - 2198
  • [10] MIROWeb: Integrating multiple data sources through semistructured data types
    Bouganim, L
    Chan-Sine-Ying, T
    Dang-Ngoc, TT
    Darroux, JL
    Gardarin, G
    Sha, F
    [J]. PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1999, : 750 - 753