Reconstruction of age distributions from differentially private census data

被引:2
|
作者
Dyrting, Sigurd [1 ]
Flaxman, Abraham [2 ]
Sharygin, Ethan [3 ]
机构
[1] Charles Darwin Univ, Northern Inst, Darwin, NT 0909, Australia
[2] Univ Washington, Inst Hlth Metr & Evaluat IHME, 3980 15th Ave NE, Seattle, WA 98195 USA
[3] Portland State Univ, Populat Res Ctr, POB 751, Portland, OR 97207 USA
关键词
Census; Privacy; Demography;
D O I
10.1007/s11113-022-09734-2
中图分类号
C921 [人口统计学];
学科分类号
摘要
The age distribution of a population is important for understanding the demand and provision of labor and services, and as a denominator for calculating key age-specific rates such as fertility and mortality. In the US, the most important source of information on age distributions is the decennial census, but a new disclosure avoidance system (DAS) based on differential privacy will inject noise into the data, potentially compromising its utility for small areas and minority populations. In this paper, we explore the question whether there are statistical methods that can be applied to noisy age distributions to enhance the research uses of census data without compromising privacy. We apply a non-parametric method for smoothing with naive or informative priors to age distributions from the 2010 Census via demonstration data which have had the US Census Bureau's implementation of differential privacy applied. We find that smoothing age distributions can increase the fidelity of the demonstration data to previously published population counts by age. We discuss implications for uses of data from the 2020 US Census and potential consequences for the measurement of population dynamics, health, and disparities.
引用
收藏
页码:2311 / 2329
页数:19
相关论文
共 50 条
  • [1] Reconstruction of age distributions from differentially private census data
    Sigurd Dyrting
    Abraham Flaxman
    Ethan Sharygin
    [J]. Population Research and Policy Review, 2022, 41 : 2311 - 2329
  • [2] Differentially Private Sampling from Distributions
    Raskhodnikova, Sofya
    Sivakumar, Satchit
    Smith, Adam
    Swanberg, Marika
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] On Differentially Private Sampling from Gaussian and Product Distributions
    Ghazi, Badih
    Hu, Xiao
    Kumar, Ravi
    Manurangsi, Pasin
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [4] Differentially Private Learning of Structured Discrete Distributions
    Diakonikolas, Ilias
    Hardt, Moritz
    Schmidt, Ludwig
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [5] Differentially Private Auctions for Private Data Crowdsourcing
    Shi, Mingyu
    Qiao, Yu
    Wang, Xinbo
    [J]. 2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1 - 8
  • [6] Collaborative learning from distributed data with differentially private synthetic data
    Prediger, Lukas
    Jalko, Joonas
    Honkela, Antti
    Kaski, Samuel
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [7] RECONSTRUCTION OF BIRTH HISTORIES FROM CENSUS AND HOUSEHOLD SURVEY DATA
    LUTHER, NY
    CHO, LJ
    [J]. POPULATION STUDIES-A JOURNAL OF DEMOGRAPHY, 1988, 42 (03): : 451 - 472
  • [8] THE GENERATION OF SPATIAL POPULATION-DISTRIBUTIONS FROM CENSUS CENTROID DATA
    BRACKEN, I
    MARTIN, D
    [J]. ENVIRONMENT AND PLANNING A, 1989, 21 (04) : 537 - 543
  • [9] Differentially Private Testing of Identity and Closeness of Discrete Distributions
    Acharya, Jayadev
    Sun, Ziteng
    Zhang, Huanyu
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] Differentially Private Data Generation with Missing Data
    Mohapatra, Shubhankar
    Zong, Jianqiao
    Kerschbaum, Florian
    He, Xi
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (08): : 2022 - 2035