Differentially private confidence intervals for proportions under stratified random sampling

被引:1
|
作者
Lin, Shurong [1 ]
Bun, Mark [2 ]
Gaboardi, Marco [2 ]
Kolaczyk, Eric D. [3 ]
Smith, Adam [2 ]
机构
[1] Boston Univ, Dept Math & Stat, Boston, MA 02215 USA
[2] Boston Univ, Dept Comp Sci, Boston, MA USA
[3] McGill Univ, Dept Math & Stat, Montreal, PQ, Canada
来源
ELECTRONIC JOURNAL OF STATISTICS | 2024年 / 18卷 / 01期
关键词
Differential privacy; confidence intervals; strat ified sampling; population proportion; WEIGHTS; RATIO;
D O I
10.1214/24-EJS2234
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Confidence intervals are a fundamental tool for quantifying the uncertainty of parameters of interest. With the increase of data privacy awareness, developing a private version of confidence intervals has gained growing attention from both statisticians and computer scientists. Differential privacy is a state-of-the-art framework for analyzing privacy loss when releasing statistics computed from sensitive data. Recent work has been done around differentially private confidence intervals, yet to the best of our knowledge, rigorous methodologies on differentially private confidence intervals in the context of survey sampling have not been studied. In this paper, we propose three differentially private algorithms for constructing confidence intervals for proportions under stratified random sampling. We articulate two variants of differential privacy that make sense for data from stratified sampling designs, analyzing each of our algorithms within one of these two variants. We establish analytical privacy guarantees and asymptotic properties of the estimators. In addition, we conduct simulation studies to evaluate the proposed private confidence intervals, and two applications to the 1940 Census data are provided.
引用
收藏
页码:1455 / 1494
页数:40
相关论文
共 50 条
  • [1] CONFIDENCE-INTERVALS FOR QUANTILES OF A FINITE POPULATION - SIMPLE RANDOM AND STRATIFIED SIMPLE RANDOM SAMPLING
    SEDRANSK, J
    MEYER, J
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1978, 40 (02): : 239 - 252
  • [2] Parametric Bootstrap for Differentially Private Confidence Intervals
    Ferrando, Cecilia
    Wang, Shufan
    Sheldon, Daniel
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [3] Nonparametric Differentially Private Confidence Intervals for the Median
    Drechsler, Joerg
    Globus-Harris, Ira
    Mcmillan, Audra
    Sarathy, Jayshree
    Smith, Adam
    JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2022, 10 (03) : 804 - 829
  • [4] Stratified Wilson and Newcombe Confidence Intervals for Multiple Binomial Proportions
    Yan, Xin
    Su, Xiao Gang
    STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2010, 2 (03): : 329 - 335
  • [5] Empirical likelihood confidence intervals under imputation for missing survey data from stratified simple random sampling
    Cai, Song
    Qin, Yongsong
    Rao, J. N. K.
    Winiszewska, Malgorzata
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2019, 47 (02): : 281 - 301
  • [6] STRATIFIED RANDOM SAMPLING - RANK CORRELATION-COEFFICIENTS, TESTS OF INDEPENDENCE AND CONFIDENCE INTERVALS
    YANAGAWA, T
    BULLETIN OF MATHEMATICAL STATISTICS, 1973, 15 (3-4): : 21 - 30
  • [7] MOVER confidence intervals for a difference or ratio effect parameter under stratified sampling
    Tang, Yongqiang
    STATISTICS IN MEDICINE, 2022, 41 (01) : 194 - 207
  • [8] Score confidence intervals and sample sizes for stratified comparisons of binomial proportions
    Tang, Yongqiang
    STATISTICS IN MEDICINE, 2020, 39 (24) : 3427 - 3457
  • [9] An accurate confidence interval for the mean tourist expenditure under stratified random sampling
    Ding, Cherng G.
    Lee, Hsiu-Yu
    CURRENT ISSUES IN TOURISM, 2014, 17 (08) : 674 - 678
  • [10] Logarithmic confidence intervals for the cross-product ratio of binomial proportions under different sampling schemes
    Sungboonchoo, Chanakan
    Yang, Su-Fen
    Panichkitkosolkul, Wararit
    Volodin, Andrei
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (06) : 2686 - 2704