Hypothesis Testing for Differentially Private Linear Regression

被引:0
|
作者
Alabi, Daniel [1 ,2 ]
Vadhan, Salil [3 ]
机构
[1] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
[2] Columbia Univ, Data Sci Inst, New York, NY 10027 USA
[3] Harvard Sch Engn & Appl Sci, Boston, MA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we design differentially private hypothesis tests for the following problems in the general linear model: testing a linear relationship and testing for the presence of mixtures. The majority of our hypothesis tests are based on differentially private versions of the F-statistic for the general linear model framework, which are uniformly most powerful unbiased in the non-private setting. We also present another test for testing mixtures, based on the differentially private nonparametric tests of Couch, Kazan, Shi, Bray, and Groce (CCS 2019), which is especially suited for the small dataset regime. We show that the differentially private F-statistic converges to the asymptotic distribution of its non-private counterpart. As a corollary, the statistical power of the differentially private F-statistic converges to the statistical power of the non-private F-statistic. Through a suite of Monte Carlo based experiments, we show that our tests achieve desired significance levels and have a high power that approaches the power of the non-private tests as we increase sample sizes or the privacy-loss parameter. We also show when our tests outperform existing methods in the literature.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Hypothesis testing for differentially correlated features
    Sheng, Elisa
    Witten, Daniela
    Zhou, Xiao-Hua
    [J]. BIOSTATISTICS, 2016, 17 (04) : 677 - 691
  • [32] Estimation and hypothesis testing in multivariate linear regression models under non normality
    Islam, M. Qamarul
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (17) : 8521 - 8543
  • [33] Case-deletion diagnostics for testing a linear hypothesis about regression coefficients
    Kim M.G.
    [J]. Journal of Applied Mathematics and Computing, 2002, 10 (1-2) : 111 - 118
  • [34] Quantum Differentially Private Sparse Regression Learning
    Du, Yuxuan
    Hsieh, Min-Hsiu
    Liu, Tongliang
    You, Shan
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2022, 68 (08) : 5217 - 5233
  • [35] Differentially Private Logistic Regression with Sparse Solutions
    Khanna, Amol
    Lu, Fred
    Raff, Edward
    Testa, Brian
    [J]. PROCEEDINGS OF THE 16TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2023, 2023, : 1 - 9
  • [36] Differentially Private Significance Tests for Regression Coefficients
    Barrientos, Andres F.
    Reiter, Jerome P.
    Machanavajjhala, Ashwin
    Chen, Yan
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (02) : 440 - 453
  • [37] Differentially Private Contextual Linear Bandits
    Shariff, Roshan
    Sheffet, Or
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [38] Differentially private distributed logistic regression using private and public data
    Zhanglong Ji
    Xiaoqian Jiang
    Shuang Wang
    Li Xiong
    Lucila Ohno-Machado
    [J]. BMC Medical Genomics, 7
  • [39] Differentially private distributed logistic regression using private and public data
    Ji, Zhanglong
    Jiang, Xiaoqian
    Wang, Shuang
    Xiong, Li
    Ohno-Machado, Lucila
    [J]. BMC MEDICAL GENOMICS, 2014, 7
  • [40] Bootstrap hypothesis testing in regression models
    Paparoditis, E
    Politis, DN
    [J]. STATISTICS & PROBABILITY LETTERS, 2005, 74 (04) : 356 - 365