Hypothesis Testing Under Mutual Information Privacy Constraints in the High Privacy Regime

被引:33
|
作者
Liao, Jiachun [1 ]
Sankar, Lalitha
Tan, Vincent Y. F. [2 ]
Calmon, Flavio du Pin [3 ,4 ]
机构
[1] Arizona State Univ, Tempe, AZ 85281 USA
[2] Natl Univ Singapore, Singapore 119077, Singapore
[3] IBM Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[4] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
基金
美国国家科学基金会;
关键词
Hypothesis testing; privacy-guaranteed data publishing; privacy mechanism; euclidean information theory; relative entropy; Renyi divergence; mutual information;
D O I
10.1109/TIFS.2017.2779108
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Hypothesis testing is a statistical inference framework for determining the true distribution among a set of possible distributions for a given data set. Privacy restrictions may require the curator of the data or the respondents themselves to share data with the test only after applying a randomizing privacy mechanism. This work considers mutual information (MI) as the privacy metric for measuring leakage. In addition, motivated by the Chernoff-Stein lemma, the relative entropy between pairs of distributions of the output (generated by the privacy mechanism) is chosen as the utility metric. For these metrics, the goal is to find the optimal privacy-utility tradeoff (PUT) and the corresponding optimal privacy mechanism for both binary and m-ary hypothesis testing. Focusing on the high privacy regime, Euclidean information-theoretic approximations of the binary and m-ary PUT problems are developed. The solutions for the approximation problems clarify that an MI-based privacy metric preserves the privacy of the source symbols in inverse proportion to their likelihoods.
引用
收藏
页码:1058 / 1071
页数:14
相关论文
共 50 条
  • [41] Privacy-Preserving Representation Learning on Graphs: A Mutual Information Perspective
    Wang, Binghui
    Guo, Jiayi
    Li, Ang
    Chen, Yiran
    Li, Hai
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 1667 - 1676
  • [42] On Error Exponents Under A Privacy-Preserving Voting Regime
    Tuncel, Ertem
    2019 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2019, : 797 - 801
  • [43] Privacy-preserving Hypothesis Testing for the Analysis of Epidemiological Medical Data
    Kikuchi, Hiroaki
    Sato, Tomoki
    Sakuma, Jun
    2014 IEEE 28TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2014, : 359 - 365
  • [44] Privacy-Utility Tradeoff for Hypothesis Testing Over a Noisy Channel
    Zhou, Lin
    Cao, Daming
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 4078 - 4091
  • [45] CHINA'S EMERGING LEGAL REGIME FOR PRIVACY AND PERSONAL INFORMATION PROTECTION
    Peng, Chengxin
    Shao, Guosong
    Zheng, Wentong
    TSINGHUA CHINA LAW REVIEW, 2023, 15 (02): : 191 - 221
  • [46] GEOMETRIZING RATES OF CONVERGENCE UNDER LOCAL DIFFERENTIAL PRIVACY CONSTRAINTS
    Rohde, Angelika
    Steinberger, Lukas
    ANNALS OF STATISTICS, 2020, 48 (05): : 2646 - 2670
  • [47] A Framework for Efficient Data Anonymization under Privacy and Accuracy Constraints
    Ghinita, Gabriel
    Karras, Panagiotis
    Kalnis, Panos
    Mamoulis, Nikos
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (02):
  • [48] Estimating Sparse Discrete Distributions Under Privacy and Communication Constraints
    Acharya, Jayadev
    Kairouz, Peter
    Liu, Yuhan
    Sun, Ziteng
    ALGORITHMIC LEARNING THEORY, VOL 132, 2021, 132
  • [49] Personal privacy and common goods: A framework for balancing under the national health information privacy rule
    Gostin, LO
    Hodge, JG
    MINNESOTA LAW REVIEW, 2002, 86 (06) : 1439 - 1479
  • [50] Maximum Likelihood Postprocessing for Differential Privacy under Consistency Constraints
    Lee, Jaewoo
    Wang, Yue
    Kifer, Daniel
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 635 - 644