Multi-Dimensional Randomized Response

被引:6
|
作者
Domingo-Ferrer, Josep [1 ]
Soria-Comas, Jordi [2 ]
机构
[1] Univ Rovira & Virgili, CYBERCAT Ctr Cybersecur Res Catalonia, Dept Comp Engn & Math, UNESCO Chair Data Privacy, Av Paisos Catalans 26, Tarragona 43007, Catalonia, Spain
[2] Catalan Data Protect Author, Barcelona 08008, Catalonia, Spain
关键词
Privacy; Estimation; Differential privacy; Data privacy; Phase change random access memory; Clustering algorithms; Protocols; Privacy preserving data publishing; randomized response; curse of dimensionality; local anonymization; multivariate data; differential privacy;
D O I
10.1109/TKDE.2020.3045759
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In our data world, a host of not necessarily trusted controllers gather data on individual subjects. To preserve her privacy and, more generally, her informational self-determination, the individual has to be empowered by giving her agency on her own data. Maximum agency is afforded by local anonymization, that allows each individual to anonymize her own data before handing them to the data controller. Randomized response (RR) is a local anonymization approach able to yield multi-dimensional full sets of anonymized microdata that are valid for exploratory analysis and machine learning. This is so because an unbiased estimate of the distribution of the true data of individuals can be obtained from their pooled randomized data. Furthermore, RR offers rigorous privacy guarantees. The main weakness of RR is the curse of dimensionality when applied to several attributes: as the number of attributes grows, the accuracy of the estimated true data distribution quickly degrades. We propose several complementary approaches to mitigate the dimensionality problem. First, we present two basic protocols, separate RR on each attribute and joint RR for all attributes, and discuss their limitations. Then we introduce an algorithm to form clusters of attributes so that attributes in different clusters can be viewed as independent and joint RR can be performed within each cluster. After that, we introduce an adjustment algorithm for the randomized data set that repairs some of the accuracy loss due to assuming independence between attributes when using RR separately on each attribute or due to assuming independence between clusters in cluster-wise RR. We also present empirical work to illustrate the proposed methods.
引用
收藏
页码:4933 / 4946
页数:14
相关论文
共 50 条
  • [41] Applied multi-dimensional fusion
    Mahmood, Asher
    Tudor, Philip M.
    Oxford, William
    Hansford, Robert
    Nelson, James D.B.
    Kingsbury, Nicholas G.
    Katartzis, Antonis
    Petrou, M.
    Mitianoudis, N.
    Stathaki, T.
    Achim, Alin
    Bull, David
    Canagarajah, Nishan
    Nikolov, Stavri
    Loza, Artur
    Cvejic, Nedeljko
    Computer Journal, 2007, 50 (06): : 646 - 659
  • [42] Multi-Dimensional, MultiStep Negotiation
    Xiaoqin Zhang
    Victor Lesser
    Rodion Podorozhny
    Autonomous Agents and Multi-Agent Systems, 2005, 10 (1) : 5 - 40
  • [43] Visualizing multi-dimensional data
    Eick, SG
    COMPUTER GRAPHICS-US, 2000, 34 (01): : 61 - 67
  • [44] Multi-dimensional Forwarding Tables
    Bayzelon, Gautier
    Yang, Shu
    Xu, Mingwei
    Li, Qi
    FRONTIERS IN INTERNET TECHNOLOGIES, 2015, 502 : 68 - 79
  • [45] Multi-dimensional description logics
    Wolter, F
    Zakharyaschev, M
    IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999, : 104 - 109
  • [46] Multi-Dimensional Sparse Models
    Qi, Na
    Shi, Yunhui
    Sun, Xiaoyan
    Wang, Jingdong
    Yin, Baocai
    Gao, Junbin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (01) : 163 - 178
  • [47] Comment on multi-dimensional indices
    Nancy Birdsall
    The Journal of Economic Inequality, 2011, 9
  • [48] MULTI-DIMENSIONAL APPROACH TO STEREOTYPING
    KIPPAX, S
    AUSTRALIAN PSYCHOLOGIST, 1975, 10 (01) : 114 - 114
  • [49] MULTI-DIMENSIONAL SCALING OF EMOTION
    YOSHIDA, M
    KINASE, R
    KUROKAWA, J
    YASHIRO, S
    JAPANESE PSYCHOLOGICAL RESEARCH, 1970, 12 (02) : 45 - &
  • [50] On multi-dimensional packing problems
    Chekuri, C
    Khanna, S
    PROCEEDINGS OF THE TENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 1999, : 185 - 194