Multi-Dimensional Randomized Response

被引:6
|
作者
Domingo-Ferrer, Josep [1 ]
Soria-Comas, Jordi [2 ]
机构
[1] Univ Rovira & Virgili, CYBERCAT Ctr Cybersecur Res Catalonia, Dept Comp Engn & Math, UNESCO Chair Data Privacy, Av Paisos Catalans 26, Tarragona 43007, Catalonia, Spain
[2] Catalan Data Protect Author, Barcelona 08008, Catalonia, Spain
关键词
Privacy; Estimation; Differential privacy; Data privacy; Phase change random access memory; Clustering algorithms; Protocols; Privacy preserving data publishing; randomized response; curse of dimensionality; local anonymization; multivariate data; differential privacy;
D O I
10.1109/TKDE.2020.3045759
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In our data world, a host of not necessarily trusted controllers gather data on individual subjects. To preserve her privacy and, more generally, her informational self-determination, the individual has to be empowered by giving her agency on her own data. Maximum agency is afforded by local anonymization, that allows each individual to anonymize her own data before handing them to the data controller. Randomized response (RR) is a local anonymization approach able to yield multi-dimensional full sets of anonymized microdata that are valid for exploratory analysis and machine learning. This is so because an unbiased estimate of the distribution of the true data of individuals can be obtained from their pooled randomized data. Furthermore, RR offers rigorous privacy guarantees. The main weakness of RR is the curse of dimensionality when applied to several attributes: as the number of attributes grows, the accuracy of the estimated true data distribution quickly degrades. We propose several complementary approaches to mitigate the dimensionality problem. First, we present two basic protocols, separate RR on each attribute and joint RR for all attributes, and discuss their limitations. Then we introduce an algorithm to form clusters of attributes so that attributes in different clusters can be viewed as independent and joint RR can be performed within each cluster. After that, we introduce an adjustment algorithm for the randomized data set that repairs some of the accuracy loss due to assuming independence between attributes when using RR separately on each attribute or due to assuming independence between clusters in cluster-wise RR. We also present empirical work to illustrate the proposed methods.
引用
收藏
页码:4933 / 4946
页数:14
相关论文
共 50 条
  • [31] RESPONSE SOLUTIONS TO HARMONIC OSCILLATORS BEYOND MULTI-DIMENSIONAL BRJUNO FREQUENCY
    Cheng, Hongyu
    Wang, Shimin
    COMMUNICATIONS ON PURE AND APPLIED ANALYSIS, 2021, 20 (02) : 467 - 494
  • [32] Multi-Dimensional Transport Equations
    Eftimie, Raluca
    HYPERBOLIC AND KINETIC MODELS FOR SELF-ORGANISED BIOLOGICAL AGGREGATIONS: A MODELLING AND PATTERN FORMATION APPROACH, 2018, 2232 : 153 - 193
  • [33] MULTI-DIMENSIONAL BORDERS IN NARRATION
    Jaago, Tiiu
    FOLKLORE-ELECTRONIC JOURNAL OF FOLKLORE, 2018, (73) : 145 - 160
  • [34] Discrete multi-dimensional scaling
    Clouse, DS
    Cottrell, GW
    PROCEEDINGS OF THE EIGHTEENTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1996, : 290 - 294
  • [35] A Multi-Dimensional Paper Recommender
    Tang, Tiffany Y.
    McCalla, Gordon
    ARTIFICIAL INTELLIGENCE IN EDUCATION: BUILDING TECHNOLOGY RICH LEARNING CONTEXTS THAT WORK, 2007, 158 : 653 - +
  • [36] ON MULTI-DIMENSIONAL MARKOVIAN COCYCLES
    ACCARDI, L
    JOURNE, JL
    LINDSAY, JM
    LECTURE NOTES IN MATHEMATICS, 1989, 1396 : 59 - 67
  • [37] A Multi-dimensional Analysis of Deception
    Su, Qi
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 160 - 163
  • [38] Multi-dimensional Compressive Imaging
    Javidi, B.
    Mahalanobis, A.
    Xiao, X.
    Rivenson, Y.
    Horisaki, R.
    Stern, A.
    Latorre-Carmona, P.
    Martinez-Corral, M.
    Pla, F.
    Tanida, J.
    EMERGING TECHNOLOGIES IN SECURITY AND DEFENCE; AND QUANTUM SECURITY II; AND UNMANNED SENSOR SYSTEMS X, 2013, 8899
  • [39] Multi-dimensional heuristic searching
    1600, Morgan Kaufmann Publ Inc, San Mateo, CA, USA (01):
  • [40] ANALYSIS IN THE MULTI-DIMENSIONAL BALL
    Sjogren, Peter
    Szarek, Tomasz Z.
    MATHEMATIKA, 2019, 65 (02) : 190 - 212