On randomization, public information and the curse of dimensionality

被引:0
|
作者
Aggarwal, Charu C. [1 ]
机构
[1] IBM TJ Watson Res Ctr, Hawthorne, NY 10532 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A key method for privacy preserving data mining is that of randomization. Unlike k-anonymity, this technique does not include Public information in the underlying assumptions. In this paper, we will provide a first comprehensive analysis of the randomization method in the presence of public information. We will define a quantification of the randomization method which we refer to as k-randomization of the data. The inclusion of public information in the theoretical analysis of the randomization method results in a number of interesting and insightful conclusions. These conclusions expose some vulnerabilities of the randomization method. We show that the randomization method is unable to effectively achieve privacy in the high dimensional case. We theoretically quantify the degree of randomization required to guarantee privacy as a function of the underlying data dimensionality. Furthermore, we show that the randomization method is susceptible to many natural properties of real data sets such as clusters or outliers. Finally, we show that the use of public information makes the choice of perturbing distribution very critical in a number of subtle ways. Our analysis shows that the inclusion of public information in the analysis makes the goal of privacy preservation more elusive than previously thought for the randomization method.
引用
收藏
页码:111 / 120
页数:10
相关论文
共 50 条
  • [1] Using randomization to break the curse of dimensionality
    Rust, J
    [J]. ECONOMETRICA, 1997, 65 (03) : 487 - 516
  • [3] A Comment on "Using Randomization to Break the Curse of Dimensionality"
    Bray, Robert L.
    [J]. ECONOMETRICA, 2022, 90 (04) : 1915 - 1929
  • [4] Dealing with the curse of dimensionality in systems and control: the randomization paradigm
    Dabbene, Fabrizio
    [J]. 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 2248 - 2256
  • [5] Prior information, sampling distributions, and the curse of dimensionality
    Curtis, A
    Lomax, A
    [J]. GEOPHYSICS, 2001, 66 (02) : 372 - 378
  • [6] Regression with comparisons: Escaping the curse of dimensionality with ordinal information
    Xu, Yichong
    Balakrishnan, Sivaraman
    Singh, Aarti
    Dubrawski, Artur
    [J]. Journal of Machine Learning Research, 2020, 21
  • [7] Regression with Comparisons: Escaping the Curse of Dimensionality with Ordinal Information
    Xu, Yichong
    Balakrishnan, Sivaraman
    Singh, Aarti
    Dubrawski, Artur
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [8] Information leverage in interconnected ecosystems: Overcoming the curse of dimensionality
    Ye, Hao
    Sugihara, George
    [J]. SCIENCE, 2016, 353 (6302) : 922 - 925
  • [9] The Curse of Dimensionality
    Patel, Niketu P.
    Sarraf, Elie
    Tsai, Mitchell H.
    [J]. ANESTHESIOLOGY, 2018, 129 (03) : 614 - 615
  • [10] Nonparametric Regression with Comparisons: Escaping the Curse of Dimensionality with Ordinal Information
    Xu, Yichong
    Muthakana, Hariank
    Balakrishnan, Sivaraman
    Dubrawski, Artur
    Singh, Aarti
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80