Improved Univariate Microaggregation for Integer Values

被引:0
|
作者
Mortazavi, Reza [1 ]
机构
[1] Damghan Univ, Sch Engn, Comp Engn Dept, Damghan, Iran
关键词
Data Privacy; Statistical; Databases; Microdata Protection; Microaggregation; Integer Optimization; DATA-ORIENTED MICROAGGREGATION; DISCLOSURE RISK; K-ANONYMITY; ALGORITHM; UTILITY;
D O I
10.22042/ISECURE.2019.185397.465
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy issues during data publishing is an increasing concern of involved entities. The problem is addressed in the field of statistical disclosure control with the aim of producing protected datasets that are also useful for interested end users such as government agencies and research communities. The problem of producing useful protected datasets is addressed in multiple computational privacy models such as k-anonymity in which data is clustered into groups of at least k members. Microaggregation is a mechanism to realize k-anonymity. The objective is to assign records of a dataset to clusters and replace the original values with their associated cluster centers which are the average of assigned values to minimize information loss in terms of the sum of within group squared errors (SSE). While the problem is shown to be NP-hard in general, there is an optimal polynomial-time algorithm for univariate datasets. This paper shows that the assignment of the univariate microaggregation algorithm cannot produce optimal partitions for integer observations where the computed centroids have to be integer values. In other words, the integrality constraint on published quantities has to be addressed within the algorithm steps and the optimal partition cannot be attained using only the results of the general solution. Then, an effective method that considers the constraint is proposed and analyzed which can handle very large numerical volumes. Experimental evaluations confirm that the developed algorithm not only produces more useful datasets but also is more efficient in comparison with the general optimal univariate algorithm. (C) 2020 ISC. All rights reserved.
引用
收藏
页码:35 / 43
页数:9
相关论文
共 50 条
  • [1] Analysis of the Univariate Microaggregation Disclosure Risk
    Jordi Nin
    Vicenç Torra
    [J]. New Generation Computing, 2009, 27 : 197 - 214
  • [2] Improving Multivariate Microaggregation through Hamiltonian Paths and Optimal Univariate Microaggregation
    Maya-Lopez, Armando
    Casino, Fran
    Solanas, Agusti
    [J]. SYMMETRY-BASEL, 2021, 13 (06):
  • [3] Analysis of the Univariate Microaggregation Disclosure Risk
    Nin, Jordi
    Torra, Vicenc
    [J]. NEW GENERATION COMPUTING, 2009, 27 (03) : 197 - 214
  • [4] A polynomial algorithm for optimal univariate microaggregation
    Hansen, SL
    Mukherjee, S
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (04) : 1043 - 1044
  • [5] Optimal univariate microaggregation with data suppression
    Laszlo, Michael
    Mukherjee, Sumitra
    [J]. JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (03) : 677 - 682
  • [6] Improving the Utility of Differential Privacy via Univariate Microaggregation
    Sanchez, David
    Domingo-Ferrer, Josep
    Martinez, Sergio
    [J]. PRIVACY IN STATISTICAL DATABASES, PSD 2014, 2014, 8744 : 130 - 142
  • [8] Differentially private data publishing via optimal univariate microaggregation and record perturbation
    Soria-Comas, Jordi
    Domingo-Ferrer, Josep
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 153 : 78 - 90
  • [9] Hard to Detect Factors of Univariate Integer Polynomials
    Dennunzio, Alberto
    Formenti, Enrico
    Margara, Luciano
    [J]. MATHEMATICS, 2023, 11 (16)
  • [10] Record values of univariate distribution
    Ahsanullah, M
    [J]. APPLIED STATISTICAL SCIENCE, II: PAPERS IN HONOR OF MUNIR AHMAD, 1997, : 17 - 36