Density-based microaggregation for statistical disclosure control

被引:43
|
作者
Lin, Jun-Lin [1 ]
Wen, Tsung-Hsien [1 ]
Hsieh, Jui-Chien [1 ]
Chang, Pei-Chann [1 ]
机构
[1] Yuan Ze Univ, Dept Informat Management, Chungli, Taiwan
关键词
Mircroaggregation; Disclosure control; k-Anonymity; Microdata protection; K-ANONYMITY; ALGORITHM; PRIVACY;
D O I
10.1016/j.eswa.2009.09.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protection of personal data in statistical databases has recently become a major societal concern. Statistical disclosure control (SDC) is often applied to statistical databases before they are released for public use. Microaggregation for SDC is a family of methods to protect microdata (i.e., records on individuals and/or companies) from individual identification. Microaggregation works by partitioning the microdata into groups of at least k records and, then, replacing the records in each group with the centroid of the group. An optimal microaggregation method must minimize the information loss resulting from this replacement process. However, this problem of minimizing information loss has been shown to be NP-hard for multivariate data. Methods based on various heuristics have been proposed for this problem, but none performs the best for every microdata set and various k values. This work presents a density-based algorithm (DBA) for microaggregation. The DBA first forms groups of records by the descending order of their densities, then fine-tunes these groups in reverse order. The performance of the DBA is compared against the latest microaggregation methods. Experimental results indicate that DBA incurs the least information loss for over half of the test situations. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3256 / 3263
页数:8
相关论文
共 50 条
  • [1] Microaggregation heuristic applied to statistical disclosure control
    Fadel, Augusto César
    Ochi, Luiz Satoru
    Brito, José André de Moura
    Semaan, Gustavo Silva
    [J]. Information Sciences, 2021, 548 : 37 - 55
  • [2] Microaggregation heuristic applied to statistical disclosure control
    Fadel, Augusto Cesar
    Ochi, Luiz Satoru
    Moura Brito, Jose Andre de
    Semaan, Gustavo Silva
    [J]. INFORMATION SCIENCES, 2021, 548 : 37 - 55
  • [3] Disclosure Control of Business Microdata: A Density-Based Approach
    Ichim, Daniela
    [J]. INTERNATIONAL STATISTICAL REVIEW, 2009, 77 (02) : 196 - 211
  • [4] Practical data-oriented microaggregation for statistical disclosure control
    Domingo-Ferrer, J
    Mateo-Sanz, JM
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) : 189 - 201
  • [5] Microaggregation Sorting Framework for K-Anonymity Statistical Disclosure Control in Cloud Computing
    Kabir, Md Enamul
    Mahmood, Abdun Naser
    Wang, Hua
    Mustafa, Abdul K.
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (02) : 408 - 417
  • [6] Density-based Control of Multiple Robots
    Zhao, Sheng
    Ramakrishnan, Subramanian
    Kumar, Manish
    [J]. 2011 AMERICAN CONTROL CONFERENCE, 2011, : 481 - 486
  • [7] Spectral density-based statistical measures for image sharpness
    Zhang, NF
    Vladar, AE
    Postek, MT
    Larrabee, RD
    [J]. METROLOGIA, 2005, 42 (05) : 351 - 359
  • [8] A Functional Density-Based Nonparametric Approach for Statistical Calibration
    Hernandez, Noslen
    Biscay, Rolando J.
    Villa-Vialaneix, Nathalie
    Talavera, Isneri
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, 2010, 6419 : 450 - 457
  • [9] Understanding Microaggregation- A technique of Statistical Disclosure Control for Privacy Preserving and Data Publishing in Inter-Cloud
    Gadad, Veena
    Sowmyarani, C. N.
    [J]. 2018 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2018,
  • [10] On the disclosure risk of multivariate microaggregation
    Nin, Jordi
    Herranz, Javier
    Torra, Vicenc
    [J]. DATA & KNOWLEDGE ENGINEERING, 2008, 67 (03) : 399 - 412