Density-based microaggregation for statistical disclosure control

被引:43
|
作者
Lin, Jun-Lin [1 ]
Wen, Tsung-Hsien [1 ]
Hsieh, Jui-Chien [1 ]
Chang, Pei-Chann [1 ]
机构
[1] Yuan Ze Univ, Dept Informat Management, Chungli, Taiwan
关键词
Mircroaggregation; Disclosure control; k-Anonymity; Microdata protection; K-ANONYMITY; ALGORITHM; PRIVACY;
D O I
10.1016/j.eswa.2009.09.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protection of personal data in statistical databases has recently become a major societal concern. Statistical disclosure control (SDC) is often applied to statistical databases before they are released for public use. Microaggregation for SDC is a family of methods to protect microdata (i.e., records on individuals and/or companies) from individual identification. Microaggregation works by partitioning the microdata into groups of at least k records and, then, replacing the records in each group with the centroid of the group. An optimal microaggregation method must minimize the information loss resulting from this replacement process. However, this problem of minimizing information loss has been shown to be NP-hard for multivariate data. Methods based on various heuristics have been proposed for this problem, but none performs the best for every microdata set and various k values. This work presents a density-based algorithm (DBA) for microaggregation. The DBA first forms groups of records by the descending order of their densities, then fine-tunes these groups in reverse order. The performance of the DBA is compared against the latest microaggregation methods. Experimental results indicate that DBA incurs the least information loss for over half of the test situations. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3256 / 3263
页数:8
相关论文
共 50 条
  • [41] Density-Based Statistical Clustering: Enabling Sidefire Ultrasonic Traffic Sensing in Smart Cities
    Luecken, Volker
    Voss, Nils
    Schreier, Julien
    Baag, Thomas
    Gehring, Michael
    Raschen, Matthias
    Lanius, Christian
    Leupers, Rainer
    Ascheid, Gerd
    JOURNAL OF ADVANCED TRANSPORTATION, 2018,
  • [42] Generalizing Local Density for Density-Based Clustering
    Lin, Jun-Lin
    SYMMETRY-BASEL, 2021, 13 (02): : 1 - 24
  • [43] Density-Based Clustering for Adaptive Density Variation
    Qian, Li
    Plant, Claudia
    Boehm, Christian
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1282 - 1287
  • [44] Density-based Pattern Control For Spacecraft Swarms Around Circular Orbits
    Yang, Chihang
    Zhang, Hao
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 5915 - 5921
  • [45] Directional maximum length scale control in density-based topology optimization
    Song, Longlong
    Gao, Tong
    Wang, Jie
    Zhang, Weihong
    COMPUTERS & STRUCTURES, 2024, 292
  • [46] A firm foundation for statistical disclosure control
    Nobuaki Hoshino
    Japanese Journal of Statistics and Data Science, 2020, 3 : 721 - 746
  • [48] Spatial Smoothing and Statistical Disclosure Control
    de Jonge, Edwin
    de Wolf, Peter-Paul
    PRIVACY IN STATISTICAL DATABASES: UNESCO CHAIR IN DATA PRIVACY, 2016, 9867 : 107 - 117
  • [49] Statistical Disclosure Control: To Trust or Not to Trust
    Giggins, Helen
    Brankovic, Ljiljana
    CSA 2008: INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND ITS APPLICATIONS, PROCEEDINGS, 2008, : 108 - 113
  • [50] Measurement Error and Statistical Disclosure Control
    Shlomo, Natalie
    PRIVACY IN STATISTICAL DATABASES, 2010, 6344 : 118 - 126