Density-based microaggregation for statistical disclosure control

被引:43
|
作者
Lin, Jun-Lin [1 ]
Wen, Tsung-Hsien [1 ]
Hsieh, Jui-Chien [1 ]
Chang, Pei-Chann [1 ]
机构
[1] Yuan Ze Univ, Dept Informat Management, Chungli, Taiwan
关键词
Mircroaggregation; Disclosure control; k-Anonymity; Microdata protection; K-ANONYMITY; ALGORITHM; PRIVACY;
D O I
10.1016/j.eswa.2009.09.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protection of personal data in statistical databases has recently become a major societal concern. Statistical disclosure control (SDC) is often applied to statistical databases before they are released for public use. Microaggregation for SDC is a family of methods to protect microdata (i.e., records on individuals and/or companies) from individual identification. Microaggregation works by partitioning the microdata into groups of at least k records and, then, replacing the records in each group with the centroid of the group. An optimal microaggregation method must minimize the information loss resulting from this replacement process. However, this problem of minimizing information loss has been shown to be NP-hard for multivariate data. Methods based on various heuristics have been proposed for this problem, but none performs the best for every microdata set and various k values. This work presents a density-based algorithm (DBA) for microaggregation. The DBA first forms groups of records by the descending order of their densities, then fine-tunes these groups in reverse order. The performance of the DBA is compared against the latest microaggregation methods. Experimental results indicate that DBA incurs the least information loss for over half of the test situations. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3256 / 3263
页数:8
相关论文
共 50 条
  • [21] Statistical disclosure control based on random uncertainty intervals
    Wang, JF
    ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 2002, : 244 - 253
  • [22] Novel density-based and hierarchical density-based clustering algorithms for uncertain data
    Zhang, Xianchao
    Liu, Han
    Zhang, Xiaotong
    NEURAL NETWORKS, 2017, 93 : 240 - 255
  • [23] Density-Based Traffic Control System Using Artificial Intelligence
    Sabeenian, R. S.
    Ramapriya, R.
    Swetha, S.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 3, 2023, 492 : 417 - 425
  • [24] Appraisal of density-based field compaction control test validity
    Sadrekarimi, J.
    Seyyedi, S.
    BEARING CAPACITY OF ROADS, RAILWAYS AND AIRFIELDS, VOLS 1 AND 2, 2009, : 739 - 744
  • [25] Statistical and density-based clustering of geographical flows for crowd movement patterns recognition
    Tang, Jianbo
    Zhao, Yuxin
    Yang, Xuexi
    Deng, Min
    Liu, Huimin
    Ding, Chen
    Peng, Ju
    Mei, Xiaoming
    APPLIED SOFT COMPUTING, 2024, 163
  • [26] Rounding based continuous data discretization for statistical disclosure control
    Senavirathne N.
    Torra V.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (11) : 15139 - 15157
  • [27] Application of a Statistical Disclosure Control Techniques Based on Multiplicative Noise
    Kim, Young-Won
    Kim, Tae-Yeon
    Kim, Kye-Nam
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (01) : 127 - 136
  • [28] Density-based label placement
    Lhuillier, Antoine
    van Garderen, Mereke
    Weiskopf, Daniel
    VISUAL COMPUTER, 2019, 35 (6-8): : 1041 - 1052
  • [29] Density-based algorithm in MapReduce
    Pang Lin
    Liu Fang-ai
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 394 - 397
  • [30] Density-based view materialization
    Das, A
    Bhattacharyya, DK
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2005, 3776 : 589 - 594