A High-Availability K-modes Clustering Method Based on Differential Privacy

被引:1
|
作者
Zhang, Shaobo [1 ,2 ,3 ]
Yuan, Liujie [1 ,2 ]
Li, Yuxing [1 ,2 ]
Chen, Wenli [1 ,2 ]
Ding, Yifei [1 ,2 ]
机构
[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China
[2] Hunan Key Lab Serv Comp & New Software Serv Techn, Xiangtan 411201, Peoples R China
[3] Natl Univ Def Technol, Coll Comp, Key Lab Software Engn Complex Syst, Changsha 410073, Peoples R China
关键词
Privacy protection; Categorical data mining; Differential privacy; K-modes clustering; ALGORITHM;
D O I
10.1007/978-3-030-95388-1_18
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In categorical data mining, the K-modes algorithm is a classic algorithm that has been widely used. However, the data analyzed by the K-modes algorithm usually contains sensitive user information. If these data are leaked, it will seriously threaten the privacy of users. In response to this problem, the existing method that combines differential privacy with the K-modes algorithm can effectively prevent privacy leakage. Nevertheless, differential privacy adds noise to the data while protecting data privacy, which will reduce the availability of clustering results. In this paper, we propose a high-availability K-modes clustering mechanism based on differential privacy(HAKC). In this mechanism, based on the use of differential privacy to protect data privacy, we select the initial centroid of the clustering by calculation, and improve the calculation method of the distance between the data point and the centroid in the iterative process.
引用
下载
收藏
页码:274 / 283
页数:10
相关论文
共 50 条
  • [41] Metaheuristic-based possibilistic fuzzy k-modes algorithms for categorical data clustering
    Kuo, R. J.
    Zheng, Y. R.
    Thi Phuong Quyen Nguyen
    INFORMATION SCIENCES, 2021, 557 : 1 - 15
  • [42] A genetic fuzzy k-Modes algorithm for clustering categorical data
    Gan, G.
    Wu, J.
    Yang, Z.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) : 1615 - 1620
  • [43] Initial seed selection for K-modes clustering-A distance and density based approach
    Sajidha, S. A.
    Chodnekar, Siddha Prabhu
    Desikan, Kalyani
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (06) : 693 - 701
  • [44] High-availability Internet servers: Linux clustering on a CompactPCI platform
    Mueller, H
    Scott, R
    EDN, 2000, 45 (25) : 143 - +
  • [45] Parallel k-modes Algorithm based on MapReduce
    Guo Tao
    Ding Xiangwu
    Li Yefeng
    2015 THIRD INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION, NETWORKING, AND WIRELESS COMMUNICATIONS (DINWC), 2015, : 176 - 179
  • [46] A Global-Relationship Dissimilarity Measure for the k-Modes Clustering Algorithm
    Zhou, Hongfang
    Zhang, Yihui
    Liu, Yibin
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2017, 2017
  • [47] DK-BKM: Decremental K Belief K-Modes Method
    Ben Hariz, Sarra
    Elouedi, Zied
    SCALABLE UNCERTAINTY MANAGEMENT, SUM 2010, 2010, 6379 : 84 - 97
  • [48] Differential Privacy Data Protection Method Based on Clustering
    Li Li-xin
    Ding Yong-shan
    Wang Jia-yan
    2017 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2017, : 11 - 16
  • [49] Categorical fuzzy k-modes clustering with automated feature weight learning
    Saha, Arkajyoti
    Das, Swagatam
    NEUROCOMPUTING, 2015, 166 : 422 - 435
  • [50] A Moving Shape-based Robust Fuzzy K-modes Clustering Algorithm for Electricity Profiles
    Liu, Chang
    Wang, Xiaodi
    Huang, Yuan
    Liu, Youbo
    Li, Ran
    Li, Yang
    Liu, Junyong
    ELECTRIC POWER SYSTEMS RESEARCH, 2020, 187