A High-Availability K-modes Clustering Method Based on Differential Privacy

Cited by: 1
Authors
Zhang, Shaobo [1, 2, 3]
Yuan, Liujie [1, 2]
Li, Yuxing [1, 2]
Chen, Wenli [1, 2]
Ding, Yifei [1, 2]
Affiliations
[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China
[2] Hunan Key Lab Serv Comp & New Software Serv Techn, Xiangtan 411201, Peoples R China
[3] Natl Univ Def Technol, Coll Comp, Key Lab Software Engn Complex Syst, Changsha 410073, Peoples R China
Keywords
Privacy protection; Categorical data mining; Differential privacy; K-modes clustering; ALGORITHM
DOI
10.1007/978-3-030-95388-1_18
Chinese Library Classification number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
K-modes is a classic, widely used algorithm for mining categorical data. However, the data it analyzes usually contain sensitive user information, and a leak of these data would seriously threaten user privacy. Existing methods that combine differential privacy with the K-modes algorithm can effectively prevent such leakage. Nevertheless, the noise that differential privacy adds to protect the data reduces the availability of the clustering results. In this paper, we propose a high-availability K-modes clustering mechanism based on differential privacy (HAKC). While still using differential privacy to protect data privacy, this mechanism selects the initial centroids of the clustering by calculation and improves the method for calculating the distance between a data point and a centroid during the iterative process.
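To make the trade-off in the abstract concrete, the following is a minimal Python sketch of one iteration of a generic differentially private K-modes update. It is not the HAKC mechanism: the abstract does not give the paper's centroid-selection or distance formulas, so the sketch uses the standard simple matching dissimilarity and perturbs per-category counts with Laplace noise. The function names, the sensitivity of 1 for a count query, and the way `epsilon` is spent (once per attribute per iteration) are all assumptions made for illustration.

```python
import random
from collections import Counter


def matching_distance(x, y):
    """Simple matching dissimilarity: number of attributes that differ."""
    return sum(a != b for a, b in zip(x, y))


def laplace(scale, rng):
    """Laplace(0, scale) sample, drawn as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_mode(values, domain, epsilon, rng):
    """Return the category whose Laplace-perturbed count is largest.

    Assumes each count has sensitivity 1, so the noise scale is 1 / epsilon.
    """
    counts = Counter(values)
    noisy = {c: counts.get(c, 0) + laplace(1.0 / epsilon, rng) for c in domain}
    return max(domain, key=lambda c: noisy[c])


def dp_kmodes_step(points, modes, domains, epsilon, rng):
    """One iteration: assign each point to its nearest mode, then
    recompute every cluster's mode from noisy per-attribute counts."""
    clusters = [[] for _ in modes]
    for p in points:
        nearest = min(range(len(modes)),
                      key=lambda i: matching_distance(p, modes[i]))
        clusters[nearest].append(p)
    return [
        tuple(noisy_mode([m[j] for m in members], domains[j], epsilon, rng)
              for j in range(len(domains)))
        for members in clusters
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    points = [("a", "x"), ("a", "x"), ("a", "y"),
              ("b", "y"), ("b", "y"), ("b", "x")]
    domains = [["a", "b"], ["x", "y"]]   # hypothetical attribute domains
    modes = [("a", "x"), ("b", "y")]     # hypothetical initial modes
    print(dp_kmodes_step(points, modes, domains, epsilon=1.0, rng=rng))
```

A smaller `epsilon` gives stronger privacy but noisier counts, so the selected modes drift further from the true ones. This is the loss of availability that the abstract refers to. Because the noise is spent again in every iteration, the total privacy budget grows with the number of iterations under sequential composition.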
Pages: 274-283
Page count: 10
Related papers
50 in total
  • [11] K-Modes clustering algorithm based on a new distance measure
    Liang, Jiye
    Bai, Liang
    Cao, Fuyuan
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (10) : 1749 - 1755
  • [12] A dissimilarity measure for the k-Modes clustering algorithm
    Cao, Fuyuan
    Liang, Jiye
    Li, Deyu
    Bai, Liang
    Dang, Chuangyin
    KNOWLEDGE-BASED SYSTEMS, 2012, 26 : 120 - 127
  • [13] Block Fuzzy K-modes Clustering Algorithm
    Yang, Miin-Shen
    Lin, Chih-Ying
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009 : 384 - 389
  • [14] CLEKMODES: a modified k-modes clustering algorithm
    Mastrogiannis, N.
    Giannikos, I.
    Boutsinas, B.
    Antzoulatos, G.
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2009, 60 (08) : 1085 - 1095
  • [15] Attribute value weighting in k-modes clustering
    He, Zengyou
    Xu, Xiaofei
    Deng, Shengchun
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 15365 - 15369
  • [16] Initialization of K-Modes Clustering for Categorical Data
    Li Tao-ying
    Chen Yan
    Jin Zhi-hong
    Li Ye
    2013 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING (ICMSE), 2013 : 107 - 112
  • [17] Research on Seafood Traceable Data Based on k-Modes Clustering Algorithm
    Ge, Li
    Li, Jiajun
    Chen, Jun
    JOURNAL OF COASTAL RESEARCH, 2020 : 73 - 77
  • [18] Software cost estimation based on modified K-Modes clustering Algorithm
    Bishnu, Partha Sarathi
    Bhattacherjee, Vandana
    NATURAL COMPUTING, 2016, 15 (03) : 415 - 422
  • [19] Software cost estimation based on modified K-Modes clustering Algorithm
    Partha Sarathi Bishnu
    Vandana Bhattacherjee
    Natural Computing, 2016, 15 : 415 - 422
  • [20] Application of metaheuristic based fuzzy K-modes algorithm to supplier clustering
    Kuo, R. J.
    Potti, Yuliana
    Zulvia, Ferani E.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 120 : 298 - 307