DPLK-means: A novel Differential Privacy K-means Mechanism

被引:33
|
作者
Ren, Jun [1 ]
Xiong, Jinbo [1 ,2 ]
Yao, Zhiqiang [1 ,2 ]
Ma, Rong [1 ]
Lin, Mingwei [1 ,2 ]
机构
[1] Fujian Normal Univ, Fac Software, Fuzhou, Fujian, Peoples R China
[2] Fujian Engn Res Ctr Publ Serv Big Data Min & Appl, Fuzhou, Fujian, Peoples R China
来源
2017 IEEE SECOND INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC) | 2017年
基金
中国国家自然科学基金;
关键词
Data mining; privacy disclosure; k-means algorithm; differential privacy mechanism;
D O I
10.1109/DSC.2017.64
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
K-means algorithm is an important type of clustering algorithm and the foundation of some data mining methods. But it has the risk of privacy disclosure in the process of clustering. In order to solve this problem, Blum et al. proposed a differential privacy K-means algorithm, which can prevent privacy disclosure effectively. However, the availability of clustering results is reduced due to the added noise. In this paper, we propose a novel DPLK-means algorithm based on differential privacy, which improves the selection of the initial center points through performing the differential privacy K-means algorithm to each subset divided by the original dataset. Performance evaluation shows that our algorithm improves the availability of clustering results compared to the existing differential privacy K-means algorithm at the same privacy level.
引用
收藏
页码:133 / 139
页数:7
相关论文
共 50 条
  • [41] Balanced k-Means
    Tai, Chen-Ling
    Wang, Chen-Shu
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2017), PT II, 2017, 10192 : 75 - 82
  • [42] GAPBAS: Genetic algorithm-based privacy budget allocation strategy in differential privacy K-means clustering algorithm
    Li, Yong
    Song, Xiao
    Tu, Yuchun
    Liu, Ming
    COMPUTERS & SECURITY, 2024, 139
  • [43] A Modified K-means Algorithms - Bi-Level K-Means Algorithm
    Yu, Shyr-Shen
    Chu, Shao-Wei
    Wang, Ching-Lin
    Chan, Yung-Kuan
    Chuang, Chia-Yi
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON SOFT COMPUTING IN INFORMATION COMMUNICATION TECHNOLOGY, 2014, : 10 - 13
  • [44] A Modified K-means Algorithm - Two-Layer K-means Algorithm
    Liu, Chen-Chung
    Chu, Shao-Wei
    Chan, Yung-Kuan
    Yu, Shyr-Shen
    2014 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2014), 2014, : 447 - 450
  • [45] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [46] Sorted K-Means Towards the Enhancement of K-Means to Form Stable Clusters
    Arora, Preeti
    Virmani, Deepali
    Jindal, Himanshu
    Sharma, Mritunjaya
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMMUNICATION AND NETWORKS, 2017, 508 : 479 - 486
  • [47] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [48] k*-means:: A new generalized k-means clustering algorithm
    Cheung, YM
    PATTERN RECOGNITION LETTERS, 2003, 24 (15) : 2883 - 2893
  • [49] RETRACTED: CVDP k-means clustering algorithm for differential privacy based on coefficient of variation (Retracted Article)
    Kong, Yuting
    Qian, Yurong
    Tan, Fuxiang
    Bai, Lu
    Shao, Jinxin
    Ma, Tinghuai
    Tereshchenko, Sergei Nikolayevich
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (05) : 6027 - 6045
  • [50] K-Means Clustering With Local dχ-Privacy for Privacy-Preserving Data Analysis
    Yang, Mengmeng
    Tjuawinata, Ivan
    Lam, Kwok-Yan
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 2524 - 2537