Efficient privacy-preserving outsourced k-means clustering on distributed data

被引:2
|
作者
Qiu, Guowei [1 ]
Zhao, Yingliang [1 ]
Gui, Xiaolin [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, Shaanxi Prov Key Lab Comp Network, Xian, Peoples R China
关键词
Privacy-preserving clustering; Secure comparison; Secret sharing; Homomorphic encryption; MULTIPARTY COMPUTATION; SECURE; EQUALITY;
D O I
10.1016/j.ins.2024.120687
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Today, more and more data is collected and stored by different organizations. When the data is distributed among multiple users who wish to perform data mining on the joint data, outsourcing the task to cloud servers becomes an attractive solution for users who lack professional skills. However, privacy concerns make users reluctant to adopt such solutions. In this paper, we propose a privacy -preserving outsourced k -means clustering (PPOKC) algorithm on distributed data. Our system consists of multiple users and two non -colluding servers. Each user submits their data to the servers, which perform k -means clustering over the joint data without compromising data privacy. Unlike existing solutions, which typically include a cryptographic server and a computing server, each server in our system provides both services. Based on this architecture, we first design several important sub -protocols, including secure comparison, secure minimum and secure division. We then use these protocols to construct an efficient and highly secure PPOKC algorithm. The implementation of our algorithm relies heavily on secret sharing techniques, complemented by homomorphic encryption. We also conduct theoretical analysis and experiments. Security analysis shows that our scheme guarantees the security of the input/output data, the intermediate results and the data access pattern under the semi -honest model. Complexity analysis and numerical experiments show that our algorithm has good efficiency and is suitable for practical applications.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Achieving Efficient and Privacy-Preserving k-NN Query for Outsourced eHealthcare Data
    Zheng, Yandong
    Lu, Rongxing
    Shao, Jun
    JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (05)
  • [32] Privacy Preserving K-means Clustering: A Survey Research
    Meskine, Fatima
    Bahloul, Safia Nait
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2012, 9 (02) : 194 - 200
  • [33] An Efficient Privacy-Preserving Outsourced Computation over Public Data
    Liu, Ximeng
    Qin, Baodong
    Deng, Robert H.
    Li, Yingjiu
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2017, 10 (05) : 756 - 770
  • [34] Achieving Efficient and Privacy-Preserving k-NN Query for Outsourced eHealthcare Data
    Yandong Zheng
    Rongxing Lu
    Jun Shao
    Journal of Medical Systems, 2019, 43
  • [35] Privacy-preserving k-means clustering with local synchronization in peer-to-peer networks
    Zhu, Youwen
    Li, Xingxin
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2020, 13 (06) : 2272 - 2284
  • [36] Privacy-preserving k-means clustering with local synchronization in peer-to-peer networks
    Youwen Zhu
    Xingxin Li
    Peer-to-Peer Networking and Applications, 2020, 13 : 2272 - 2284
  • [37] Privacy Preserving Clustering: A k-Means Type Extension
    Li, Wenye
    NEURAL INFORMATION PROCESSING (ICONIP 2014), PT II, 2014, 8835 : 319 - 326
  • [38] A Parallel Privacy-Preserving k-Means Clustering Algorithm for Encrypted Databases in Cloud Computing
    Song, Youngho
    Kim, Hyeong-Jin
    Lee, Hyun-Jo
    Chang, Jae-Woo
    APPLIED SCIENCES-BASEL, 2024, 14 (02):
  • [39] Privacy-Preserving Cloud-Edge Collaborative K-Means Clustering Model in IoT
    Wang, Chen
    Xu, Jian
    Tan, Shanru
    Yin, Long
    FRONTIERS IN CYBER SECURITY, FCS 2023, 2024, 1992 : 655 - 669
  • [40] Privacy-preserving distributed clustering
    Erkin, Zekeriya
    Veugen, Thijs
    Toft, Tomas
    Lagendijk, Reginald L.
    EURASIP JOURNAL ON INFORMATION SECURITY, 2013, (01):