Large-scale k-means clustering with user-centric privacy-preservation

被引:0
|
作者
Jun Sakuma
Shigenobu Kobayashi
机构
[1] University of Tsukuba,Department of Computer Science
[2] Tokyo Institute of Technology,Department of Computational Intelligence and Systems Science
来源
关键词
Privacy; Privacy-preserving data mining; Clustering; -means; Peer-to-peer;
D O I
暂无
中图分类号
学科分类号
摘要
A k-means clustering with a new privacy-preserving concept, user-centric privacy preservation, is presented. In this framework, users can conduct data mining using their private information by storing them in their local storage. After the computation, they obtain only the mining result without disclosing private information to others. In most cases, the number of parties that can join conventional privacy-preserving data mining has been assumed to be only two. In our framework, we assume large numbers of parties join the protocol; therefore, not only scalability but also asynchronism and fault-tolerance is important. Considering this, we propose a k-mean algorithm combined with a decentralized cryptographic protocol and a gossip-based protocol. The computational complexity is O(log n) with respect to the number of parties n, and experimental results show that our protocol is scalable even with one million parties.
引用
收藏
页码:253 / 279
页数:26
相关论文
共 50 条
  • [21] Privacy-Preserving and Outsourced Multi-User k-Means Clustering
    Rao, Fang-Yu
    Samanthula, Bharath K.
    Bertino, Elisa
    Yi, Xun
    Liu, Dongxi
    [J]. 2015 IEEE CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (CIC), 2015, : 80 - 89
  • [22] Efficient Privacy Preserving K-Means Clustering
    Upmanyu, Maneesh
    Namboodiri, Anoop M.
    Srinathan, Kannan
    Jawahar, C. V.
    [J]. INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2010, 6122 : 154 - 166
  • [23] K-Means Clustering with Local Distance Privacy
    Yang, Mengmeng
    Huang, Longxia
    Tang, Chenghua
    [J]. BIG DATA MINING AND ANALYTICS, 2023, 6 (04) : 433 - 442
  • [24] Privacy Preserving Approximate K-means Clustering
    Biswas, Chandan
    Ganguly, Debasis
    Roy, Dwaipayan
    Bhattacharya, Ujjwal
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1321 - 1330
  • [25] Outlier-eliminated k-means clustering algorithm based on differential privacy preservation
    Qingying Yu
    Yonglong Luo
    Chuanming Chen
    Xintao Ding
    [J]. Applied Intelligence, 2016, 45 : 1179 - 1191
  • [26] Outlier-eliminated k-means clustering algorithm based on differential privacy preservation
    Yu, Qingying
    Luo, Yonglong
    Chen, Chuanming
    Ding, Xintao
    [J]. APPLIED INTELLIGENCE, 2016, 45 (04) : 1179 - 1191
  • [27] Deep clustering of small molecules at large-scale via variational autoencoder embedding and K-means
    Hamid Hadipour
    Chengyou Liu
    Rebecca Davis
    Silvia T. Cardona
    Pingzhao Hu
    [J]. BMC Bioinformatics, 23
  • [28] K-means Clustering Algorithm for Large-scale Chinese Commodity Information Web Based on Hadoop
    Geng Yushui
    Zhang Lishuo
    [J]. 14TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS, ENGINEERING AND SCIENCE (DCABES 2015), 2015, : 256 - 259
  • [29] A Semantic Partition Algorithm Based on Improved K-Means Clustering for Large-Scale Indoor Areas
    Shi, Kegong
    Yan, Jinjin
    Yang, Jinquan
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2024, 13 (02)
  • [30] Optimal Operation of Large-scale Electric Vehicles Based on Improved K-means Clustering Algorithm
    Liu, Jian
    Xu, Weifeng
    Liu, Zhijun
    Fu, Guanhua
    Jiang, Yunpeng
    Zhao, Ergang
    [J]. PROCEEDINGS OF 2022 5TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA2022, 2022, : 23 - 28