Large-scale k-means clustering with user-centric privacy-preservation

被引:0
|
作者
Jun Sakuma
Shigenobu Kobayashi
机构
[1] University of Tsukuba,Department of Computer Science
[2] Tokyo Institute of Technology,Department of Computational Intelligence and Systems Science
来源
关键词
Privacy; Privacy-preserving data mining; Clustering; -means; Peer-to-peer;
D O I
暂无
中图分类号
学科分类号
摘要
A k-means clustering with a new privacy-preserving concept, user-centric privacy preservation, is presented. In this framework, users can conduct data mining using their private information by storing them in their local storage. After the computation, they obtain only the mining result without disclosing private information to others. In most cases, the number of parties that can join conventional privacy-preserving data mining has been assumed to be only two. In our framework, we assume large numbers of parties join the protocol; therefore, not only scalability but also asynchronism and fault-tolerance is important. Considering this, we propose a k-mean algorithm combined with a decentralized cryptographic protocol and a gossip-based protocol. The computational complexity is O(log n) with respect to the number of parties n, and experimental results show that our protocol is scalable even with one million parties.
引用
收藏
页码:253 / 279
页数:26
相关论文
共 50 条
  • [1] Large-scale k-means clustering with user-centric privacy-preservation
    Sakuma, Jun
    Kobayashi, Shigenobu
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 25 (02) : 253 - 279
  • [2] Large-scale k-means clustering with user-centric privacy preservation
    Sakuma, Jun
    Kobayashi, Shigenobu
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 320 - 332
  • [3] Scalable k-means for large-scale clustering
    Ming, Yuewei
    Zhu, En
    Wang, Mao
    Liu, Qiang
    Liu, Xinwang
    Yin, Jianping
    [J]. INTELLIGENT DATA ANALYSIS, 2019, 23 (04) : 825 - 838
  • [4] Compressed K-Means for Large-Scale Clustering
    Shen, Xiaobo
    Liu, Weiwei
    Tsang, Ivor
    Shen, Fumin
    Sun, Quan-Sen
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2527 - 2533
  • [5] Privacy Preservation in k-Means Clustering by Cluster Rotation
    Dhiraj, S. S. Shivaji
    Khan, Ameer M. Asif
    Khan, Wajhiulla
    Challagalla, Ajay
    [J]. TENCON 2009 - 2009 IEEE REGION 10 CONFERENCE, VOLS 1-4, 2009, : 1437 - 1443
  • [6] Large-scale k-means clustering via variance reduction
    Zhao, Yawei
    Ming, Yuewei
    Liu, Xinwang
    Zhu, En
    Zhao, Kaikai
    Yin, Jianping
    [J]. NEUROCOMPUTING, 2018, 307 : 184 - 194
  • [7] Practical Privacy-Preserving MapReduce Based K-Means Clustering Over Large-Scale Dataset
    Yuan, Jiawei
    Tian, Yifan
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2019, 7 (02) : 568 - 579
  • [8] Regularized and Sparse Stochastic K-Means for Distributed Large-Scale Clustering
    Jumutc, Vilen
    Langone, Rocco
    Suykens, Johan A. K.
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2535 - 2540
  • [9] Fast K-means for Large Scale Clustering
    Hu, Qinghao
    Wu, Jiaxiang
    Bai, Lu
    Zhang, Yifan
    Cheng, Jian
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2099 - 2102
  • [10] User-Centric Privacy Preservation in Data-Sharing Applications
    Gao, Feng
    He, Jingsha
    Peng, Shufen
    [J]. NETWORK AND PARALLEL COMPUTING, 2010, 6289 : 423 - +