Privacy-preserving collaborative fuzzy clustering

Cited by: 16
Authors
Lyu, Lingjuan [1]
Bezdek, James C. [2]
Law, Yee Wei [3]
He, Xuanli [2]
Palaniswami, Marimuthu [1]
Affiliations
[1] Univ Melbourne, Dept Elect & Elect Engn, Parkville, Vic, Australia
[2] Univ Melbourne, Sch Comp & Informat Syst, Parkville, Vic, Australia
[3] Univ South Australia, Sch Engn, Mawson Lakes, Australia
Keywords
Participatory sensing; Collaborative learning; Clustering; Privacy-preserving; Randomisation; DATA PERTURBATION; JOHNSON-LINDENSTRAUSS; VISUAL ASSESSMENT; NOISE; TENDENCY;
DOI
10.1016/j.datak.2018.05.002
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The proliferation of Internet of Things devices has contributed to the emergence of participatory sensing (PS), where multiple individuals collect and report their data to a third-party data mining cloud service for analysis. The need for the participants to collaborate with each other for this analysis gives rise to the concept of collaborative learning. However, the possibility of the cloud service being semi-honest poses a key challenge: preserving the participants' privacy. In this paper, we address this challenge with a two-stage scheme called RG + RP: in the first stage, each participant perturbs his/her data by passing the data through a nonlinear function called repeated Gompertz (RG); in the second stage, he/she then projects his/her perturbed data to a lower dimension in an (almost) distance-preserving manner, using a specific random projection (RP) matrix. The nonlinear RG function is designed to mitigate maximum a posteriori (MAP) estimation attacks, while random projection resists independent component analysis (ICA) attacks and ensures clustering accuracy. The proposed two-stage randomisation scheme is assessed in terms of its recovery resistance to MAP estimation attacks. Preliminary theoretical analysis as well as experimental results on synthetic and real-world datasets indicate that RG + RP has better recovery resistance to MAP estimation attacks than most state-of-the-art techniques. For clustering, fuzzy c-means (FCM) is used. Results using seven cluster validity indices, root mean squared error (RMSE) and accuracy ratio show that clustering results based on two-stage-perturbed data are comparable to the clustering results based on raw data. This confirms the utility of our privacy-preserving scheme when used with either FCM or hard c-means (HCM).
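The two-stage idea in the abstract (nonlinear perturbation, then an approximately distance-preserving random projection) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not define the repeated Gompertz (RG) function or the specific RP matrix, so the code assumes a plain Gompertz curve as a stand-in for stage one and a Gaussian matrix scaled by 1/sqrt(k), a common Johnson-Lindenstrauss construction, for stage two. All function names, parameters, and dimensions are hypothetical.

```python
import math
import random


def gompertz(x, a=1.0, b=1.0, c=1.0):
    # ASSUMPTION: plain Gompertz curve a*exp(-b*exp(-c*x)),
    # used only as a stand-in for the paper's repeated Gompertz (RG).
    return a * math.exp(-b * math.exp(-c * x))


def random_projection_matrix(d, k, rng):
    # ASSUMPTION: k x d Gaussian matrix scaled by 1/sqrt(k); the paper's
    # "specific" RP matrix is not described in the abstract.
    scale = 1.0 / math.sqrt(k)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(d)] for _ in range(k)]


def project(matrix, x):
    # Matrix-vector product: maps a d-dimensional record to k dimensions.
    return [sum(r * v for r, v in zip(row, x)) for row in matrix]


def perturb(x, matrix):
    # Stage 1: element-wise nonlinear perturbation.
    # Stage 2: random projection to a lower dimension.
    return project(matrix, [gompertz(v) for v in x])


def euclidean(u, v):
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(u, v)))


rng = random.Random(0)
d, k = 200, 50  # hypothetical original and reduced dimensions
R = random_projection_matrix(d, k, rng)  # assumed shared by all participants

x = [rng.random() for _ in range(d)]
y = [rng.random() for _ in range(d)]

# Distance after stage 1 only, versus distance after both stages.
before = euclidean([gompertz(v) for v in x], [gompertz(v) for v in y])
after = euclidean(perturb(x, R), perturb(y, R))
ratio = after / before
print(f"distance ratio after projection: {ratio:.3f}")
```

In this sketch the projection keeps pairwise distances between the stage-one outputs roughly intact (the ratio stays near 1 for moderately large k), which is the property a distance-based clustering algorithm such as FCM relies on. The nonlinear stage does change distances relative to the raw data, and how the paper controls that effect is not covered by the abstract.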
Pages: 21-41
Number of pages: 21
Related papers
50 in total
  • [1] Privacy-Preserving Data Mining in Homogeneous Collaborative Clustering
    Ouda, Mohamed
    Salem, Sameh
    Ali, Ihab
    Saad, El-Sayed
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (06) : 604 - 612
  • [2] Privacy-preserving collaborative filtering
    Polat, H
    Du, WL
    [J]. INTERNATIONAL JOURNAL OF ELECTRONIC COMMERCE, 2005, 9 (04) : 9 - 35
  • [3] Privacy-preserving distributed clustering
    Erkin, Zekeriya
    Veugen, Thijs
    Toft, Tomas
    Lagendijk, Reginald L.
    [J]. EURASIP JOURNAL ON INFORMATION SECURITY, 2013, (01):
  • [4] A comparison of clustering-based privacy-preserving collaborative filtering schemes
    Bilge, Alper
    Polat, Huseyin
    [J]. APPLIED SOFT COMPUTING, 2013, 13 (05) : 2478 - 2489
  • [5] Privacy-preserving collaborative social networks
    Zhan, Justin
    Blosser, Gary
    Yang, Chris
    Singh, Lisa
    [J]. INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2008, 5075 : 114 - +
  • [6] Privacy-preserving collaborative data mining
    Zhan, J
    Chang, LW
    Matwin, S
    [J]. FOUNDATIONS AND NOVEL APPROACHES IN DATA MINING, 2006, 9 : 213 - +
  • [7] Privacy-preserving distributed collaborative filtering
    Boutet, Antoine
    Frey, Davide
    Guerraoui, Rachid
    Jegou, Arnaud
    Kermarrec, Anne-Marie
    [J]. COMPUTING, 2016, 98 (08) : 827 - 846
  • [8] PRIVACY-PRESERVING COLLABORATIVE DATA MINING
    Zhan, Justin
    [J]. KMIS 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE MANAGEMENT AND INFORMATION SHARING, 2009, : IS15 - IS15
  • [9] Privacy-preserving distributed collaborative filtering
    Antoine Boutet
    Davide Frey
    Rachid Guerraoui
    Arnaud Jégou
    Anne-Marie Kermarrec
    [J]. Computing, 2016, 98 : 827 - 846
  • [10] Privacy-Preserving Enhanced Collaborative Tagging
    Parra-Arnau, Javier
    Perego, Andrea
    Ferrari, Elena
    Forne, Jordi
    Rebollo-Monedero, David
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (01) : 180 - 193