A hybrid multi-group approach for privacy-preserving data mining

被引:17
|
作者
Teng, Zhouxuan [1 ]
Du, Wenliang [1 ]
机构
[1] Syracuse Univ, Dept Elect Engn & Comp Sci, Syracuse, NY 13244 USA
关键词
Privacy; SMC; Randomization; Hybrid;
D O I
10.1007/s10115-008-0158-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a hybrid multi-group approach for privacy preserving data mining. We make two contributions in this paper. First, we propose a hybrid approach. Previous work has used either the randomization approach or the secure multi-party computation (SMC) approach. However, these two approaches have complementary features: the randomization approach is much more efficient but less accurate, while the SMC approach is less efficient but more accurate. We propose a novel hybrid approach, which takes advantage of the strength of both approaches to balance the accuracy and efficiency constraints. Compared to the two existing approaches, our proposed approach can achieve much better accuracy than randomization approach and much reduced computation cost than SMC approach. We also propose a multi-group scheme that makes it flexible for the data miner to control the balance between data mining accuracy and privacy. This scheme is motivated by the fact that existing randomization schemes that randomize data at individual attribute level can produce insufficient accuracy when the number of dimensions is high. We partition attributes into groups, and develop a scheme to conduct group-based randomization to achieve better data mining accuracy. To demonstrate the effectiveness of the proposed general schemes, we have implemented them for the ID3 decision tree algorithm and association rule mining problem and we also present experimental results.
引用
收藏
页码:133 / 157
页数:25
相关论文
共 50 条
  • [31] Privacy-preserving data mining in electronic surveys
    Zhan, J
    Matwin, S
    SHAPING BUSINESS STRATEGY IN A NETWORKED WORLD, VOLS 1 AND 2, PROCEEDINGS, 2004, : 1179 - 1185
  • [32] Privacy-preserving data mining: Developments and directions
    Thuraisingham, B
    JOURNAL OF DATABASE MANAGEMENT, 2005, 16 (01) : 75 - 87
  • [33] Privacy-Preserving Data Publishing in Process Mining
    Rafiei, Majid
    van der Aalst, Wil M. P.
    BUSINESS PROCESS MANAGEMENT FORUM, BPM FORUM 2020, 2020, 392 : 122 - 138
  • [34] Granular Computing in Privacy-Preserving Data Mining
    Zhan, Justin
    Lin, Tsau Young
    2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 86 - +
  • [35] A Knowledge Model Sharing Based Approach to Privacy-Preserving Data Mining
    Tian, Hongwei
    Zhang, Weining
    Xu, Shouhuai
    Sharkey, Patrick
    TRANSACTIONS ON DATA PRIVACY, 2012, 5 (02) : 433 - 467
  • [36] Privacy-Preserving Mining of Decision Trees Using Data Negation Approach
    Dhandhania, R. K.
    Baruah, P. K.
    Mukkamala, R.
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 43 - 48
  • [37] DATA MINING AS A TOOL IN PRIVACY-PRESERVING DATA PUBLISHING
    Sramka, Michal
    NILCRYPT 10, 2010, 45 : 151 - 159
  • [38] A New Hybrid Approach for Privacy Preserving Distributed Data Mining
    Sun, Chongjing
    Gao, Hui
    Zhou, Junlin
    Fu, Yan
    She, Li
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (04): : 876 - 883
  • [39] Privacy-preserving data utilization in hybrid clouds
    Li, Jingwei
    Li, Jin
    Chen, Xiaofeng
    Liu, Zheli
    Jia, Chunfu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 30 : 98 - 106
  • [40] PRELIMINARY DATA ANALYSIS IN HEALTHCARE MULTICENTRIC DATA MINING: A PRIVACY-PRESERVING DISTRIBUTED APPROACH
    Damiani, Andrea
    Masciocchi, Carlotta
    Boldrini, Luca
    Gatta, Roberto
    Dinapoli, Nicola
    Lenkowicz, Jacopo
    Chiloiro, Giuditta
    Gambacorta, Maria Antonietta
    Tagliaferri, Luca
    Autorino, Rosa
    Pagliara, Monica Maria
    Blasi, Maria Antonietta
    van Soest, Johan
    Dekker, Andre
    Valentini, Vincenzo
    JOURNAL OF E-LEARNING AND KNOWLEDGE SOCIETY, 2018, 14 (01): : 71 - 81