A Random Decision Tree Framework for Privacy-Preserving Data Mining

Cited by: 76
Authors
Vaidya, Jaideep [1 ]
Shafiq, Basit [2 ]
Fan, Wei [3 ]
Mehmood, Danish [2 ]
Lorenzi, David [1 ]
Affiliations
[1] Rutgers State Univ, MSIS Dept, Newark, NJ 07102 USA
[2] Lahore Univ Management Sci, CS Dept, Lahore 54792, Pakistan
[3] Huawei Noahs Ark Lab, Shatin, Hong Kong, Peoples R China
Keywords
Privacy-preserving data mining; classification
DOI
10.1109/TDSC.2013.43
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Distributed data is ubiquitous in modern information-driven applications. With multiple sources of data, the natural challenge is to determine how to collaborate effectively across proprietary organizational boundaries while maximizing the utility of collected information. Since using only local data gives suboptimal utility, techniques for privacy-preserving collaborative knowledge discovery must be developed. Existing cryptography-based work for privacy-preserving data mining is still too slow to be effective for large-scale data sets to face today's big data challenge. Previous work on random decision trees (RDT) shows that it is possible to generate equivalent and accurate models with much smaller cost. We exploit the fact that RDTs can naturally fit into a parallel and fully distributed architecture, and develop protocols to implement privacy-preserving RDTs that enable general and efficient distributed privacy-preserving knowledge discovery.
Pages: 399-411
Number of pages: 13
Related papers
50 in total
  • [1] Privacy-preserving decision tree mining based on random substitutions
    Dowd, Jim
    Xu, Shouhuai
    Zhang, Weining
    [J]. EMERGING TRENDS IN INFORMATION AND COMMUNICATION SECURITY, PROCEEDINGS, 2006, 3995 : 145 - 159
  • [2] Fuzzy Random Decision Tree (FRDT) Framework for Privacy Preserving Data Mining
    Sumalatha, L.
    Sankar, P. Uma
    [J]. PROCEEDINGS OF THE 2016 SAI COMPUTING CONFERENCE (SAI), 2016, : 195 - 202
  • [3] Privacy-preserving data mining
    Agrawal, R
    Srikant, R
    [J]. SIGMOD RECORD, 2000, 29 (02) : 439 - 450
  • [4] Random-data perturbation techniques and privacy-preserving data mining
    Kargupta, H
    Datta, S
    Wang, Q
    Sivakumar, K
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (04) : 387 - 414
  • [5] A New Scheme on Privacy-preserving Distributed Decision-tree Mining
    Fang, Weiwei
    Yang, Bingru
    Song, Dingli
    Tang, Zhigang
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 517 - +
  • [6] Random-data perturbation techniques and privacy-preserving data mining
    Hillol Kargupta
    Souptik Datta
    Qi Wang
    Krishnamoorthy Sivakumar
    [J]. Knowledge and Information Systems, 2005, 7 : 387 - 414
  • [7] A tree-based data perturbation approach for privacy-preserving data mining
    Li, Xiao-Bai
    Sarkar, Sumit
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (09) : 1278 - 1283
  • [8] A Review on Privacy-Preserving Data Mining
    Li, Xueyun
    Yan, Zheng
    Zhang, Peng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2014, : 769 - 774
  • [9] Privacy-preserving collaborative data mining
    Zhan, J
    Chang, LW
    Matwin, S
    [J]. FOUNDATIONS AND NOVEL APPROACHES IN DATA MINING, 2006, 9 : 213 - +
  • [10] PRIVACY-PRESERVING COLLABORATIVE DATA MINING
    Zhan, Justin
    [J]. KMIS 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE MANAGEMENT AND INFORMATION SHARING, 2009, : IS15 - IS15