Privacy-preserving data mining in electronic surveys

被引:0
|
作者
Zhan, J [1 ]
Matwin, S [1 ]
机构
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON, Canada
关键词
privacy; data mining; randomization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Electronic surveys are an important resource in data mining. However, how to protect respondents' data privacy during the survey is a challenge to the security and privacy community. In this paper, we develop a scheme to solve the problem of privacy-preserving data mining in electronic surveys. We propose a randomized response technique to collect the data from the respondents. We then demonstrate how to perform data mining computations on randomized data. Specifically, we apply our scheme to build a Naive Bayesian classifier from randomized data. Our experimental results indicate that accuracy of classification in our scheme, when private data is protected by randomization, is close to the accuracy of a classifier build from the same data with the total disclosure of private information. Finally, we develop a measure to quantify privacy achieved by our proposed scheme.
引用
收藏
页码:1179 / 1185
页数:7
相关论文
共 50 条
  • [41] Privacy-Preserving Data Mining in Homogeneous Collaborative Clustering
    Ouda, Mohamed
    Salem, Sameh
    Ali, Ihab
    Saad, El-Sayed
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (06) : 604 - 612
  • [42] Rings for Privacy: An Architecture for Large Scale Privacy-Preserving Data Mining
    Merani, Maria Luisa
    Croce, Daniele
    Tinnirello, Ilenia
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (06) : 1340 - 1352
  • [43] Privacy-preserving data mining on data grids in the presence of malicious participants
    Gilburd, B
    Schuster, A
    Wolff, R
    [J]. 13TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 2004, : 225 - 234
  • [44] Random-data perturbation techniques and privacy-preserving data mining
    Kargupta, H
    Datta, S
    Wang, Q
    Sivakumar, K
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (04) : 387 - 414
  • [45] A Kind of Privacy-Preserving Data Mining Algorithm Oriented to Data User
    Cai, Li
    Su, JianYing
    [J]. ADVANCES IN MULTIMEDIA, SOFTWARE ENGINEERING AND COMPUTING, VOL 2, 2011, 129 : 25 - +
  • [46] Distributed anonymous data perturbation method for privacy-preserving data mining
    Li, Feng
    Ma, Jin
    Li, Jian-hua
    [J]. JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2009, 10 (07): : 952 - 963
  • [47] Distributed anonymous data perturbation method for privacy-preserving data mining
    Feng Li
    Jin Ma
    Jian-hua Li
    [J]. Journal of Zhejiang University-SCIENCE A, 2009, 10 : 952 - 963
  • [49] Random-data perturbation techniques and privacy-preserving data mining
    Hillol Kargupta
    Souptik Datta
    Qi Wang
    Krishnamoorthy Sivakumar
    [J]. Knowledge and Information Systems, 2005, 7 : 387 - 414
  • [50] Supporting geospatial privacy-preserving data mining of social media
    Wang, Shuo
    Sinnott, Richard O.
    [J]. SOCIAL NETWORK ANALYSIS AND MINING, 2016, 6 (01)