Differentially Private Naive Bayes Classification

被引:66
|
作者
Vaidya, Jaideep [1 ]
Basu, Anirban [2 ]
Shafiq, Basit [3 ]
Hong, Yuan [4 ]
机构
[1] Rutgers State Univ, 1 Washington Pk, Newark, NJ 07102 USA
[2] KDDI R&D Lab Inc, Saitama 3568502, Japan
[3] Lahore Univ Management Sci, Lahore 54792, Pakistan
[4] SUNY Albany, Albany, NY 12222 USA
关键词
Differential Privacy; Naive Bayes Classification; NOISE;
D O I
10.1109/WI-IAT.2013.80
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Privacy and security concerns often prevent the sharing of users' data or even of the knowledge gained from it, thus deterring valuable information from being utilized. Privacy-preserving knowledge discovery, if done correctly, can alleviate this problem. One of the most important and widely used data mining techniques is that of classification. We consider the model where a single provider has centralized access to a dataset and would like to release a classifier while protecting privacy to the best extent possible. Recently, the model of differential privacy has been developed which provides a strong privacy guarantee even if adversaries hold arbitrary prior knowledge. In this paper, we apply this rigorous privacy model to develop a Naive Bayes classifier, which is often used as a baseline and consistently provides reasonable classification performance. We experimentally evaluate the proposed approach, and discuss how it could be potentially deployed in PaaS clouds.
引用
收藏
页码:571 / 576
页数:6
相关论文
共 50 条
  • [1] Differentially private Naive Bayes learning over multiple data sources
    Li, Tong
    Li, Jin
    Liu, Zheli
    Li, Ping
    Jia, Chunfu
    [J]. INFORMATION SCIENCES, 2018, 444 : 89 - 104
  • [2] Naive Bayes classification in R
    Zhang, Zhongheng
    [J]. ANNALS OF TRANSLATIONAL MEDICINE, 2016, 4 (12) : 1 - 5
  • [3] Improving naive bayes for classification
    Jiang L.
    Cai Z.
    Wang D.
    [J]. International Journal of Computers and Applications, 2010, 32 (03) : 328 - 332
  • [4] Private naive bayes classification of personal biomedical data: Application in cancer data analysis
    Wood, Alexander
    Shpilrain, Vladimir
    Najarian, Kayvan
    Kahrobaei, Delaram
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 105 : 144 - 150
  • [5] An Improvement to Naive Bayes for Text Classification
    Zhang, Wei
    Gao, Feng
    [J]. CEIS 2011, 2011, 15
  • [6] Structured Features in Naive Bayes Classification
    Choi, Arthur
    Tavabi, Nazgol
    Darwiche, Adnan
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 3233 - 3240
  • [7] Variable selection for Naive Bayes classification
    Blanquero, Rafael
    Carrizosa, Emilio
    Ramirez-Cobo, Pepa
    Remedios Sillero-Denamiel, M.
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2021, 135
  • [8] Survey of improving naive Bayes for classification
    Jiang, Liangxiao
    Wang, Dianhong
    Cai, Zhihua
    Yan, Xuesong
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 134 - +
  • [9] Naive Bayes Approach for Website Classification
    Rajalakshmi, R.
    Aravindan, C.
    [J]. INFORMATION TECHNOLOGY AND MOBILE COMMUNICATION, 2011, 147 : 323 - 326
  • [10] Naive Bayes Classification of Uncertain Data
    Ren, Jiangtao
    Lee, Sau Dan
    Chen, Xianlu
    Kao, Ben
    Cheng, Reynold
    Cheung, David
    [J]. 2009 9TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2009, : 944 - +