Differentially Private Random Forest with High Utility

被引:40
|
作者
Rana, Santu [1 ]
Gupta, Sunil Kumar [1 ]
Venkatesh, Svetha [1 ]
机构
[1] Deakin Univ, Ctr Pattern Recognit & Data Analyt, Geelong, Vic 3217, Australia
关键词
differential privacy; decision trees; random forest; privacy preserving data mining;
D O I
10.1109/ICDM.2015.76
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Privacy-preserving data mining has become an active focus of the research community in the domains where data are sensitive and personal in nature. We propose a novel random forest algorithm under the framework of differential privacy. Unlike previous works that strictly follow differential privacy and keep the complete data distribution approximately invariant to change in one data instance, we only keep the necessary statistics (e.g. variance of the estimate) invariant. This relaxation results in significantly higher utility. To realize our approach, we propose a novel differentially private decision tree induction algorithm and use them to create an ensemble of decision trees. We also propose feasible adversary models to infer about the attribute and class label of unknown data in presence of the knowledge of all other data. Under these adversary models, we derive bounds on the maximum number of trees that are allowed in the ensemble while maintaining privacy. We focus on binary classification problem and demonstrate our approach on four real-world datasets. Compared to the existing privacy preserving approaches we achieve significantly higher utility.
引用
收藏
页码:955 / 960
页数:6
相关论文
共 50 条
  • [1] A differentially private greedy decision forest classification algorithm with high utility
    Guan, Zhitao
    Sun, Xianwen
    Shi, Lingyun
    Wu, Longfei
    Du, Xiaojiang
    COMPUTERS & SECURITY, 2020, 96
  • [2] Utility Analysis of Differentially Private Anonymized Data Based on Random Sampling
    Sugiyama, Takumi
    Oosugi, Hiroto
    Yamanaka, Io
    Minami, Kazuhiro
    PRIVACY IN STATISTICAL DATABASES, PSD 2024, 2024, 14915 : 35 - 47
  • [3] A High-Utility Differentially Private Mechanism for Space Information Networks
    Zhuo, Ming
    Huang, Wen
    Liu, Leyuan
    Zhou, Shijie
    Tian, Zhiwen
    REMOTE SENSING, 2022, 14 (22)
  • [4] A Differentially Private Random Decision Forest Using Reliable Signal-to-Noise Ratios
    Fletcher, Sam
    Islam, Md Zahidul
    AI 2015: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2015, 9457 : 192 - 203
  • [5] DIFFERENTIALLY PRIVATE GREEDY DECISION FOREST
    Xin, Bangzhou
    Yang, Wei
    Wang, Shaowei
    Huang, Liusheng
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2672 - 2676
  • [6] Differentially private SGD with random features
    WANG Yi-guang
    GUO Zheng-chu
    Applied Mathematics:A Journal of Chinese Universities, 2024, 39 (01) : 1 - 23
  • [7] Differentially Private Exponential Random Graphs
    Karwa, Vishesh
    Slavkovic, Aleksandra B.
    Krivitsky, Pavel
    PRIVACY IN STATISTICAL DATABASES, PSD 2014, 2014, 8744 : 143 - 155
  • [8] Differentially private SGD with random features
    Yi-guang Wang
    Zheng-chu Guo
    Applied Mathematics-A Journal of Chinese Universities, 2024, 39 : 1 - 23
  • [9] Differentially private SGD with random features
    Wang, Yi-guang
    Guo, Zheng-chu
    APPLIED MATHEMATICS-A JOURNAL OF CHINESE UNIVERSITIES SERIES B, 2024, 39 (01) : 1 - 23
  • [10] Differential Private Random Forest
    Patil, Abhijit
    Singh, Sanjay
    2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2014, : 2623 - 2630