Systematic Evaluation of Probabilistic k-Anonymity for Privacy Preserving Micro-data Publishing and Analysis

被引:1
|
作者
Senavirathne, Navoda [1 ]
Torra, Vicenc [2 ]
机构
[1] Univ Skovde, Sch Informat, Skovde, Sweden
[2] Univ Umea, Dept Comp Sci, Umea, Sweden
关键词
Data Privacy; Anonymization; Statistical Disclosure Control; Privacy Preserving Machine Learning; MICROAGGREGATION;
D O I
10.5220/0010560703070320
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the light of stringent privacy laws, data anonymization not only supports privacy preserving data publication (PPDP) but also improves the flexibility of micro-data analysis. Machine learning (ML) is widely used for personal data analysis in the present day thus, it is paramount to understand how to effectively use data anonymization in the ML context. In this work, we introduce an anonymization framework based on the notion of "probabilistic k-anonymity" that can be applied with respect to mixed datasets while addressing the challenges brought forward by the existing syntactic privacy models in the context of ML. Through systematic empirical evaluation, we show that the proposed approach can effectively limit the disclosure risk in micro-data publishing while maintaining a high utility for the ML models induced from the anonymized data.
引用
收藏
页码:307 / 320
页数:14
相关论文
共 50 条
  • [1] K-Anonymity for Privacy Preserving Crime Data Publishing in Resource Constrained Environments
    Burke, Mark-John
    Kayem, Anne V. D. M.
    [J]. 2014 28TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2014, : 833 - 840
  • [2] Partial k-Anonymity for Privacy-Preserving Social Network Data Publishing
    Liu, Peng
    Bai, Yan
    Wang, Lie
    Li, Xianxian
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2017, 27 (01) : 71 - 90
  • [3] Privacy preserving data publishing of categorical data through k-anonymity and feature selection
    Aristodimou, Aristos
    Antoniades, Athos
    Pattichis, Constantinos S.
    [J]. HEALTHCARE TECHNOLOGY LETTERS, 2016, 3 (01) : 16 - 21
  • [4] Enhanced P-Sensitive K-Anonymity Models for Privacy Preserving Data Publishing
    Sun, Xiaoxun
    Wang, Hua
    Li, Jiuyong
    Truta, Traian Marius
    [J]. TRANSACTIONS ON DATA PRIVACY, 2008, 1 (02) : 53 - 66
  • [5] A Coalitional Game Theoretic Mechanism for Privacy Preserving Publishing Based on k-Anonymity
    Chakravarthy, Srinivasa L.
    Kumari, Valli V.
    Sarojini, Ch
    [J]. 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 889 - 896
  • [6] A personalized k-anonymity privacy preserving method
    [J]. Xu, Y. (uxyong@nuaa.edu.cn), 1600, Binary Information Press, Flat F 8th Floor, Block 3, Tanner Garden, 18 Tanner Road, Hong Kong (10):
  • [7] Scalable l-Diversity: An Extension to Scalable k-Anonymity for Privacy Preserving Big Data Publishing
    Rao, Udai Pratap
    Mehta, Brijesh B.
    Kumar, Nikhil
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2019, 14 (02) : 27 - 40
  • [8] Privacy-preserving distributed k-anonymity
    Jiang, W
    Clifton, C
    [J]. DATA AND APPLICATIONS SECURITY XIX, PROCEEDINGS, 2005, 3654 : 166 - 177
  • [9] Probabilistic optimal projection partition KD-Tree k-anonymity for data publishing privacy protection
    Wang, Xiaohan
    Luo, Yonglong
    Jiang, Yefeng
    Wu, Wenli
    Yu, Qingying
    [J]. INTELLIGENT DATA ANALYSIS, 2018, 22 (06) : 1415 - 1437
  • [10] User k-anonymity for privacy preserving data mining of query logs
    Navarro-Arribas, Guillermo
    Torra, Vicenc
    Erola, Arnau
    Castella-Roca, Jordi
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2012, 48 (03) : 476 - 487