On Sampling, Anonymization, and Differential Privacy Or, K-Anonymization Meets Differential Privacy

被引：0

作者：

Li, Ninghui ^{[1
]}

Qardaji, Wahbeh ^{[1
]}

Su, Dong ^{[1
]}

机构：

[1] Purdue Univ, 305 N Univ St, W Lafayette, IN 47907 USA

来源：

7TH ACM SYMPOSIUM ON INFORMATION, COMPUTER AND COMMUNICATIONS SECURITY (ASIACCS 2012) | 2012年

基金：

美国国家科学基金会;

关键词：

Differential Privacy; Anonymization; Data Privacy; ANONYMITY;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

This paper aims at answering the following two questions in privacy-preserving data analysis and publishing: What formal privacy guarantee (if any) does k-anonymization provide? How can we benefit from the adversary's uncertainty about the data? We have found that random sampling provides a connection that helps answer these two questions, as sampling can create uncertainty. The main result of the paper is that k-anonymization, when done "safely", and when preceded with a random sampling step, satisfies (epsilon, delta)-differential privacy with reasonable parameters. This result illustrates that "hiding in a crowd of k" indeed offers some privacy guarantees. We point out, however, that almost all existing k-anonymization algorithms in the literature are not "safe". Regarding the second question, we provide both positive and negative results. On the positive side, we show that adding a random-sampling pre-processing step to a differentially-private algorithm can greatly amplify the level of privacy protection. Hence, when given a dataset resulted from sampling, one can utilize a much large privacy budget. On the negative side, any privacy notion that takes advantage of the adversary's uncertainty, likely does not compose.

引用

页数：11

共 50 条

[11] Balanced k-Anonymization
Al-Fedaghi, Sabah S.
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 6, 2005, : 179 - 182
[12] K-anonymization revisited
Gionis, Aristides
Mazza, Arnon
Tassa, Tamir
2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 744 - +
[13] On Distributed k-Anonymization
Zhong, Sheng
FUNDAMENTA INFORMATICAE, 2009, 92 (04) : 411 - 431
[14] Adaptive Privacy Preservation Approach for Big Data Publishing in Cloud using k-anonymization
Madan S.
Goswami P.
Recent Advances in Computer Science and Communications, 2021, 14 (08) : 2678 - 2688
[15] Anonymization Level and Compliance for Differential Privacy: A Systematic Literature Review
Prokhorenkov, Dmitry
2022 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2022, : 1119 - 1124
[16] Thoughts on k-anonymization
Nergiz, M. Ercan
Clifton, Chris
DATA & KNOWLEDGE ENGINEERING, 2007, 63 (03) : 622 - 645
[17] A Scalable K-Anonymization Solution for Preserving Privacy in an Aging-in-Place Welfare Intercloud
Chakravorty, Antorweep
Wlodarczyk, Tomasz Wiktor
Rong, Chunming
2014 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E), 2014, : 424 - 431
[18] K-Anonymization approach for privacy preservation using data perturbation techniques in data mining
Kiran, Ajmeera
Shirisha, N.
MATERIALS TODAY-PROCEEDINGS, 2022, 64 : 578 - 584
[19] K-Anonymization approach for privacy preservation using data perturbation techniques in data mining
Kiran, Ajmeera
Shirisha, N.
MATERIALS TODAY-PROCEEDINGS, 2022, 64 : 578 - 584
[20] Privacy preserving big data publishing: a scalable k-anonymization approach using MapReduce
Mehta, Brijesh B.
Rao, Udai Pratap
IET SOFTWARE, 2017, 11 (05) : 271 - 276

← 1 2 3 4 5 →