Toward Scalable Anonymization for Privacy-Preserving Big Data Publishing

被引:4
|
作者
Mehta, Brijesh B. [1 ]
Rao, Udai Pratap [1 ]
机构
[1] Sardar Vallabhbhai Natl Inst Technol, Surat, India
关键词
Big data; Big data privacy; k-anonymity;
D O I
10.1007/978-981-10-8636-6_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data is collected and processed using different sources and tools, which leads to privacy issues. Privacy-preserving data publishing techniques such as k-anonymity, l-diversity, t-closeness are used to de-identify data, but chances of re-identification are there as data is collected from multiple sources. Due to a large amount of data, less generalization or suppression is required to achieve same level of privacy, which is also known as "large crowd effect," but to handle such a large data for anonymization is also a challenging task. MapReduce handles a large amount of data, but it distributes data into small chunks, so the advantage of large data cannot be achieved. Therefore, scalability of privacy-preserving techniques has become a challenging area of research, and we are trying to explore it by proposing an algorithm for scalable k-anonymity for MapReduce. Based on comparison with existing algorithm, our approach shows significant improvement in running time.
引用
收藏
页码:297 / 304
页数:8
相关论文
共 50 条
  • [21] Privacy-preserving Anonymization of Set-valued Data
    Terrovitis, Manolis
    Mamoulis, Nikos
    Kalnis, Panos
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 115 - 125
  • [22] Slicing-Based Enhanced Method for Privacy-Preserving in Publishing Big Data
    BinJubier, Mohammed
    Ismail, Mohd Arfian
    Ahmed, Abdulghani Ali
    Sadiq, Ali Safaa
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3665 - 3686
  • [23] Anonymization Techniques for Privacy Preserving Data Publishing: A Comprehensive Survey
    Majeed, Abdul
    Lee, Sungchang
    [J]. IEEE ACCESS, 2021, 9 : 8512 - 8545
  • [24] Personalized Privacy-Preserving Trajectory Data Publishing
    Lu Qiwei
    Wang Caimei
    Xiong Yan
    Xia Huihua
    Huang Wenchao
    Gong Xudong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (02) : 285 - 291
  • [25] Privacy-Preserving Continuous Event Data Publishing
    Rafiei, Majid
    van der Aalst, Wil M. P.
    [J]. BUSINESS PROCESS MANAGEMENT FORUM (BPM 2021), 2021, 427 : 178 - 194
  • [26] Privacy-preserving data publishing for cluster analysis
    Fung, Benjamin C. M.
    Wang, Ke
    Wang, Lingyu
    Hung, Patrick C. K.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2009, 68 (06) : 552 - 575
  • [27] δ-Dependency for privacy-preserving XML data publishing
    Landberg, Anders H.
    Nguyen, Kinh
    Pardede, Eric
    Rahayu, J. Wenny
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 50 : 77 - 94
  • [28] An efficient privacy-preserving approach for data publishing
    Xinyu Qian
    Xinning Li
    Zhiping Zhou
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 2077 - 2093
  • [29] An efficient privacy-preserving approach for data publishing
    Qian, Xinyu
    Li, Xinning
    Zhou, Zhiping
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (3) : 2077 - 2093
  • [30] Privacy-Preserving Data Publishing in Process Mining
    Rafiei, Majid
    van der Aalst, Wil M. P.
    [J]. BUSINESS PROCESS MANAGEMENT FORUM, BPM FORUM 2020, 2020, 392 : 122 - 138