Toward Scalable Anonymization for Privacy-Preserving Big Data Publishing

被引:4
|
作者
Mehta, Brijesh B. [1 ]
Rao, Udai Pratap [1 ]
机构
[1] Sardar Vallabhbhai Natl Inst Technol, Surat, India
关键词
Big data; Big data privacy; k-anonymity;
D O I
10.1007/978-981-10-8636-6_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data is collected and processed using different sources and tools, which leads to privacy issues. Privacy-preserving data publishing techniques such as k-anonymity, l-diversity, t-closeness are used to de-identify data, but chances of re-identification are there as data is collected from multiple sources. Due to a large amount of data, less generalization or suppression is required to achieve same level of privacy, which is also known as "large crowd effect," but to handle such a large data for anonymization is also a challenging task. MapReduce handles a large amount of data, but it distributes data into small chunks, so the advantage of large data cannot be achieved. Therefore, scalability of privacy-preserving techniques has become a challenging area of research, and we are trying to explore it by proposing an algorithm for scalable k-anonymity for MapReduce. Based on comparison with existing algorithm, our approach shows significant improvement in running time.
引用
收藏
页码:297 / 304
页数:8
相关论文
共 50 条
  • [1] Privacy-Preserving Big Data Publishing
    Zakerzadeh, Hessam
    Aggarwal, Charu C.
    Barker, Ken
    [J]. PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2015,
  • [2] A Scalable (α, k)-Anonymization Approach using MapReduce for Privacy Preserving Big Data Publishing
    Mehta, Brijesh B.
    Gupta, Ruchika
    Rao, Udai Pratap
    Muthiyan, Mukesh
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [3] Selective Feature Anonymization for Privacy-Preserving Image Data Publishing
    Kim, Taehoon
    Yang, Jihoon
    [J]. ELECTRONICS, 2020, 9 (05)
  • [4] Anonymization-Based Attacks in Privacy-Preserving Data Publishing
    Wong, Raymond Chi-Wing
    Fu, Ada Wai-Chee
    Wang, Ke
    Pei, Jian
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (02):
  • [5] Privacy preserving big data publishing: a scalable k-anonymization approach using MapReduce
    Mehta, Brijesh B.
    Rao, Udai Pratap
    [J]. IET SOFTWARE, 2017, 11 (05) : 271 - 276
  • [6] Improved l-diversity: Scalable anonymization approach for Privacy Preserving Big Data Publishing
    Mehta, Brijesh B.
    Rao, Udai Pratap
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (04) : 1423 - 1430
  • [7] Privacy-Preserving Trajectory Data Publishing by Dynamic Anonymization with Bounded Distortion
    Li, Songyuan
    Tian, Hui
    Shen, Hong
    Sang, Yingpeng
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (02)
  • [8] EDAMS: Efficient Data Anonymization Model Selector for Privacy-Preserving Data Publishing
    Qamar, Tehreem
    Bawany, Narmeen Zakaria
    Khan, Najeed Ahmed
    [J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2020, 10 (02) : 5423 - 5427
  • [9] Scalable privacy-preserving big data aggregation mechanism
    Wu, Dapeng
    Yang, Boran
    Wang, Ruyan
    [J]. DIGITAL COMMUNICATIONS AND NETWORKS, 2016, 2 (03) : 122 - 129
  • [10] Privacy-Preserving Data Publishing
    Liu, Ruilin
    Wang, Hui
    [J]. 2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 305 - 308