Sensitivity-based Anonymization of Big Data

被引:0
|
作者
Al-Zobbi, Mohammed [1 ]
Shahrestani, Seyed [1 ]
Ruan, Chun [1 ]
机构
[1] Western Sydney Univ, Sch Comp Engn & Math, Penrith, NSW, Australia
关键词
Access Control; Anonymization; Big Data; k-anonymity; MapReduce; Sensitivity; PRIVACY PROTECTION; K-ANONYMITY;
D O I
10.1109/LCNW.2016.25
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Data Analytics is widely used as a means of extracting useful information from available data. It is only natural that it is increasingly adapted for processing big data. The rapidly growing demand for big data analytics has several undesirable side-effects. Perhaps, the most significant of those relates to increased risks for data disclosure and privacy violations. Data anonymization can provide promising solutions for minimizing such risks. In this paper, we discuss some of the specific requirements of the anonymization process when dealing with big data. We show that in general, information loss is the result of avoidable generalization of similar or equivalent data. Using these analyses, we propose a novel framework for data anonymization, which expands the k-anonymity properties and concepts and takes the data class values and the sensitivity of data into account. As such, the proposed process can utilize a bottom-up approach, in contrast to most other anonymization methods. The top-down approaches usually generalize all records, the equivalent and the non-equivalent ones. Ours is more methodical, as it avoids the generalization of the equivalent records. With the inclusion of sensitivity levels, we demonstrate that our framework can reduce the iteration steps and the time required to finalize the anonymization, and therefore enhance the overall efficiency of the process
引用
收藏
页码:58 / 64
页数:7
相关论文
共 50 条
  • [1] Towards Optimal Sensitivity-Based Anonymization for Big Data
    Al-Zobbi, Mohammed
    Shahrestani, Seyed
    Ruan, Chun
    [J]. 2017 27TH INTERNATIONAL TELECOMMUNICATION NETWORKS AND APPLICATIONS CONFERENCE (ITNAC), 2017, : 331 - 336
  • [2] Experimenting sensitivity-based anonymization framework in apache spark
    Al-Zobbi, Mohammed
    Shahrestani, Seyed
    Ruan, Chun
    [J]. JOURNAL OF BIG DATA, 2018, 5 (01)
  • [3] Improving MapReduce privacy by implementing multi-dimensional sensitivity-based anonymization
    Al-Zobbi M.
    Shahrestani S.
    Ruan C.
    [J]. Journal of Big Data, 4 (1)
  • [4] A Clustering Based Anonymization Model for Big Data
    Canbay, Yavuz
    Kalyoncu, Aydincan
    Ercimen, Mucahid
    Dogan, Adem
    Sagiroglu, Seref
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 720 - 725
  • [5] Big Data Privacy and Anonymization
    Torra, Vicenc
    Navarro-Arribas, Guillermo
    [J]. PRIVACY AND IDENTITY MANAGEMENT: FACING UP TO NEXT STEPS, 2016, 498 : 15 - 26
  • [6] Big Data Anonymization with Spark
    Canbay, Yavuz
    Sagiroglu, Seref
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 833 - 838
  • [7] Anonymization in the Time of Big Data
    Domingo-Ferrer, Josep
    Soria-Comas, Jordi
    [J]. PRIVACY IN STATISTICAL DATABASES: UNESCO CHAIR IN DATA PRIVACY, 2016, 9867 : 57 - 68
  • [8] DATA COVARIANCE ESTIMATION METHODS FOR SENSITIVITY-BASED DATA ASSESSMENT
    MUIR, DW
    [J]. TRANSACTIONS OF THE AMERICAN NUCLEAR SOCIETY, 1977, 26 : 484 - 485
  • [9] Efficient multimedia big data anonymization
    Sung-Bong Jang
    Young-Woong Ko
    [J]. Multimedia Tools and Applications, 2017, 76 : 17855 - 17872
  • [10] In-Situ Anonymization of Big Data
    Krizan, Tomislav
    Brakus, Marko
    Vukelic, Davorin
    [J]. 2015 8TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2015, : 292 - 298