SPARK-Based Partitioning Algorithm for k-Anonymization of Large RDFs

被引:0
|
作者
Temuujin, Odsuren [1 ]
Jeon, Minhyuk [1 ]
Seo, Kwangwon [1 ]
Ahn, Jinhyun [2 ]
Im, Dong-Hyuk [1 ]
机构
[1] Hoseo Univ, Dept Comp Engn, Asan, South Korea
[2] Jeju Natl Univ, Dept Management Informat Syst, Jeju, South Korea
基金
新加坡国家研究基金会;
关键词
k-anonymity; Resource description framework; Apache SPARK; Data privacy;
D O I
10.1007/978-981-32-9244-4_41
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Privacy protection for resource description framework data is very important because RDF (i.e., linked data) is widely used in published data format in many areas, including government open data, health-care for individuals, and social relationships. As data can include private information belonging to individuals or companies and can make private information available to third parties, there are several anonymization models provided for preserving privacy in practice. k-anonymity has thus gained attention in research. Recently, several RDF anonymization models have been proposed. However, current approaches focus on a model and a metric for measuring information loss but do not consider large-scale RDF data. In this paper, we propose an efficient anonymizing method for large-scale RDF data. We develop a greedy partitioning algorithm (i.e., SPARK) for RDF anonymization. SPARK is a leading platform for big data processing. The results of experiments on synthetic datasets demonstrate that our proposed method requires less running time than previous methods.
引用
收藏
页码:292 / 298
页数:7
相关论文
共 50 条
  • [1] Genetic algorithm-based clustering approach for k-anonymization
    Lin, Jun-Lin
    Wei, Meng-Cheng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) : 9784 - 9792
  • [2] A weighted K-member clustering algorithm for K-anonymization
    Yan, Yan
    Herman, Eyeleko Anselme
    Mahmood, Adnan
    Feng, Tao
    Xie, Pengshou
    [J]. COMPUTING, 2021, 103 (10) : 2251 - 2273
  • [3] A Top-Down k-Anonymization Implementation for Apache Spark
    Sopaoglu, Ugur
    Abul, Osman
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4513 - 4521
  • [4] Evaluation of Generalization Based K-Anonymization Algorithms
    Patil, Devyani
    Mohapatra, Ramesh K.
    Babu, Korra Sathya
    [J]. 2017 IEEE 3RD INTERNATIONAL CONFERENCE ON SENSING, SIGNAL PROCESSING AND SECURITY (ICSSS), 2017, : 171 - 175
  • [5] A weighted K-member clustering algorithm for K-anonymization
    Yan Yan
    Eyeleko Anselme Herman
    Adnan Mahmood
    Tao Feng
    Pengshou Xie
    [J]. Computing, 2021, 103 : 2251 - 2273
  • [6] Optimization algorithm for k-anonymization of datasets with low information loss
    Keisuke Murakami
    Takeaki Uno
    [J]. International Journal of Information Security, 2018, 17 : 631 - 644
  • [7] Optimization algorithm for k-anonymization of datasets with low information loss
    Murakami, Keisuke
    Uno, Takeaki
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2018, 17 (06) : 631 - 644
  • [8] An Efficient K-anonymization Algorithm Combining C-modes with MDAV
    Han Jian-min
    Yu Juan
    Yu Hui-qun
    Cen Ting-ting
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 257 - +
  • [9] A Spark-Based Artificial Bee Colony Algorithm for Unbalanced Large Data Classification
    Al-Sawwa, Jamil
    Almseidin, Mohammad
    [J]. INFORMATION, 2022, 13 (11)
  • [10] Spark-Based Scalable Algorithm for Link Prediction
    Saketh, K.
    Rajeswari, N. Raja
    Keerthana, M. Krishna
    Shaik, Fathimabi
    [J]. INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 619 - 635