SPARK-Based Partitioning Algorithm for k-Anonymization of Large RDFs

被引：0

作者：

Temuujin, Odsuren ^{[1
]}

Jeon, Minhyuk ^{[1
]}

Seo, Kwangwon ^{[1
]}

Ahn, Jinhyun ^{[2
]}

Im, Dong-Hyuk ^{[1
]}

机构：

[1] Hoseo Univ, Dept Comp Engn, Asan, South Korea

[2] Jeju Natl Univ, Dept Management Informat Syst, Jeju, South Korea

来源：

ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING | 2020年 / 590卷

基金：

新加坡国家研究基金会;

关键词：

k-anonymity; Resource description framework; Apache SPARK; Data privacy;

D O I：

10.1007/978-981-32-9244-4_41

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Privacy protection for resource description framework data is very important because RDF (i.e., linked data) is widely used in published data format in many areas, including government open data, health-care for individuals, and social relationships. As data can include private information belonging to individuals or companies and can make private information available to third parties, there are several anonymization models provided for preserving privacy in practice. k-anonymity has thus gained attention in research. Recently, several RDF anonymization models have been proposed. However, current approaches focus on a model and a metric for measuring information loss but do not consider large-scale RDF data. In this paper, we propose an efficient anonymizing method for large-scale RDF data. We develop a greedy partitioning algorithm (i.e., SPARK) for RDF anonymization. SPARK is a leading platform for big data processing. The results of experiments on synthetic datasets demonstrate that our proposed method requires less running time than previous methods.

引用

页码：292 / 298

页数：7

共 50 条

[1] Genetic algorithm-based clustering approach for k-anonymization
Lin, Jun-Lin
Wei, Meng-Cheng
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) : 9784 - 9792
[2] A weighted K-member clustering algorithm for K-anonymization
Yan, Yan
Herman, Eyeleko Anselme
Mahmood, Adnan
Feng, Tao
Xie, Pengshou
[J]. COMPUTING, 2021, 103 (10) : 2251 - 2273
[3] A Top-Down k-Anonymization Implementation for Apache Spark
Sopaoglu, Ugur
Abul, Osman
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4513 - 4521
[4] Evaluation of Generalization Based K-Anonymization Algorithms
Patil, Devyani
Mohapatra, Ramesh K.
Babu, Korra Sathya
[J]. 2017 IEEE 3RD INTERNATIONAL CONFERENCE ON SENSING, SIGNAL PROCESSING AND SECURITY (ICSSS), 2017, : 171 - 175
[5] A weighted K-member clustering algorithm for K-anonymization
Yan Yan
Eyeleko Anselme Herman
Adnan Mahmood
Tao Feng
Pengshou Xie
[J]. Computing, 2021, 103 : 2251 - 2273
[6] Optimization algorithm for k-anonymization of datasets with low information loss
Keisuke Murakami
Takeaki Uno
[J]. International Journal of Information Security, 2018, 17 : 631 - 644
[7] Optimization algorithm for k-anonymization of datasets with low information loss
Murakami, Keisuke
Uno, Takeaki
[J]. INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2018, 17 (06) : 631 - 644
[8] An Efficient K-anonymization Algorithm Combining C-modes with MDAV
Han Jian-min
Yu Juan
Yu Hui-qun
Cen Ting-ting
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2008, : 257 - +
[9] A Spark-Based Artificial Bee Colony Algorithm for Unbalanced Large Data Classification
Al-Sawwa, Jamil
Almseidin, Mohammad
[J]. INFORMATION, 2022, 13 (11)
[10] Spark-Based Scalable Algorithm for Link Prediction
Saketh, K.
Rajeswari, N. Raja
Keerthana, M. Krishna
Shaik, Fathimabi
[J]. INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 619 - 635

← 1 2 3 4 5 →