Score, Arrange, and Cluster: A Novel Clustering-Based Technique for Privacy-Preserving Data Publishing

被引:0
|
作者
Sowmyarani, C. N. [1 ]
Namya, L. G. [1 ]
Nidhi, G. K. [1 ]
Ramakanth Kumar, P. [1 ]
机构
[1] RV Coll Engn, Bengaluru 560059, India
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Data privacy; Publishing; Stakeholders; Clustering algorithms; Data models; Information integrity; Genetic algorithms; Decision making; Homomorphic encryption; Clustering; k-anonymity; data privacy; privacy-preserving data publishing; genetic algorithm; MODEL;
D O I
10.1109/ACCESS.2024.3403372
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data-driven decision-making has become critical to every organization. There is a growing emphasis on adopting robust data governance frameworks for data management. This encompasses data publishing to empower stakeholders with the ability to access and analyze the published data, playing a pivotal role in decision-making. However, data publishing poses a threat to entity-specific information. Privacy-Preserving Data Publishing (PPDP) refers to publishing data while protecting the privacy of entity-specific information. K-anonymity is a well-recognized method that achieves PPDP and serves as the foundation of our proposed clustering-based data transformation algorithm, "Score, Arrange, and Cluster (SAC)". For effective data management and decision-making in organizations, it is crucial to address the varying data requirements and role-based access levels of the involved stakeholders. SAC was designed to offer only a generic data transformation with minimal data quality degradation. Hence, this work presents an enhancement to SAC that takes into account stakeholder roles and requirements, as illustrated through different scenarios. The scoring mechanism in SAC is augmented to accommodate customization or use the concepts of Genetic Algorithms to enforce role-based access control. The "Cost of Degradation" (CoD) metric is used to quantify the data quality degradation. As per the experimental results, in the customization scenario, a higher attribute priority leads to lower data quality degradation, while, in the role-based access control scenario a higher access level results in a lower data quality degradation.
引用
收藏
页码:79861 / 79874
页数:14
相关论文
共 50 条
  • [21] Privacy-Preserving Medical Reports Publishing for Cluster Analysis
    Hmood, Ali K.
    Fung, Benjamin C. M.
    Iqbal, Farkhund
    2014 6TH INTERNATIONAL CONFERENCE ON NEW TECHNOLOGIES, MOBILITY AND SECURITY (NTMS), 2014,
  • [22] Privacy-preserving clustering of data streams
    Chao, Ching-Ming
    Chen, Po-Zung
    Sun, Chu-Hao
    Tamkang Journal of Science and Engineering, 2010, 13 (03): : 349 - 358
  • [23] Anonymization-Based Attacks in Privacy-Preserving Data Publishing
    Wong, Raymond Chi-Wing
    Fu, Ada Wai-Chee
    Wang, Ke
    Pei, Jian
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2009, 34 (02):
  • [24] Personalized Privacy-Preserving Trajectory Data Publishing
    Lu Qiwei
    Wang Caimei
    Xiong Yan
    Xia Huihua
    Huang Wenchao
    Gong Xudong
    CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (02) : 285 - 291
  • [25] An efficient privacy-preserving approach for data publishing
    Xinyu Qian
    Xinning Li
    Zhiping Zhou
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 2077 - 2093
  • [26] Privacy-Preserving Clustering of Data Streams
    Chao, Ching-Ming
    Chen, Po-Zung
    Sun, Chu-Hao
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2010, 13 (03): : 349 - 358
  • [27] Privacy-Preserving Continuous Event Data Publishing
    Rafiei, Majid
    van der Aalst, Wil M. P.
    BUSINESS PROCESS MANAGEMENT FORUM (BPM 2021), 2021, 427 : 178 - 194
  • [28] δ-Dependency for privacy-preserving XML data publishing
    Landberg, Anders H.
    Nguyen, Kinh
    Pardede, Eric
    Rahayu, J. Wenny
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 50 : 77 - 94
  • [29] An efficient privacy-preserving approach for data publishing
    Qian, Xinyu
    Li, Xinning
    Zhou, Zhiping
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (3) : 2077 - 2093
  • [30] Privacy-Preserving Data Publishing in Process Mining
    Rafiei, Majid
    van der Aalst, Wil M. P.
    BUSINESS PROCESS MANAGEMENT FORUM, BPM FORUM 2020, 2020, 392 : 122 - 138