Score, Arrange, and Cluster: A Novel Clustering-Based Technique for Privacy-Preserving Data Publishing

被引:0
|
作者
Sowmyarani, C. N. [1 ]
Namya, L. G. [1 ]
Nidhi, G. K. [1 ]
Ramakanth Kumar, P. [1 ]
机构
[1] RV Coll Engn, Bengaluru 560059, India
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Data privacy; Publishing; Stakeholders; Clustering algorithms; Data models; Information integrity; Genetic algorithms; Decision making; Homomorphic encryption; Clustering; k-anonymity; data privacy; privacy-preserving data publishing; genetic algorithm; MODEL;
D O I
10.1109/ACCESS.2024.3403372
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data-driven decision-making has become critical to every organization. There is a growing emphasis on adopting robust data governance frameworks for data management. This encompasses data publishing to empower stakeholders with the ability to access and analyze the published data, playing a pivotal role in decision-making. However, data publishing poses a threat to entity-specific information. Privacy-Preserving Data Publishing (PPDP) refers to publishing data while protecting the privacy of entity-specific information. K-anonymity is a well-recognized method that achieves PPDP and serves as the foundation of our proposed clustering-based data transformation algorithm, "Score, Arrange, and Cluster (SAC)". For effective data management and decision-making in organizations, it is crucial to address the varying data requirements and role-based access levels of the involved stakeholders. SAC was designed to offer only a generic data transformation with minimal data quality degradation. Hence, this work presents an enhancement to SAC that takes into account stakeholder roles and requirements, as illustrated through different scenarios. The scoring mechanism in SAC is augmented to accommodate customization or use the concepts of Genetic Algorithms to enforce role-based access control. The "Cost of Degradation" (CoD) metric is used to quantify the data quality degradation. As per the experimental results, in the customization scenario, a higher attribute priority leads to lower data quality degradation, while, in the role-based access control scenario a higher access level results in a lower data quality degradation.
引用
收藏
页码:79861 / 79874
页数:14
相关论文
共 50 条
  • [1] Privacy-preserving data publishing for cluster analysis
    Fung, Benjamin C. M.
    Wang, Ke
    Wang, Lingyu
    Hung, Patrick C. K.
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (06) : 552 - 575
  • [2] A Clustering-Based Privacy-Preserving Method for Uncertain Trajectory Data
    Cai, Zhou-Fu
    Yang, He-Xing
    Shuang, Wang
    Jian, Xu
    Wei, Wang-Ming
    Na, Wu-Li
    2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 1 - 8
  • [3] Privacy-preserving data publishing based on de-clustering
    Wei, Qiong
    Lu, Yansheng
    Lou, Qiang
    7TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE IN CONJUNCTION WITH 2ND IEEE/ACIS INTERNATIONAL WORKSHOP ON E-ACTIVITY, PROCEEDINGS, 2008, : 152 - +
  • [4] A privacy-preserving data publishing algorithm for clustering application
    Chong, Zhihong
    Ni, Weiwei
    Liu, Tengteng
    Zhang, Yong
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (12): : 2083 - 2089
  • [5] Clustering-oriented privacy-preserving data publishing
    Ni, Weiwei
    Chong, Zhihong
    KNOWLEDGE-BASED SYSTEMS, 2012, 35 : 264 - 270
  • [6] An Efficient Clustering-Based Privacy-Preserving Recommender System
    Luo, Junwei
    Yi, Xun
    Han, Fengling
    Yang, Xuechao
    Yang, Xu
    NETWORK AND SYSTEM SECURITY, NSS 2022, 2022, 13787 : 387 - 405
  • [7] Privacy-Preserving Data Publishing
    Liu, Ruilin
    Wang, Hui
    2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 305 - 308
  • [8] Privacy-Preserving Data Publishing
    Chen, Bee-Chung
    Kifer, Daniel
    LeFevre, Kristen
    Machanavajjhala, Ashwin
    FOUNDATIONS AND TRENDS IN DATABASES, 2009, 2 (1-2): : 1 - 167
  • [9] A clustering-based anonymization approach for privacy-preserving in the healthcare cloud
    Abbasi, Afsoon
    Mohammadi, Behnaz
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (01):
  • [10] A comparison of clustering-based privacy-preserving collaborative filtering schemes
    Bilge, Alper
    Polat, Huseyin
    APPLIED SOFT COMPUTING, 2013, 13 (05) : 2478 - 2489