An efficient privacy-preserving approach for data publishing

被引:0
|
作者
Xinyu Qian
Xinning Li
Zhiping Zhou
机构
[1] Jiangnan University,School of Internet of Things Engineering
关键词
Privacy-preserving; Data publishing; -anonymity; Weighted clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Privacy-preserving algorithm based on k-anonymity plays an outstanding role in real-world data mining applications, such as medical records, bioinformatics, market, and social network. How to maximize the availability of published data without sacrificing users’ privacy is the emphasis of privacy-preserving research. In this paper, we propose a mixed-feature weighted clustering algorithm for k-anonymity (MWCK) to study the contradiction of efficiency and information loss for utility-type anonymization. First, we propose the concept of natural equivalence group, then tuples with same attributes in dataset can be pre-extracted to reduce time complexity and information loss. Second, a sorting algorithm based on the shortest distance is proposed, which selects the optimal initial cluster center at a lower computational cost to reduce the number of iterations. Finally, MWCK not only considers intra-cluster isomorphism to reduce generalization information loss and inter-cluster heterogeneity to avoid local optimal solutions, but also applies to both numerical and categorical datasets. Extensive experiments show that our algorithm can effectively protect data privacy and has better comprehensive performance in terms of information loss and computational complexity than state-of-art methods.
引用
收藏
页码:2077 / 2093
页数:16
相关论文
共 50 条
  • [21] A New Anonymity Model for Privacy-Preserving Data Publishing
    Huang Xuezhen
    Liu Jiqiang
    Han Zhen
    Yang Jun
    [J]. CHINA COMMUNICATIONS, 2014, 11 (09) : 47 - 59
  • [22] STDP: Secure Privacy-Preserving Trajectory Data Publishing
    Eom, Chris Soo-Hyun
    Lee, Wookey
    Leung, Carson Kai-Sang
    [J]. IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 892 - 899
  • [23] Lightweight Privacy-Preserving Raw Data Publishing Scheme
    Chen, Jingxue
    Liu, Gao
    Liu, Yining
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (04) : 2170 - 2174
  • [24] Clustering-oriented privacy-preserving data publishing
    Ni, Weiwei
    Chong, Zhihong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2012, 35 : 264 - 270
  • [25] Suppression techniques for privacy-preserving trajectory data publishing
    Lin, Chen-Yi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 206
  • [26] On Privacy-Preserving Publishing of Spontaneous ADE Reporting Data
    Lin, Wen-Yang
    Yang, Duan-Chun
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [27] Privacy-Preserving Data Publishing Based On Utility Specification
    Tian, Hongwei
    Zhang, Weining
    [J]. 2013 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM), 2013, : 114 - 121
  • [28] Privacy-Preserving Data Publishing: A Survey of Recent Developments
    Fung, Benjamin C. M.
    Wang, Ke
    Chen, Rui
    Yu, Philip S.
    [J]. ACM COMPUTING SURVEYS, 2010, 42 (04)
  • [29] Privacy-preserving governmental data publishing: A fog-computing-based differential privacy approach
    Piao, Chunhui
    Shi, Yajuan
    Yan, Jiaqi
    Zhang, Changyou
    Liu, Liping
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 90 : 158 - 174
  • [30] HyObscure: Hybrid Obscuring for Privacy-Preserving Data Publishing
    Han, Xiao
    Yang, Yuncong
    Wu, Junjie
    Xiong, Hui
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (08) : 3893 - 3905