An efficient privacy-preserving approach for data publishing

被引:0
|
作者
Xinyu Qian
Xinning Li
Zhiping Zhou
机构
[1] Jiangnan University,School of Internet of Things Engineering
关键词
Privacy-preserving; Data publishing; -anonymity; Weighted clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Privacy-preserving algorithm based on k-anonymity plays an outstanding role in real-world data mining applications, such as medical records, bioinformatics, market, and social network. How to maximize the availability of published data without sacrificing users’ privacy is the emphasis of privacy-preserving research. In this paper, we propose a mixed-feature weighted clustering algorithm for k-anonymity (MWCK) to study the contradiction of efficiency and information loss for utility-type anonymization. First, we propose the concept of natural equivalence group, then tuples with same attributes in dataset can be pre-extracted to reduce time complexity and information loss. Second, a sorting algorithm based on the shortest distance is proposed, which selects the optimal initial cluster center at a lower computational cost to reduce the number of iterations. Finally, MWCK not only considers intra-cluster isomorphism to reduce generalization information loss and inter-cluster heterogeneity to avoid local optimal solutions, but also applies to both numerical and categorical datasets. Extensive experiments show that our algorithm can effectively protect data privacy and has better comprehensive performance in terms of information loss and computational complexity than state-of-art methods.
引用
收藏
页码:2077 / 2093
页数:16
相关论文
共 50 条
  • [1] An efficient privacy-preserving approach for data publishing
    Qian, Xinyu
    Li, Xinning
    Zhou, Zhiping
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (3) : 2077 - 2093
  • [2] Privacy-Preserving Data Publishing
    Liu, Ruilin
    Wang, Hui
    [J]. 2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 305 - 308
  • [3] Privacy-Preserving Data Publishing
    Chen, Bee-Chung
    Kifer, Daniel
    LeFevre, Kristen
    Machanavajjhala, Ashwin
    [J]. FOUNDATIONS AND TRENDS IN DATABASES, 2009, 2 (1-2): : 1 - 167
  • [4] Privacy-Preserving Sequential Data Publishing
    Wang, Huili
    Ma, Wenping
    Zheng, Haibin
    Liang, Zhi
    Wu, Qianhong
    [J]. NETWORK AND SYSTEM SECURITY, NSS 2019, 2019, 11928 : 596 - 614
  • [5] Privacy-Preserving Big Data Publishing
    Zakerzadeh, Hessam
    Aggarwal, Charu C.
    Barker, Ken
    [J]. PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2015,
  • [6] Privacy-Preserving Publishing of Hierarchical Data
    Ozalp, Ismet
    Gursoy, Mehmet Emre
    Nergiz, Mehmet Ercan
    Saygin, Yucel
    [J]. ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2016, 19 (03)
  • [7] A New Approach to Privacy-Preserving Multiple Independent Data Publishing
    Hasan, A. S. M. Touhidul
    Jiang, Qingshan
    Chen, Hui
    Wang, Shengrui
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (05):
  • [8] An efficient method for privacy-preserving trajectory data publishing based on data partitioning
    Songyuan Li
    Hong Shen
    Yingpeng Sang
    Hui Tian
    [J]. The Journal of Supercomputing, 2020, 76 : 5276 - 5300
  • [9] An efficient method for privacy-preserving trajectory data publishing based on data partitioning
    Li, Songyuan
    Shen, Hong
    Sang, Yingpeng
    Tian, Hui
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (07): : 5276 - 5300
  • [10] EDAMS: Efficient Data Anonymization Model Selector for Privacy-Preserving Data Publishing
    Qamar, Tehreem
    Bawany, Narmeen Zakaria
    Khan, Najeed Ahmed
    [J]. ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2020, 10 (02) : 5423 - 5427