Histogram Publishing Algorithm Based on Sampling Sorting and Greedy Clustering

Cited by: 1
Authors
Wu, Xiaonian [1]
Tong, Nian [1]
Ye, Zhibo [1]
Wang, Yujue [1]
Affiliations
[1] Guilin Univ Elect Technol, Guangxi Key Lab Cryptog & Informat Secur, Guilin, Peoples R China
Keywords
Differential privacy; Histogram publishing; Roulette sampling; Greedy clustering; Grouping error;
DOI
10.1007/978-981-15-2777-7_7
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Data released by grouping-based differentially private histogram publishing algorithms have low usability because of large approximation error and Laplace error. To address this problem, a histogram publishing algorithm based on roulette sampling sorting and greedy partitioning is proposed. The algorithm combines the exponential mechanism with roulette sampling sorting: through the utility function and a restriction on the number of sampled entities, similar histogram bins are placed next to each other with higher probability. A greedy clustering algorithm then partitions the sorted bins into groups, and the error among the bins within each group is reduced by optimizing the lower bound of the grouping error. Extensive experimental results show that the proposed algorithm effectively improves the usability of the published data while satisfying differential privacy.
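The two stages named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not give the utility function, the sampling restriction, or the lower-bound error, so everything specific here is an assumption. The names `roulette_pick`, `sample_sort`, and `greedy_groups` are hypothetical, the utility is assumed to be the negative bin count (sensitivity 1), the privacy budget is assumed to be split evenly across draws, and the grouping cost is assumed to be the within-group squared error plus a Laplace variance term of 2 / (epsilon^2 * group size). The greedy step reads the raw counts, so on its own it carries no privacy guarantee.

```python
import math
import random


def roulette_pick(scores, epsilon, sensitivity=1.0, rng=random):
    """Exponential mechanism via roulette-wheel sampling: index i is drawn
    with probability proportional to exp(epsilon * score_i / (2 * sensitivity))."""
    top = max(scores)  # shift scores so the largest weight is exactly 1.0
    weights = [math.exp(epsilon * (s - top) / (2.0 * sensitivity)) for s in scores]
    r = rng.random() * sum(weights)
    acc = 0.0
    for i, w in enumerate(weights):
        acc += w
        if r < acc:
            return i
    return len(weights) - 1  # guard against floating-point round-off


def sample_sort(counts, epsilon, rng=random):
    """Noisy ascending ordering of bin indices (assumed utility: -count).
    The budget is split evenly over the draws (an assumption)."""
    remaining = list(range(len(counts)))
    eps_per_draw = epsilon / max(len(counts), 1)
    order = []
    while remaining:
        scores = [-counts[i] for i in remaining]
        order.append(remaining.pop(roulette_pick(scores, eps_per_draw, rng=rng)))
    return order


def _cost(values, epsilon):
    """Assumed group error: squared deviation from the group mean plus the
    Laplace variance left after averaging one noisy sum over the group."""
    mean = sum(values) / len(values)
    sse = sum((v - mean) ** 2 for v in values)
    return sse + 2.0 / (epsilon ** 2 * len(values))


def greedy_groups(order, counts, epsilon):
    """Scan the ordered bins; a bin joins the current group only if that
    costs no more than opening a new single-bin group."""
    groups = []
    for b in order:
        if groups:
            current = [counts[i] for i in groups[-1]]
            merged = _cost(current + [counts[b]], epsilon)
            split = _cost(current, epsilon) + _cost([counts[b]], epsilon)
            if merged <= split:
                groups[-1].append(b)
                continue
        groups.append([b])
    return groups
```

A call such as `greedy_groups(sample_sort(counts, 1.0), counts, 1.0)` chains the two stages. With a small budget the ordering is close to random, which is the trade-off the sampling step has to manage.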
Pages: 81-91
Number of pages: 11
Related papers
50 in total
  • [1] Signal Sorting Algorithm Based on Extended Histogram
    Wu, Hao
    Liu, Zhangmeng
    Yan, Xingwei
    Liu, Zheng
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 674 - 678
  • [2] Algorithm for TDOA Sorting Based on Clustering
    Li, Tingjun
    Liu, Youming
    Che, Zhiyu
    Qu, Changwen
    Zhang, Yang
    MODERN TECHNOLOGIES IN MATERIALS, MECHANICS AND INTELLIGENT SYSTEMS, 2014, 1049 : 1308 - 1311
  • [3] An Efficient Clustering Algorithm Based on Histogram Threshold
    Shieh, Shu-Ling
    Lin, Tsu-Chun
    Szu, Yu-Chin
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2012), PT II, 2012, 7197 : 32 - 39
  • [4] A greedy clustering and scheduling algorithm
    Ruan, YL
    Zhang, JJ
    Li, QH
    Yang, SD
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 247 - 250
  • [5] Greedy Sampling for Approximate Clustering in the Presence of Outliers
    Bhaskara, Aditya
    Vadgama, Sharvaree
    Xu, Hong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [6] Data publishing Anonymity Algorithm Research Based on Clustering
    Yang, Yu
    Zhang, Longjun
    PROCEEDINGS OF THE 2016 INTERNATIONAL FORUM ON MANAGEMENT, EDUCATION AND INFORMATION TECHNOLOGY APPLICATION, 2016, 47 : 758 - 762
  • [7] A signal sorting algorithm based on time difference of arrival histogram
    Radar Research, Leihua Electronic Technology Institute, AVIC, Beijing 100012, China
    Dianzi Yu Xinxi Xuebao, 11 (2762-2768)
  • [8] Clustering Algorithm of Similarity Segmentation based on Point Sorting
    Li, Hanbing
    Wang, Yan
    Huang, Lan
    Li, Mingda
    Sun, Ying
    Zhang, Hanyuan
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE (LEMCS 2015), 2015, 117 : 475 - 482
  • [9] A Variable Bin Width Histogram based Image Clustering Algorithm
    Gao, Song
    Zhang, Chengcui
    Chen, Wei-Bang
    2010 IEEE FOURTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2010), 2010, : 166 - 171
  • [10] Histogram-based colour image fuzzy clustering algorithm
    Hai-peng Chen
    Xuan-Jing Shen
    Jian-Wu Long
    Multimedia Tools and Applications, 2016, 75 : 11417 - 11432