Differentially Private Network Data Release via Structural Inference

被引:100
|
作者
Xiao, Qian [1 ]
Chen, Rui [2 ]
Tan, Kian-Lee [1 ,3 ]
机构
[1] Natl Univ Singapore, NGS, Singapore, Singapore
[2] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong, Peoples R China
[3] Natl Univ Singapore, Sch Comp, Singapore, Singapore
关键词
Network data; differential privacy; structural inference;
D O I
10.1145/2623330.2623642
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information networks, such as social media and email networks, often contain sensitive information. Releasing such network data could seriously jeopardize individual privacy. Therefore, we need to sanitize network data before the release. In this paper, we present a novel data sanitization solution that infers a network's structure in a differentially private manner. We observe that, by estimating the connection probabilities between vertices instead of considering the observed edges directly, the noise scale enforced by differential privacy can be greatly reduced. Our proposed method infers the network structure by using a statistical hierarchical random graph (HRG) model. The guarantee of differential privacy is achieved by sampling possible HRG structures in the model space via Markov chain Monte Carlo (MCMC). We theoretically prove that the sensitivity of such inference is only O(log n), where n is the number of vertices in a network. This bound implies less noise to be injected than those of existing works. We experimentally evaluate our approach on four real-life network datasets and show that our solution effectively preserves essential network structural properties like degree distribution, shortest path length distribution and influential nodes.
引用
收藏
页码:911 / 920
页数:10
相关论文
共 50 条
  • [21] Differentially private data release via statistical election to partition sequentiallyStatistical election to partition sequentially
    Claire McKay Bowen
    Fang Liu
    Bingyue Su
    METRON, 2021, 79 : 1 - 31
  • [22] Differentially Private Database Release via Kernel Mean Embeddings
    Balog, Matej
    Tolstikhin, Ilya
    Schoelkopf, Bernhard
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [23] Noise-Aware Statistical Inference with Differentially Private Synthetic Data
    Raisa, Ossi
    Jalko, Joonas
    Kaski, Samuel
    Honkela, Antti
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [24] An optimal differentially private data release mechanism with constrained error
    Hao WANG
    Zhengquan XU
    Xiaoshan ZHANG
    Xiao PENG
    Kaiju LI
    Frontiers of Computer Science, 2022, 16 (01) : 179 - 181
  • [25] An optimal differentially private data release mechanism with constrained error
    Hao Wang
    Zhengquan Xu
    Xiaoshan Zhang
    Xiao Peng
    Kaiju Li
    Frontiers of Computer Science, 2022, 16
  • [26] Differentially Private Real-Time Release of Sequential Data
    Zhang, Xueru
    Khalili, Mohammad Mahdi
    Liu, Mingyan
    ACM TRANSACTIONS ON PRIVACY AND SECURITY, 2023, 26 (01)
  • [27] An optimal differentially private data release mechanism with constrained error
    Wang, Hao
    Xu, Zhengquan
    Zhang, Xiaoshan
    Peng, Xiao
    Li, Kaiju
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (01)
  • [28] Incremental release of differentially-private check-in data
    Riboni, Daniele
    Bettini, Claudio
    PERVASIVE AND MOBILE COMPUTING, 2015, 16 : 220 - 238
  • [29] Differentially Private Synthesis and Sharing of Network Data Via Bayesian Exponential Random Graph Models
    Liu, Fang
    Eugenio, Evercita C.
    Jin, Ick Hoon
    Bowen, Claire Mckay
    JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2022, 10 (03) : 753 - 784
  • [30] Differentially private data release via statistical election to partition sequentially Statistical election to partition sequentially
    Bowen, Claire McKay
    Liu, Fang
    Su, Bingyue
    METRON-INTERNATIONAL JOURNAL OF STATISTICS, 2021, 79 (01): : 1 - 31