Geolocated Data Generation and Protection Using Generative Adversarial Networks

被引：2

作者：

Alatrista-Salas, Hugo ^{[1
]}

Montalvo-Garcia, Peter ^{[1
]}

Nunez-del-Prado, Miguel ^{[2
,3
]}

Salas, Julian ^{[4
,5
]}

机构：

[1] Pontificia Univ Catolica Peru, Lima, Peru

[2] Univ Andina Cusco, Inst Invest, Cuzco, Peru

[3] Peru Res Dev & Innovat Ctr, Lima, Peru

[4] Univ Oberta Catalunya UOC, Internet Interdisciplinary Inst IN3, Barcelona, Spain

[5] Ctr Cybersecur Res Catalonia CYBERCAT, Barcelona, Spain

来源：

MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2022 | 2022年 / 13408卷

关键词：

Differential privacy; Generative Adversarial Networks; Disclosure risk; Information loss; Synthetic trajectories; Privacy; PRIVACY;

D O I：

10.1007/978-3-031-13448-7_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data mining techniques allow us to discover patterns in large datasets. Nonetheless, data may contain sensitive information. This is especially true when data is georeferenced. Thus, an adversary could learn about individual whereabouts, points of interest, political affiliation, and even sexual habits. At the same time, human mobility is a rich source of information to analyze traffic jams, health care accessibility, food desserts, and even pandemics dynamics. Therefore, to enhance privacy, we study the use of Deep Learning techniques such as Generative Adversarial Network (GAN) and GAN with Differential Privacy (DP-GAN) to generate synthetic data with formal privacy guarantees. Our experiments demonstrate that we can generate synthetic data to maintain individuals' privacy and data quality depending on privacy parameters. Accordingly, based on the privacy settings, we generated data differing a few meters and a few kilometers from the original trajectories. After generating fine-grain mobility trajectories at the GPS level through an adversarial neural networks approach and using GAN to sanitize the original trajectories together with differential privacy, we analyze the privacy provided from the perspective of anonymization literature. We show that such epsilon-differentially private data may still have a risk of re-identification.

引用

页码：80 / 91

页数：12

共 50 条

[31] Energy data generation with Wasserstein Deep Convolutional Generative Adversarial Networks
Li, Jianbin
Chen, Zhiqiang
Cheng, Long
Liu, Xiufeng
ENERGY, 2022, 257
[32] Red blood cell image generation for data augmentation using Conditional Generative Adversarial Networks
Bailo, Oleksandr
Ham, DongShik
Shin, Young Min
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1039 - 1048
[33] TCAC-GAN: Synthetic Trajectory Generation Model Using Auxiliary Classifier Generative Adversarial Networks for Improved Protection of Trajectory Data
Shin, Jihwan
Song, Yeji
Ahn, Jinhyun
Lee, Taewhi
2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 314 - 315
[34] Land Clutter Data Generation Using Generative Adversarial Network
Dang, Xunwang
Chen, Yong
Wang, Chao
Yin, Hongcheng
Xu, Honglei
2020 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO 2020), 2020,
[35] Biomedical Data Augmentation Using Generative Adversarial Neural Networks
Calimeri, Francesco
Marzullo, Aldo
Stamile, Claudio
Terracina, Giorgio
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 626 - 634
[36] Data Preprocessing for Soft Sensor using Generative Adversarial Networks
Wang, Xiao
2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 1355 - 1360
[37] SEQUENTIAL IOT DATA AUGMENTATION USING GENERATIVE ADVERSARIAL NETWORKS
Tschuchnig, Maximilian Ernst
Ferner, Cornelia
Wegenkittl, Stefan
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4212 - 4216
[38] Realistic Data Synthesis Using Enhanced Generative Adversarial Networks
Baowaly, Mrinal Kanti
Liu, Chao-Lin
Chen, Kuan-Ta
2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, : 289 - 292
[39] Efficient Approaches for Data Augmentation by Using Generative Adversarial Networks
Saha, Pretom Kumar
Logofatu, Doina
ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EAAAI/EANN 2022, 2022, 1600 : 386 - 399
[40] Synthesizing credit data using autoencoders and generative adversarial networks
Oreski, Goran
KNOWLEDGE-BASED SYSTEMS, 2023, 274

← 1 2 3 4 5 →