Geolocated Data Generation and Protection Using Generative Adversarial Networks

被引:2
|
作者
Alatrista-Salas, Hugo [1 ]
Montalvo-Garcia, Peter [1 ]
Nunez-del-Prado, Miguel [2 ,3 ]
Salas, Julian [4 ,5 ]
机构
[1] Pontificia Univ Catolica Peru, Lima, Peru
[2] Univ Andina Cusco, Inst Invest, Cuzco, Peru
[3] Peru Res Dev & Innovat Ctr, Lima, Peru
[4] Univ Oberta Catalunya UOC, Internet Interdisciplinary Inst IN3, Barcelona, Spain
[5] Ctr Cybersecur Res Catalonia CYBERCAT, Barcelona, Spain
关键词
Differential privacy; Generative Adversarial Networks; Disclosure risk; Information loss; Synthetic trajectories; Privacy; PRIVACY;
D O I
10.1007/978-3-031-13448-7_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data mining techniques allow us to discover patterns in large datasets. Nonetheless, data may contain sensitive information. This is especially true when data is georeferenced. Thus, an adversary could learn about individual whereabouts, points of interest, political affiliation, and even sexual habits. At the same time, human mobility is a rich source of information to analyze traffic jams, health care accessibility, food desserts, and even pandemics dynamics. Therefore, to enhance privacy, we study the use of Deep Learning techniques such as Generative Adversarial Network (GAN) and GAN with Differential Privacy (DP-GAN) to generate synthetic data with formal privacy guarantees. Our experiments demonstrate that we can generate synthetic data to maintain individuals' privacy and data quality depending on privacy parameters. Accordingly, based on the privacy settings, we generated data differing a few meters and a few kilometers from the original trajectories. After generating fine-grain mobility trajectories at the GPS level through an adversarial neural networks approach and using GAN to sanitize the original trajectories together with differential privacy, we analyze the privacy provided from the perspective of anonymization literature. We show that such epsilon-differentially private data may still have a risk of re-identification.
引用
收藏
页码:80 / 91
页数:12
相关论文
共 50 条
  • [31] Energy data generation with Wasserstein Deep Convolutional Generative Adversarial Networks
    Li, Jianbin
    Chen, Zhiqiang
    Cheng, Long
    Liu, Xiufeng
    ENERGY, 2022, 257
  • [32] Red blood cell image generation for data augmentation using Conditional Generative Adversarial Networks
    Bailo, Oleksandr
    Ham, DongShik
    Shin, Young Min
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1039 - 1048
  • [33] TCAC-GAN: Synthetic Trajectory Generation Model Using Auxiliary Classifier Generative Adversarial Networks for Improved Protection of Trajectory Data
    Shin, Jihwan
    Song, Yeji
    Ahn, Jinhyun
    Lee, Taewhi
    2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 314 - 315
  • [34] Land Clutter Data Generation Using Generative Adversarial Network
    Dang, Xunwang
    Chen, Yong
    Wang, Chao
    Yin, Hongcheng
    Xu, Honglei
    2020 IEEE MTT-S INTERNATIONAL CONFERENCE ON NUMERICAL ELECTROMAGNETIC AND MULTIPHYSICS MODELING AND OPTIMIZATION (NEMO 2020), 2020,
  • [35] Biomedical Data Augmentation Using Generative Adversarial Neural Networks
    Calimeri, Francesco
    Marzullo, Aldo
    Stamile, Claudio
    Terracina, Giorgio
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 626 - 634
  • [36] Data Preprocessing for Soft Sensor using Generative Adversarial Networks
    Wang, Xiao
    2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 1355 - 1360
  • [37] SEQUENTIAL IOT DATA AUGMENTATION USING GENERATIVE ADVERSARIAL NETWORKS
    Tschuchnig, Maximilian Ernst
    Ferner, Cornelia
    Wegenkittl, Stefan
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4212 - 4216
  • [38] Realistic Data Synthesis Using Enhanced Generative Adversarial Networks
    Baowaly, Mrinal Kanti
    Liu, Chao-Lin
    Chen, Kuan-Ta
    2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, : 289 - 292
  • [39] Efficient Approaches for Data Augmentation by Using Generative Adversarial Networks
    Saha, Pretom Kumar
    Logofatu, Doina
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EAAAI/EANN 2022, 2022, 1600 : 386 - 399
  • [40] Synthesizing credit data using autoencoders and generative adversarial networks
    Oreski, Goran
    KNOWLEDGE-BASED SYSTEMS, 2023, 274