Dataset Anonyization on Cloud: Open Problems and Perspectives

Cited by: 0
Authors
Cristani, Matteo [1]
Tomazzoli, Claudio [1]
Affiliation
[1] Univ Verona, Dept Comp Sci, Verona, Italy
Source
CURRENT TRENDS IN WEB ENGINEERING, ICWE 2019 INTERNATIONAL WORKSHOPS | 2020, Vol. 11609
Keywords
DATA PROVENANCE; RULES;
DOI
10.1007/978-3-030-51253-8_9
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Data anonymization is the process of transforming the information contained in a group of data so that, after the process, it is not possible to identify unique references to single elements in the group. This action, when conducted on datasets used for statistical inference, is bound to yield analogous behaviours on certain indices before and after the process itself. In this paper we study the pipeline of the anonymization process for datasets when this pipeline is managed on cloud technology, where cryptography is not applicable at all because the datasets are available in an open setting. We examine the open problems and devise a method to address them in a logical framework.
Pages: 74-85
Page count: 12