Data Anonymization: An Experimental Evaluation Using Open-Source Tools

Cited by: 5
Authors
Tomas, Joana [1]
Rasteiro, Deolinda [1]
Bernardino, Jorge [1, 2]
Affiliations
[1] Polytech Coimbra, Inst Engn Coimbra ISEC, Rua Pedro Nunes, P-3030199 Coimbra, Portugal
[2] Univ Coimbra, CISUC, Ctr Informat & Syst, Polo 2, P-3030290 Coimbra, Portugal
Source
FUTURE INTERNET | 2022 / Vol. 14 / Issue 06
Keywords
data anonymization; OSSpal methodology; ARX Data Anonymization tool; Amnesia
DOI
10.3390/fi14060167
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, the use of personal data in marketing, scientific and medical research, and trend forecasting has increased considerably. Governments, companies, and individuals use this information, which should not contain any sensitive information that allows an individual to be identified. Data anonymization is therefore essential: it alters the original data to make identifying an individual difficult. ARX Data Anonymization and Amnesia are two popular open-source tools that simplify this process. In this paper, we evaluate these tools in two ways: with the OSSpal methodology, and experimentally, using a public dataset of the most recent tweets about the Pfizer and BioNTech vaccine. The OSSpal assessment shows that ARX Data Anonymization achieves better results than Amnesia. The experimental evaluation on the public dataset shows that Amnesia has some errors and limitations, although its anonymization process is simpler. ARX Data Anonymization can load large datasets and did not show any errors during the anonymization process. We conclude that ARX Data Anonymization is the recommended tool for data anonymization.
Pages: 20
Related papers
50 in total
  • [1] SecGraph: A Uniform and Open-source Evaluation System for Graph Data Anonymization and De-anonymization
    Ji, Shouling
    Li, Weiqing
    Mittal, Prateek
    Hu, Xin
    Beyah, Raheem
    [J]. PROCEEDINGS OF THE 24TH USENIX SECURITY SYMPOSIUM, 2015, : 303 - 318
  • [2] Open-source tools for data mining
    Zupan, Blaz
    Demsar, Janez
    [J]. CLINICS IN LABORATORY MEDICINE, 2008, 28 (01) : 37 - +
  • [3] Spatial Data Warehouses and SOLAP Using Open-Source Tools
    Bogantes Gonzalez, Diana
    Pandolfi Gonzalez, Leonardo
    [J]. PROCEEDINGS OF THE 2013 XXXIX LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2013,
  • [4] Evaluation of Open-Source Tools for Differential Privacy
    Zhang, Shiliang
    Hagermalm, Anton
    Slavnic, Sanjin
    Schiller, Elad Michael
    Almgren, Magnus
    [J]. SENSORS, 2023, 23 (14)
  • [5] Ransomware Detection Using Open-source Tools
    Lee, Sun-Jin
    Shim, Hye-Yeon
    Lee, Yu-Rim
    Park, Tae-Rim
    Lee, Il-Gu
    [J]. 2022 24TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ARTIFICIAL INTELLIGENCE TECHNOLOGIES TOWARD CYBERSECURITY, 2022, : 1386 - +
  • [6] A Study of Open-Source Data Mining Tools for Forecasting
    Hasim, Nurdatillah
    Abu Haris, Norhaidah
    [J]. ACM IMCOM 2015, PROCEEDINGS, 2015,
  • [7] IoT Design Course using Open-Source Tools
    Papaefstathiou, Ioannis
    [J]. PROCEEDINGS OF 2016 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON2016), 2016, : 114 - 118
  • [8] HYDRODYNAMIC PERFORMANCE EVALUATION OF A SEMI-SUBMERSIBLE FLOATER USING OPEN-SOURCE TOOLS
    Hassani, Milad
    Li, Lin
    Kalofotias, Filippos
    Jiang, Zhiyu
    [J]. PROCEEDINGS OF ASME 2022 41ST INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE & ARCTIC ENGINEERING, OMAE2022, VOL 4, 2022,
  • [9] Open-Source Innovation in Practice: A Lean-Based Development Process Leveraging Open-Source Big Data Tools
    Alonso, Silvio
    Viana, Marx
    Cirilo, Elder
    Alencar, Paulo
    Lucena, Carlos
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4662 - 4671
  • [10] Open-source neurophotonic tools for neuroscience
    Kodandaramaiah, Suhasa B.
    Aharoni, Daniel
    Gibson, Emily A.
    [J]. Neurophotonics, 2024, 11 (03)