Federated Clouds: A New Metric for Measuring the Quality of Data Anonymization

被引:0
|
作者
Gaye, Youssoupha [1 ]
Mbaye, Maissa [1 ]
Diongue, Dame [1 ]
Dieng, Ousmane [2 ]
Adetiba, Emmanuel [3 ,4 ]
Badejo, Joke A. [3 ]
机构
[1] Gaston Berger Univ, Lab Anal Numer & Informat LANI, CEA MITIC, St Louis 234, PB, Senegal
[2] Univ Pittsburgh, Power Management & Real Time Syst Lab, Pittsburgh, PA 15260 USA
[3] Covenant Univ, Covenant Appl Informat & Commun African Ctr Excel, Ota, Ogun State, Nigeria
[4] Durban Univ Technol, Inst Syst Sci, HRA, ZA-1334 Durban, South Africa
来源
关键词
Data Anonymization Metric; Data Privacy; Federated Cloud Security; Federated Cloud; Cloud Computing; PRIVACY;
D O I
10.1007/978-3-031-62488-9_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated cloud has emerged as solution for cloud service providers to get scalability in serving the growing demand for cloud resources. In a federated cloud, a cloud member can provide service or request it from other cloud provider members in the federation. The federation enables its cloud provider members to be able to satisfy a service beyond the resources they owned by using the resources market in the federation. Data privacy is a major concern in federated clouds. As the privacy regulations and laws of the countries in the federation may vary, it is difficult to assess and confirm that they are in compliance. This makes protecting privacy even more challenging. Privacy management strategies primarily involve anonymization, cryptography, and data splitting. Anonymization is the traditional approach to preserving privacy, which aims at masking the link between the quasi-identifier and sensitive data. The most widely used anonymization techniques are k-anonymity, l-diversity and t-closeness. However, there is a lack of a formal metric to measure the quality of the anonymization process in terms of its ability to prevent re-identification. This paper examines the issue of assessing anonymization quality and introduces a new metric, Mmaq, for this purpose. It can be used to evaluate the anonymization of one or multiple attributes. The metric is a combination of the Shannon index, which measures diversity, and a stabilizer factor, which corrects the Shannon index for pathological cases. The initial results suggest that Mmaq can be used to classify attributes as identifier, quasi-identifier, and anonymous. Furthermore, it can be employed as a Cloud Privacy Policy anonymization compliance checker.
引用
收藏
页码:17 / 30
页数:14
相关论文
共 50 条
  • [1] Federated tool for anonymization and annotation in image data
    van Rooij, Sabina B.
    Bouma, Henri
    van Mil, Jelle
    Ten Hove, Johan-Martijn
    COUNTERTERRORISM, CRIME FIGHTING, FORENSICS, AND SURVEILLANCE TECHNOLOGIES VI, 2022, 12275
  • [2] Synthetic Data for Anonymization in Secure Data Spaces for Federated Learning
    Angulo, Cecilio
    Raya, Cristobal
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 91 - 94
  • [3] A new metric for measuring the visual quality of video watermarks
    Trick, Daniel
    Thiemert, Stefan
    MEDIA WATERMARKING, SECURITY, AND FORENSICS III, 2011, 7880
  • [4] Data On-boarding in Federated Storage Clouds
    Vernik, Gil
    Shulman-Peleg, Alexandra
    Dippl, Sebastian
    Formisano, Ciro
    Jaeger, Michael C.
    Kolodner, Elliot K.
    Villari, Massimo
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 244 - 251
  • [5] Urban Resolution: New Metric for Measuring the Quality of Urban Sensing
    Liu, Liang
    Wei, Wangyang
    Zhao, Dong
    Ma, Huadong
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2015, 14 (12) : 2560 - 2575
  • [6] Data Provenance Management of Bioinformatics Workflows in Federated Clouds
    Wercelens, Polyane
    da Silva, Waldeyr
    Castro, Klayton
    Araujo, Aleteia P. F.
    Lifschitz, Sergio
    Holanda, Maristela
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 750 - 754
  • [7] Storage Policy for Genomic Data in Hybrid Federated Clouds
    Gallon, Ricardo
    Holanda, Maristela
    Araujo, Aleteia
    Walter, Maria E.
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2014, 2014, 8826 : 107 - 114
  • [8] A new metric for measuring metamodels quality-of-fit for deterministic simulations
    Hamad, Husam
    PROCEEDINGS OF THE 2006 WINTER SIMULATION CONFERENCE, VOLS 1-5, 2006, : 882 - 888
  • [9] Anonymization Techniques for Preserving Data Quality in Participatory Sensing
    Sabrina, Tishna
    Murshed, Manzur
    Iqbal, Anindya
    2016 IEEE 41ST CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN), 2016, : 607 - 610
  • [10] Data quality for federated medical data lakes
    Eder, Johann
    Shekhovtsov, Vladimir A.
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2021, 17 (05) : 407 - 426