Federated Clouds: A New Metric for Measuring the Quality of Data Anonymization

被引:0
|
作者
Gaye, Youssoupha [1 ]
Mbaye, Maissa [1 ]
Diongue, Dame [1 ]
Dieng, Ousmane [2 ]
Adetiba, Emmanuel [3 ,4 ]
Badejo, Joke A. [3 ]
机构
[1] Gaston Berger Univ, Lab Anal Numer & Informat LANI, CEA MITIC, St Louis 234, PB, Senegal
[2] Univ Pittsburgh, Power Management & Real Time Syst Lab, Pittsburgh, PA 15260 USA
[3] Covenant Univ, Covenant Appl Informat & Commun African Ctr Excel, Ota, Ogun State, Nigeria
[4] Durban Univ Technol, Inst Syst Sci, HRA, ZA-1334 Durban, South Africa
来源
关键词
Data Anonymization Metric; Data Privacy; Federated Cloud Security; Federated Cloud; Cloud Computing; PRIVACY;
D O I
10.1007/978-3-031-62488-9_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated cloud has emerged as solution for cloud service providers to get scalability in serving the growing demand for cloud resources. In a federated cloud, a cloud member can provide service or request it from other cloud provider members in the federation. The federation enables its cloud provider members to be able to satisfy a service beyond the resources they owned by using the resources market in the federation. Data privacy is a major concern in federated clouds. As the privacy regulations and laws of the countries in the federation may vary, it is difficult to assess and confirm that they are in compliance. This makes protecting privacy even more challenging. Privacy management strategies primarily involve anonymization, cryptography, and data splitting. Anonymization is the traditional approach to preserving privacy, which aims at masking the link between the quasi-identifier and sensitive data. The most widely used anonymization techniques are k-anonymity, l-diversity and t-closeness. However, there is a lack of a formal metric to measure the quality of the anonymization process in terms of its ability to prevent re-identification. This paper examines the issue of assessing anonymization quality and introduces a new metric, Mmaq, for this purpose. It can be used to evaluate the anonymization of one or multiple attributes. The metric is a combination of the Shannon index, which measures diversity, and a stabilizer factor, which corrects the Shannon index for pathological cases. The initial results suggest that Mmaq can be used to classify attributes as identifier, quasi-identifier, and anonymous. Furthermore, it can be employed as a Cloud Privacy Policy anonymization compliance checker.
引用
收藏
页码:17 / 30
页数:14
相关论文
共 50 条
  • [21] A big video data transcoding service for social media over federated clouds
    Panarello, Alfonso
    Celesti, Antonio
    Fazio, Maria
    Puliafito, Antonio
    Villari, Massimo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) : 9037 - 9061
  • [22] A big video data transcoding service for social media over federated clouds
    Alfonso Panarello
    Antonio Celesti
    Maria Fazio
    Antonio Puliafito
    Massimo Villari
    Multimedia Tools and Applications, 2020, 79 : 9037 - 9061
  • [23] A new metric for categorical data
    Al-Harbi, SH
    McKeown, GP
    Rayward-Smith, VJ
    STATISTICAL DATA MINING AND KNOWLEDGE DISCOVERY, 2004, : 339 - 351
  • [24] Data Quality Control Based on Metric Data Models
    Koeppen, Veit
    Lenz, Hans-J
    FRONTIERS IN STATISTICAL QUALITY CONTROL 9, 2010, : 263 - +
  • [25] SegGuard: Segmentation-Based Anonymization of Network Data in Clouds for Privacy-Preserving Security Auditing
    Oqaily, Momen
    Jarraya, Yosr
    Mohammady, Meisam
    Majumdar, Suryadipta
    Pourzandi, Makan
    Wang, Lingyu
    Debbabi, Mourad
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (05) : 2486 - 2505
  • [26] MEASURING DATA QUALITY IN VORTALS
    Caro, Angelica
    Angeles Moraga, Ma
    Moraga, Carmen
    Calero, Coral
    ICSOFT 2009: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL 2, 2009, : 217 - +
  • [27] Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task
    Kang, Juyeon
    Park, Jungyeul
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 4432 - 4436
  • [28] Federated data warehousing application framework and platform-as-a-services to model virtual data marts in the clouds
    Nguyen, Thanh Binh, 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (08):
  • [29] Research on Data Quality Governance for Federated Cooperation Scenarios
    Shen, Junxin
    Zhou, Shuilan
    Xiao, Fanghao
    ELECTRONICS, 2024, 13 (18)
  • [30] Nordcan.R: a new tool for federated analysis and quality assurance of cancer registry data
    Laronningen, Siri
    Skog, Anna
    Engholm, Gerda
    Ferlay, Jacques
    Johannesen, Tom Borge
    Kristiansen, Marnar Fridheim
    Knoors, Daan
    Konig, Simon Mathis
    Olafsdottir, Elinborg J. J.
    Pejicic, Sasha
    Pettersson, David
    Skovlund, Charlotte Wessel
    Storm, Hans H. H.
    Tian, Huidong
    Aagnes, Bjarte
    Miettinen, Joonas
    FRONTIERS IN ONCOLOGY, 2023, 13