PACAS: Privacy-Aware, Data Cleaning-as-a-Service

Authors
Huang, Yu [1]
Milani, Mostafa [1]
Chiang, Fei [1]
Affiliations
[1] McMaster Univ, Hamilton, ON, Canada
Keywords
data quality; data cleaning; data privacy
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Data cleaning consumes up to 80% of the data analysis pipeline. This is a significant overhead for organizations where data cleaning is still a manually driven process requiring domain expertise. Recent advances have fueled a new computing paradigm called Database-as-a-Service, where data management tasks are outsourced to large service providers. We propose a new Data Cleaning-as-a-Service model that allows a client to interact with a data cleaning provider who hosts curated and sensitive data. We present PACAS: a Privacy-Aware data Cleaning-As-a-Service framework that facilitates communication between the client and the service provider via a data pricing scheme in which clients issue queries and the service provider returns clean answers for a price while protecting her data. We propose a practical privacy model for such interactive settings, called (X,Y,L)-anonymity, that extends existing data publishing techniques to consider the data semantics while protecting sensitive values. Our evaluation over real data shows that PACAS effectively safeguards semantically related sensitive values and provides improved accuracy over existing privacy-aware cleaning techniques.
Pages: 1023-1030
Page count: 8