Hiding sensitive itemsets without side effects

Cited by: 7
Authors
Surendra, H. [1]
Mohan, H. S. [1]
Affiliation
[1] SJB Inst Technol, Dept ISE, Bangalore, Karnataka, India
Keywords
Data sanitization; Itemset hiding; Pattern sanitization; Privacy-preserving data mining (PPDM); Privacy-preserving data publishing (PPDP); PRIVACY;
DOI
10.1007/s10489-018-1329-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data mining techniques are used to discover useful patterns hidden in data. However, these techniques can also extract sensitive information, posing a threat to privacy. Frequent itemset mining is a widely used data mining technique and a pre-processing step for association rule mining. The frequent itemsets it produces may include sensitive itemsets that need to be hidden from adversaries. Traditional data sanitization techniques hide sensitive itemsets by modifying transactions in the database, which causes undesired side effects and information loss. In this paper, we propose a pattern sanitization approach that hides sensitive itemsets for privacy-preserving pattern sharing. The transactional database is modeled as a lossless, compact set of patterns using closed itemsets. The novelty of the proposed technique lies in sanitizing the closed itemsets (patterns) instead of the transactions in the database. The proposed Recursive Pattern Sanitization (RPS) algorithm hides multiple sensitive itemsets, irrespective of their size and support, in a single pass over the closed patterns. The patterns in the sanitized model retain the closedness property, and the model inherently supports finding frequent itemsets and association rules, reducing the mining effort required of the end user. Experimental results show that, compared with other well-known transaction-modification-based itemset hiding techniques, the proposed approach hides sensitive itemsets effectively without side effects or unexpected information loss.
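To illustrate what modeling a transactional database as a lossless set of closed itemsets can mean, here is a minimal Python sketch. It is not the RPS algorithm from the paper, whose details the abstract does not give. The toy database, the brute-force enumeration, and the function names `closed_itemsets` and `support_from_closed` are assumptions made only for this example.

```python
from itertools import combinations

# Toy transactional database (hypothetical data, for illustration only).
transactions = [
    frozenset("abc"),
    frozenset("abc"),
    frozenset("ab"),
    frozenset("a"),
    frozenset("bd"),
]


def closed_itemsets(transactions):
    """Map each closed itemset to its support count.

    The closure of an itemset is the intersection of all transactions
    that contain it; an itemset is closed when it equals its closure.
    Brute force over all item combinations, so only for tiny examples.
    """
    items = sorted(set().union(*transactions))
    closed = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            itemset = frozenset(combo)
            covering = [t for t in transactions if itemset <= t]
            if not covering:
                continue  # itemset never occurs, so it has no closure
            closure = frozenset.intersection(*covering)
            closed[closure] = len(covering)
    return closed


def support_from_closed(itemset, closed):
    """Recover an itemset's support from the closed patterns alone.

    The support equals the largest support among the closed supersets,
    or 0 when no closed pattern contains the itemset.
    """
    return max((s for c, s in closed.items() if itemset <= c), default=0)


closed = closed_itemsets(transactions)
for pattern, count in sorted(closed.items(), key=lambda kv: sorted(kv[0])):
    print("".join(sorted(pattern)), count)
```

In this toy database, nine itemsets occur in at least one transaction, yet five closed patterns are enough to recover the support of each of them through `support_from_closed`. That is the sense in which the closed-itemset model is compact and lossless.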
Pages: 1213-1227
Number of pages: 15
Related papers
50 in total
  • [31] Solving the Sensitive Itemset Hiding Problem Whilst Minimizing Side Effects on a Sanitized Database
    Lee, Guanling
    Chen, Yi-Chun
    Peng, Sheng-Lung
    Lin, Jyun-Hao
    SECURITY-ENRICHED URBAN COMPUTING AND SMART GRID, 2011, 223 : 104 - 113
  • [32] A MaxMin approach for hiding frequent itemsets
    Moustakides, George V.
    Verykios, Vassilios S.
    DATA & KNOWLEDGE ENGINEERING, 2008, 65 (01) : 75 - 89
  • [33] An Efficient Method for Hiding High Utility Itemsets
    Bay Vo
    Lin, Chun-Wei
    Hong, Tzung-Pei
    Vu, Vinh V.
    Minh Nguyen
    Bac Le
    ADVANCED METHODS AND TECHNOLOGIES FOR AGENT AND MULTI-AGENT SYSTEMS, 2013, 252 : 356 - 363
  • [34] Frequent Itemsets Hiding: A Performance Evaluation Framework
    Abul, Osman
    Gokce, Harun
    Sengez, Yagmur
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 666 - 671
  • [35] Fast algorithms for hiding sensitive high-utility itemsets in privacy-preserving utility mining
    Lin, Jerry Chun-Wei
    Wu, Tsu-Yang
    Fournier-Viger, Philippe
    Lin, Guo
    Zhan, Justin
    Voznak, Miroslav
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 55 : 269 - 284
  • [36] NO EFFECTS WITHOUT SIDE EFFECTS!
    Pringle, Andy
    Zwolinsky, Stephen
    PERSPECTIVES IN PUBLIC HEALTH, 2014, 134 (06) : 309 - 309
  • [37] Hiding sensitive frequent itemsets by item removal via two-level multi-objective optimization
    Lefkir, Mira
    Nouioua, Farid
    Fournier-Viger, Philippe
    APPLIED INTELLIGENCE, 2023, 53 (09) : 10027 - 10052
  • [38] Hiding sensitive frequent itemsets by item removal via two-level multi-objective optimization
    Mira Lefkir
    Farid Nouioua
    Philippe Fournier-Viger
    Applied Intelligence, 2023, 53 : 10027 - 10052
  • [39] A max-min approach for hiding frequent itemsets
    Moustakides, George V.
    Verykios, Vassilios S.
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 502 - +
  • [40] SIDE EFFECTS WITHOUT DRUGS
    ANGST, J
    SCHWEIZERISCHE MEDIZINISCHE WOCHENSCHRIFT, 1969, 99 (40) : 1448 - &