Differentially private tree-based redescription mining

Authors
Mihelčić, Matej [1]
Miettinen, Pauli [2]
Affiliations
[1] Univ Zagreb, Dept Math, Zagreb, Croatia
[2] Univ Eastern Finland, Sch Comp, Kuopio, Finland
Keywords
Redescription mining; Differential privacy; Health care informatics; Data perturbation
DOI
10.1007/s10618-023-00934-8
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Differential privacy provides a strong form of privacy while preserving most of the original characteristics of the dataset. Exploiting these benefits requires designing dedicated differentially private data analysis algorithms. In this work, we present three tree-based algorithms for mining redescriptions while preserving differential privacy. Redescription mining is an exploratory data analysis method for finding connections between two views over the same entities, such as phenotypes and genotypes of medical patients. It has applications in many fields, including some, like health care informatics, where privacy-preserving access to data is desired. Our algorithms are the first tree-based differentially private redescription mining algorithms, and we show via experiments that, despite the inherent noise in differential privacy, they can return trustworthy results even on smaller datasets, where noise typically has a stronger effect.
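To make the two ideas in the abstract concrete, the following is a minimal, generic sketch and not the paper's tree-based algorithms. It assumes a redescription is scored by the Jaccard similarity of the entity sets selected by one query per view, and it releases the underlying counts with the standard Laplace mechanism. The function names, the toy entity sets, and the even split of the privacy budget are all illustrative assumptions; the queries are treated as fixed, whereas choosing them privately is the harder problem the paper addresses.

```python
import random

def jaccard(support_a, support_b):
    """Jaccard similarity of two entity sets (1.0 by convention if both are empty)."""
    union = support_a | support_b
    if not union:
        return 1.0
    return len(support_a & support_b) / len(union)

def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponential variates."""
    rate = 1.0 / scale
    return rng.expovariate(rate) - rng.expovariate(rate)

def noisy_count(true_count, epsilon, rng):
    """Laplace mechanism for a counting query, which has sensitivity 1."""
    return true_count + laplace_noise(1.0 / epsilon, rng)

def noisy_jaccard(support_a, support_b, epsilon, rng):
    """Hypothetical helper: spend epsilon/2 on each count, then take the ratio.

    The ratio is post-processing of two private counts, so the total
    privacy cost is epsilon by sequential composition.
    """
    inter = noisy_count(len(support_a & support_b), epsilon / 2.0, rng)
    union = noisy_count(len(support_a | support_b), epsilon / 2.0, rng)
    if union <= 0:
        return 0.0
    # Clamp because noise can push the ratio outside [0, 1].
    return min(1.0, max(0.0, inter / union))

# Toy example: entities selected by a query on each view (made-up IDs).
view1_support = {1, 2, 3, 4}
view2_support = {2, 3, 4, 5}
exact = jaccard(view1_support, view2_support)
private = noisy_jaccard(view1_support, view2_support, epsilon=1.0, rng=random.Random(0))
```

On a dataset this small the noisy score can differ substantially from the exact one, which is the small-data effect the abstract refers to. Floating-point Laplace sampling also has known weaknesses, so this sketch is for illustration only.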
Pages: 1548-1590
Page count: 43
Journal
Data Mining and Knowledge Discovery, 2023, 37: 1548-1590