A Hybrid Protection Method to Enhance Data Utility while Preserving the Privacy of Medical Patients Data Publishing

被引:0
|
作者
Jeba, Shermina [1 ]
BinJubier, Mohammed [2 ]
Ismail, Mohd Arfian [2 ]
Krishnan, Reshmy [1 ]
Nair, Sarachandran [1 ]
Narasimhan, Girija [3 ]
机构
[1] Muscat Coll, Dept Comp, Muscat, Oman
[2] Univ Malaysia Pahang, Fac Comp, Kuantan, Pahang, Malaysia
[3] Univ Technol & Appl Sci, Informat Technol Dept, Muscat, Oman
关键词
Medical patients data publishing; anonymization; protection method for preserving the privacy; BIG DATA; COMPOSITION ATTACKS; K-ANONYMITY; CHALLENGES;
D O I
10.14569/IJACSA.2022.0131194
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Medical patient data need to be published and made available to researchers so that they can use, analyse, and evaluate the data effectively. However, publishing medical patient data raises privacy concerns regarding protecting sensitive data while preserving the utility of the released data. The privacy -preserving data publishing (PPDP) process attempts to keep public data useful without risking the medical patients' pri-vacy. Through protection methods like perturbing, suppressing, or generalizing values, which lead to uncertainty in identity inference or sensitive value estimation, the PPDP aims to reduce the risks of patient data being disclosed and to preserve the potential use of published data. Although this method is helpful, information loss is inevitable when attempting to achieve a high level of privacy using protection methods. In addition, the privacy-preserving techniques may affect the use of data, resulting in imprecise or even impractical knowledge extraction. Thus, balancing privacy and utility in medical patient data is essential. This study proposed an innovative technique that used a hybrid protection method for utility enhancement while preserving medical patients' data privacy. The utilized technique could partition information horizontally and vertically, resulting in data being grouped into columns and equivalence classes. Then, the attributes assumed to be easily known by any attacker are determined by upper and lower protection levels (UPL and LPL). This work also depends on making the false matches and value swapping to make sure that the attribute disclosure is less likely to happen. The innovative technique makes data more useful. According to the results, the innovative technique delivers about 93.4% data utility when the percentage of exchange level is 5% using LPL and 95% using UPL with a 4.5K medical patient dataset. In conclusion, the innovative technique has minimized risk disclosure compared to other existing works.
引用
收藏
页码:808 / 821
页数:14
相关论文
共 50 条
  • [1] Privacy protection data publishing method for data privacy differences
    Yu Y.
    Zhou D.
    Li H.
    Wu X.
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2020, 48 (09): : 57 - 63
  • [2] An Improved Data Sanitization Algorithm for Privacy Preserving Medical Data Publishing
    Zaman, A. N. K.
    Obimbo, Charlie
    Dara, Rozita A.
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2017, 2017, 10233 : 64 - 70
  • [3] Privacy-Preserving Data Publishing Based On Utility Specification
    Tian, Hongwei
    Zhang, Weining
    [J]. 2013 ASE/IEEE INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING (SOCIALCOM), 2013, : 114 - 121
  • [4] Preserving Data Privacy in Speech Data Publishing
    孙佳鑫
    蒋进
    赵萍
    [J]. Journal of Donghua University(English Edition), 2020, 37 (04) : 293 - 297
  • [5] Modeling Background Knowledge for Privacy Preserving Medical Data Publishing
    Wang, Eric Ke
    Jia, Binfeng
    Ke, Nie
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS, ELECTRONICS AND CONTROL (ICCSEC), 2017, : 136 - 141
  • [6] HyObscure: Hybrid Obscuring for Privacy-Preserving Data Publishing
    Han, Xiao
    Yang, Yuncong
    Wu, Junjie
    Xiong, Hui
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (08) : 3893 - 3905
  • [7] Utility-Friendly Heterogenous Generalization in Privacy Preserving Data Publishing
    He, Xianmang
    Li, Dong
    Hao, Yanni
    Chen, Huahui
    [J]. CONCEPTUAL MODELING, 2014, 8824 : 186 - 194
  • [8] A New Approach to Utility-based Privacy Preserving in Data Publishing
    Vural, Yilmaz
    Aydos, Murat
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2017, : 204 - 209
  • [9] Privacy and Utility Preserving Trajectory Data Publishing for Intelligent Transportation Systems
    Liu, Xiangwen
    Zhu, Yuquan
    [J]. IEEE ACCESS, 2020, 8 : 176454 - 176466
  • [10] Privacy-Preserving Data Publishing
    Liu, Ruilin
    Wang, Hui
    [J]. 2010 IEEE 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDE 2010), 2010, : 305 - 308