Privacy-preserving data cube for electronic medical records: An experimental evaluation

被引:21
|
作者
Kim, Soohyung [1 ]
Lee, Hyukki [2 ]
Chung, Yon Dohn [2 ]
机构
[1] Korea Univ, Dept IT Convergence, Seoul, South Korea
[2] Korea Univ, Dept Comp Sci & Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Electronic medical records; Data cube; Medical privacy; Anonymization; K-ANONYMITY; ANONYMIZATION;
D O I
10.1016/j.ijmedinf.2016.09.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction: The aim of this study is to evaluate the effectiveness and efficiency of privacy-preserving data cubes of electronic medical records (EMRs). An EMR data cube is a complex of EMR statistics that are summarized or aggregated by all possible combinations of attributes. Data cubes are widely utilized for efficient big data analysis and also have great potential for EMR analysis. For safe data analysis without privacy breaches, we must consider the privacy preservation characteristics of the EMR data cube. In this paper, we introduce a design for a privacy-preserving EMR data cube and the anonymization methods needed to achieve data privacy. We further focus on changes in efficiency and effectiveness that are caused by the anonymization process for privacy preservation. Thus, we experimentally evaluate various types of privacy-preserving EMR data cubes using several practical metrics and discuss the applicability of each anonymization method with consideration for the EMR analysis environment. Methods: We construct privacy-preserving EMR data cubes from anonymized EMR datasets. A real EMR dataset and demographic dataset are used for the evaluation. There are a large number of anonymization methods to preserve EMR privacy, and the methods are classified into three categories (i.e., global generalization, local generalization, and bucketization) by anonymization rules. According to this classification, three types of privacy-preserving EMR data cubes were constructed for the evaluation. We perform a comparative analysis by measuring the data size, cell overlap, and information loss of the EMR data cubes. Results: Global generalization considerably reduced the size of the EMR data cube and did not cause the data cube cells to overlap, but incurred a large amount of information loss. Local generalization maintained the data size and generated only moderate information loss, but there were cell overlaps that could decrease the search performance. Bucketization did not cause cells to overlap and generated little information loss; however, the method considerably inflated the size of the EMR data cubes. Conclusions: The utility of anonymized EMR data cubes varies widely according to the anonymization method, and the applicability of the anonymization method depends on the features of the EMR analysis environment. The findings help to adopt the optimal anonymization method considering the EMR analysis environment and goal of the EMR analysis. (C) 2016 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:33 / 42
页数:10
相关论文
共 50 条
  • [1] Research on Privacy-Preserving Methods of Electronic Medical Records
    Wang, Qingfei
    Zhu, Gen
    Wang, Changbo
    Cheng, Hongping
    [J]. 2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [2] BPDS: A Blockchain based Privacy-Preserving Data Sharing for Electronic Medical Records
    Liu, Jingwei
    Li, Xiaolu
    Ye, Lin
    Zhang, Hongli
    Du, Xiaojiang
    Guizani, Mohsen
    [J]. 2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [3] Privacy-Preserving Data Publishing for Free Text Chinese Electronic Medical Records
    Chen, Lei
    Yang, Ji-Jiang
    Wang, Qing
    [J]. 2012 IEEE 36TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), 2012, : 567 - 572
  • [4] A Privacy-Preserving Quantum Blockchain Technique for Electronic Medical Records
    Venkatesh, Ranjitha
    Hanumantha, Brahmananda Savadatti
    [J]. IEEE Engineering Management Review, 2023, 51 (04): : 137 - 144
  • [5] Privacy-preserving electronic health records
    Demuynck, L
    De Decker, B
    [J]. COMMUNICATIONS AND MULTIMEDIA SECURITY, 2005, 3677 : 150 - 159
  • [6] Data Sharing and Privacy-Preserving of Medical Records Using Blockchain
    Kavathekar, Shraddha Suhas
    Patil, Rahul
    [J]. SUSTAINABLE COMMUNICATION NETWORKS AND APPLICATION, ICSCN 2019, 2020, 39 : 65 - 72
  • [7] Privacy-preserving Electronic Medical Records Sharing Solution Based on Blockchain
    Shao, Mingqiang
    Liu, Momeng
    Wang, Zhenzhen
    [J]. International Journal of Network Security, 2023, 25 (01) : 68 - 75
  • [8] EPPFM: Efficient and Privacy-Preserving Querying of Electronic Medical Records With Forward Privacy in Multiuser Setting
    Xu, Chang
    Chan, Zijian
    Zhu, Liehuang
    Zhang, Can
    Lu, Rongxing
    Guan, Yunguo
    [J]. IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2023, 8 (03): : 492 - 503
  • [9] Privacy-Preserving Integration of Medical Data
    Miyaji, Atsuko
    Nakasho, Kazuhisa
    Nishida, Shohei
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2017, 41 (03)
  • [10] Privacy-Preserving Secure Multiparty Computation on Electronic Medical Records for Star Exchange Topology
    Ahmed M. Tawfik
    Sahar F. Sabbeh
    Tarek EL-Shishtawy
    [J]. Arabian Journal for Science and Engineering, 2018, 43 : 7747 - 7756