Privacy protection and aggregate health data: a review of tabular cell suppression methods (not) employed in public health data systems

被引:4
|
作者
Matthews, Gregory J. [1 ]
Harel, Ofer [2 ]
Aseltine, Robert H., Jr. [3 ]
机构
[1] Loyola Univ, Dept Math & Stat, Chicago, IL 60611 USA
[2] Univ Connecticut, Dept Stat, Storrs, CT 06269 USA
[3] Univ Connecticut, Ctr Hlth, Div Behav Sci & Community Hlth, Publ Hlth Res Inst, E Hartford, CT USA
关键词
Privacy; Confidentiality; Health policy; Public health; Data sharing;
D O I
10.1007/s10742-016-0162-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Public health research often relies on individuals' confidential medical data. Therefore, data collecting entities, such as states, seek to disseminate this medical data as widely as possible while still maintaining the privacy of the individual for legal and ethical reasons. One common way in which this medical data is released is through the use of Web-based Data Query Systems (WDQS). In this article, we examined WDQS listed in the National Association for Public Health Statistics and Information Systems (NAPHSIS) specifically reviewing them for how they prevent statistical disclosure in queries that produce a tabular response. One of the most common methods to combat this type of disclosure is through the use of suppression, that is, if a cell count in a table is below a certain threshhold, the true value is suppressed. This technique does work to prevent the direct disclosure of small cell counts, however, primary suppression by itself is not always enough to preserve privacy in tabular data. Here, we present several real examples of tabular response queries that employ suppression, but we are able to infer the values of the suppressed cells, including cells with 1 counts, which could be linked to auxiliary data sources and thus has the possibility to create an identity disclosure. We seek to stimulate awareness of the potential for disclosure of information that individuals may wish to keep private through an online query system. This research is undertaken in the hope that privacy concerns can be dealt with preemptively rather than only after a major disclosure has taken place. In the wake of a such an event, a major concern is that state and local officials would react to this by permanently shutting down these sites and cutting off a valuable source of research data.
引用
收藏
页码:258 / 270
页数:13
相关论文
共 50 条
  • [1] Data protection: balancing personal privacy and public health
    不详
    [J]. LANCET RESPIRATORY MEDICINE, 2016, 4 (01): : 1 - 1
  • [2] CELL SUPPRESSION - DISCLOSURE PROTECTION FOR SENSITIVE TABULAR DATA
    KELLY, JP
    GOLDEN, BL
    ASSAD, AA
    [J]. NETWORKS, 1992, 22 (04) : 397 - 417
  • [3] Privacy Protection and Secondary Use of Health Data: Strategies and Methods
    Xiang, Dingyi
    Cai, Wei
    [J]. BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [4] A Review of Data Quality Assessment Methods for Public Health Information Systems
    Chen, Hong
    Hailey, David
    Wang, Ning
    Yu, Ping
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2014, 11 (05) : 5170 - 5207
  • [5] Methods for assessing the quality of data in public health information systems: A critical review
    Chen, Hong
    Yu, Ping
    Hailey, David
    Wang, Ning
    [J]. INVESTING IN E-HEALTH: PEOPLE, KNOWLEDGE AND TECHNOLOGY FOR A HEALTHY FUTURE, 2014, 204 : 13 - 18
  • [6] Synthetic data generation for tabular health records: A review
    Hernandez, Mikel
    Epelde, Gorka
    Alberdi, Ane
    Cilla, Rodrigo
    Rankin, Debbie
    [J]. NEUROCOMPUTING, 2022, 493 : 28 - 45
  • [7] Digital public health: data protection and data security
    Kunz, Thomas
    Lange, Benjamin
    Selzer, Annika
    [J]. BUNDESGESUNDHEITSBLATT-GESUNDHEITSFORSCHUNG-GESUNDHEITSSCHUTZ, 2020, 63 (02) : 206 - 214
  • [8] Revisiting Interval Protection, a.k.a. Partial Cell Suppression, for Tabular Data
    Castro, Jordi
    Via, Anna
    [J]. PRIVACY IN STATISTICAL DATABASES: UNESCO CHAIR IN DATA PRIVACY, 2016, 9867 : 3 - 14
  • [9] Ethical Collection, Storage, and Use of Public Health Data A Proposal for a National Privacy Protection
    Lee, Lisa M.
    Gostin, Lawrence O.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2009, 302 (01): : 82 - 84
  • [10] Data Privacy Protection: A Study on Students Awareness of Personal Data Privacy Protection in an E-Health Environment
    Mohamed, Alif Aiman Bin Seeni
    Chen, Lim Fung
    [J]. ADVANCED SCIENCE LETTERS, 2017, 23 (06) : 5299 - 5303