A multi-institution evaluation of clinical profile anonymization

被引:9
|
作者
Heatherly, Raymond [1 ]
Rasmussen, Luke V. [2 ]
Peissig, Peggy L. [3 ]
Pacheco, Jennifer A. [2 ]
Harris, Paul [1 ,4 ]
Denny, Joshua C. [1 ,5 ]
Malin, Bradley A. [1 ,6 ]
机构
[1] Vanderbilt Univ, Dept Biomed Informat, 221 Kirkland Hall, Nashville, TN 37235 USA
[2] Northwestern Univ, Feinberg Sch Med, Chicago, IL 60611 USA
[3] Marshfield Clin Res Fdn, Biomed Informat Res Ctr, Marshfield, WI USA
[4] Vanderbilt Univ, Dept Biomed Engn, Nashville, TN 37235 USA
[5] Vanderbilt Univ, Dept Med, Nashville, TN USA
[6] Vanderbilt Univ, Dept Elect Engn & Comp Sci, 221 Kirkland Hall, Nashville, TN 37235 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
privacy; generalization; secondary use; anonymization; clinical codes; ELECTRONIC MEDICAL-RECORDS; PHENOME-WIDE ASSOCIATION; BIG DATA; PRIVACY;
D O I
10.1093/jamia/ocv154
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Methods: We apply a state-of-the-art k-anonymization algorithm, with k set to the standard value 5, to the International Classification of Disease, ninth edition codes for patients in a hypothyroidism association study at three medical centers: Marshfield Clinic, Northwestern University, and Vanderbilt University. We assess utility when anonymizing at three population levels: all patients in 1) the EHR system; 2) the biorepository; and 3) a hypothyroidism study. We evaluate utility using 1) changes to the number included in the dataset, 2) number of codes included, and 3) regions generalization and suppression were required. Results: Our findings yield several notable results. First, we show that anonymizing in the context of the entire EHR yields a significantly greater quantity of data by reducing the amount of generalized regions from similar to 15% to similar to 0.5%. Second, similar to 70% of codes that needed generalization only generalized two or three codes in the largest anonymization. Conclusions: Sharing large volumes of clinical data in support of phenome-wide association studies is possible while safeguarding privacy to the underlying individuals.
引用
收藏
页码:E131 / E137
页数:7
相关论文
共 50 条
  • [1] RANDOMIZATION WITH STRATIFICATION AND INSTITUTION BALANCE FOR MULTI-INSTITUTION CLINICAL-TRIALS
    BAUER, M
    WALKER, E
    REDMOND, C
    [J]. CONTROLLED CLINICAL TRIALS, 1982, 3 (02): : 151 - 151
  • [2] Surveying a New Multi-Institution Clinical Data Research Network
    Rosenman, Marc
    Madden, Margaret
    Oh, Elissa
    Goel, Satyender
    Kho, Abel
    [J]. MEDINFO 2017: PRECISION HEALTHCARE THROUGH INFORMATICS, 2017, 245 : 935 - 939
  • [3] Multi-institution Evaluation of Adherence to Comprehensive Postoperative VTE Chemoprophylaxis
    Yang, Anthony D.
    Hewitt, Daniel Brock
    Blay, Eddie, Jr.
    Kreutzer, Lindsey J.
    Quinn, Christopher M.
    Cradock, Kimberly A.
    Prachand, Vivek
    Bilimoria, Karl Y.
    [J]. ANNALS OF SURGERY, 2020, 271 (06) : 1072 - 1079
  • [4] DIFFERENTIAL TOXICITY PROFILE OF INDIVIDUAL MEK INHIBITORS: A MULTI-INSTITUTION RETROSPECTIVE REVIEW
    Shin, Jimin
    Zhang, Anthony
    Meyer, Ashley
    Ogle, Andrea
    Shatara, Margaret
    Cluster, Andrew
    Abdelbaki, Mohamed S.
    Navalkele, Pournima
    Faigenbaum, Erica
    Brosius, Stephanie
    Kotch, Chelsea
    Cantor, Evan
    Brossier, Nicole
    [J]. NEURO-ONCOLOGY, 2024, 26
  • [5] An approach for detecting multi-institution attacks
    Saif Zabarah
    Omar Naman
    Mohammad A. Salahuddin
    Raouf Boutaba
    Samer Al-Kiswany
    [J]. Annals of Telecommunications, 2024, 79 : 257 - 270
  • [6] SALG and POGIL: A multi-institution assessment
    Bressette, Andrew R.
    Simons, Emily
    Straumanis, Andrei
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2006, 231
  • [7] A multi-institution evaluation of MLC log files and performance in IMRT delivery
    Kerns, James R.
    Childress, Nathan
    Kry, Stephen F.
    [J]. RADIATION ONCOLOGY, 2014, 9
  • [8] An approach for detecting multi-institution attacks
    Zabarah, Saif
    Naman, Omar
    Salahuddin, Mohammad A.
    Boutaba, Raouf
    Al-Kiswany, Samer
    [J]. ANNALS OF TELECOMMUNICATIONS, 2024, 79 (3-4) : 257 - 270
  • [9] Validation of the prostate tumorlet model based on multi-institution clinical data
    Siebert, F. A.
    Anscher, M. S.
    Hagan, M.
    Kaden, S.
    Todor, D.
    [J]. RADIOTHERAPY AND ONCOLOGY, 2014, 111 : S74 - S75
  • [10] A multi-institution evaluation of MLC log files and performance in IMRT delivery
    James R Kerns
    Nathan Childress
    Stephen F Kry
    [J]. Radiation Oncology, 9