Protecting Genomic Data Privacy with Probabilistic Modeling

被引:0
|
作者
Simmons, Sean [1 ]
Berger, Bonnie [2 ,3 ]
Sahinalp, Cenk [4 ]
机构
[1] Broad Inst, Stanley Ctr, Cambriadge, MA 02142 USA
[2] MIT, CSAIL, Cambriadge, MA 02142 USA
[3] MIT, Dept Math, Cambriadge, MA 02142 USA
[4] Indiana Univ, Dept Comp Sci, Bloomington, IN 47405 USA
基金
加拿大自然科学与工程研究理事会; 美国国家卫生研究院;
关键词
Genomic Privacy; GWAS; MCMC; LIMITS;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The proliferation of sequencing technologies in biomedical research has raised many new privacy concerns. These include concerns over the publication of aggregate data at a genomic scale (e.g. minor allele frequencies, regression coefficients). Methods such as differential privacy can overcome these concerns by providing strong privacy guarantees, but come at the cost of greatly perturbing the results of the analysis of interest. Here we investigate an alternative approach for achieving privacy-preserving aggregate genomic data sharing without the high cost to accuracy of differentially private methods. In particular, we demonstrate how other ideas from the statistical disclosure control literature (in particular, the idea of disclosure risk) can be applied to aggregate data to help ensure privacy. This is achieved by combining minimal amounts of perturbation with Bayesian statistics and Markov Chain Monte Carlo techniques. We test our technique on a GWAS dataset to demonstrate its utility in practice. An implementation is available at https://github.com/seanken/PrivMCMC.
引用
收藏
页码:403 / 414
页数:12
相关论文
共 50 条
  • [1] Probabilistic Topic Modeling for Genomic Data Interpretation
    Chen, Xin
    Hu, Xiaohua
    Shen, Xiajiong
    Rosen, Gail
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 149 - 152
  • [2] Approaches for protecting privacy in the genomic era
    Lin, Z
    Owen, AB
    Altman, RB
    [J]. GENETIC ENGINEERING NEWS, 2004, 24 (17): : 6 - +
  • [3] A probabilistic homomorphic encryption algorithm over integers - protecting data privacy in clouds
    Yeh, Jyh-Haw
    [J]. IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 653 - 656
  • [4] Privacy-preserving data sharing via probabilistic modeling
    Jalko, Joonas
    Lagerspetz, Eemil
    Haukka, Jari
    Tarkoma, Sasu
    Honkela, Antti
    Kaski, Samuel
    [J]. PATTERNS, 2021, 2 (07):
  • [5] Sharing data - protecting privacy
    不详
    [J]. R&D MAGAZINE, 2006, 48 (06): : 14 - 14
  • [6] Protecting Privacy and Security of Genomic Data in i2b2 with Homomorphic Encryption and Differential Privacy
    Raisaro, Jean Louis
    Choi, Gwangbae
    Pradervand, Sylvain
    Colsenet, Raphael
    Jacquemont, Nathalie
    Rosat, Nicolas
    Mooser, Vincent
    Hubaux, Jean-Pierre
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (05) : 1413 - 1426
  • [7] Protecting aggregate genomic data
    Zerhouni, Elias A.
    Nabel, Elizabeth G.
    [J]. SCIENCE, 2008, 322 (5898) : 44 - 44
  • [8] Protecting Data Buyer Privacy in Data Markets
    Zhang, Minxing
    Pei, Jian
    [J]. IEEE INTERNET COMPUTING, 2024, 28 (04) : 14 - 20
  • [9] A Sequence Obfuscation Method for Protecting Personal Genomic Privacy
    Wan, Shibiao
    Wang, Jieqiong
    [J]. FRONTIERS IN GENETICS, 2022, 13
  • [10] Protecting privacy in a clinical data warehouse
    Kong, Guilan
    Xiao, Zhichun
    [J]. HEALTH INFORMATICS JOURNAL, 2015, 21 (02) : 93 - 106