Efficient privacy-preserving whole-genome variant queries

被引:4
|
作者
Akguen, Mete [1 ,2 ,3 ,4 ]
Pfeifer, Nico [2 ,5 ,6 ]
Kohlbacher, Oliver [2 ,3 ,7 ]
机构
[1] Univ Tubingen, Dept Comp Sci, Med Data Privacy & Privacy Preserving ML Healthca, Tubingen, Germany
[2] Univ Tubingen, Inst Bioinformat & Med Informat, Tubingen, Germany
[3] Univ Hosp Tubingen, Translat Bioinformat, Tubingen, Germany
[4] Izmir Inst Technol, Dept Comp Engn, Izmir, Turkey
[5] Univ Tubingen, Dept Comp Sci, Methods Med Informat, Tubingen, Germany
[6] Max Planck Inst Informat, Stat Learning Computat Biol, Saarbrucken, Germany
[7] Univ Tubingen, Appl Bioinformat, Dept Comp Sci, Tubingen, Germany
关键词
D O I
10.1093/bioinformatics/btac070
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Diagnosis and treatment decisions on genomic data have become widespread as the cost of genome sequencing decreases gradually. In this context, disease-gene association studies are of great importance. However, genomic data are very sensitive when compared to other data types and contains information about individuals and their relatives. Many studies have shown that this information can be obtained from the query-response pairs on genomic databases. In this work, we propose a method that uses secure multi-party computation to query genomic databases in a privacy-protected manner. The proposed solution privately outsources genomic data from arbitrarily many sources to the two non-colluding proxies and allows genomic databases to be safely stored in semi-honest cloud environments. It provides data privacy, query privacy and output privacy by using XOR-based sharing and unlike previous solutions, it allows queries to run efficiently on hundreds of thousands of genomic data. Results: We measure the performance of our solution with parameters similar to real-world applications. It is possible to query a genomic database with 3 000 000 variants with five genomic query predicates under 400 ms. Querying 1 048 576 genomes, each containing 1 000 000 variants, for the presence of five different query variants can be achieved approximately in 6 min with a small amount of dedicated hardware and connectivity. These execution times are in the right range to enable real-world applications in medical research and healthcare. Unlike previous studies, it is possible to query multiple databases with response times fast enough for practical application. To the best of our knowledge, this is the first solution that provides this performance for querying large-scale genomic data.
引用
收藏
页码:2202 / 2210
页数:9
相关论文
共 50 条
  • [41] Privacy-Preserving Navigation Supporting Similar Queries in Vehicular Networks
    Li, Meng
    Chen, Yifei
    Zheng, Shuli
    Hu, Donghui
    Lal, Chhagan
    Conti, Mauro
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2022, 19 (02) : 1133 - 1148
  • [42] Privacy-preserving SQL queries on cross-organisation databases
    Han, Ye
    Guo, Xiaojie
    Li, Tong
    Liu, Xiaotao
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2025, 28 (02)
  • [43] Privacy-preserving location-based queries in mobile environments
    Gao, Jiali
    Xiao, Shali
    2008 PROCEEDINGS OF INFORMATION TECHNOLOGY AND ENVIRONMENTAL SYSTEM SCIENCES: ITESS 2008, VOL 1, 2008, : 805 - 812
  • [44] Privacy-Preserving Homomorphic MACs with Efficient Verification
    Li, Shimin
    Wang, Xin
    Zhang, Rui
    WEB SERVICES - ICWS 2018, 2018, 10966 : 100 - 115
  • [45] Efficient Privacy-preserving Aggregation for Mobile Crowdsensing
    Huai, Mengdi
    Huang, Liusheng
    Sun, Yu-e
    Yang, Wei
    PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015, 2015, : 275 - 280
  • [46] An Efficient Privacy-preserving Authentication Protocol in VANETs
    Zhang, Jianhong
    Zhen, Weina
    Xu, Min
    2013 IEEE NINTH INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SENSOR NETWORKS (MSN 2013), 2013, : 272 - 277
  • [47] Efficient Privacy-Preserving Face Identification Protocol
    Huang, Hai
    Wang, Luyao
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (04) : 2632 - 2641
  • [48] An efficient privacy-preserving approach for data publishing
    Xinyu Qian
    Xinning Li
    Zhiping Zhou
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 2077 - 2093
  • [49] Efficient Privacy-Preserving Facial Expression Classification
    Rahulamathavan, Yogachandran
    Rajarajan, Muttukrishnan
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2017, 14 (03) : 326 - 338
  • [50] Accurate and efficient privacy-preserving string matching
    Sirintra Vaiwsri
    Thilina Ranbaduge
    Peter Christen
    International Journal of Data Science and Analytics, 2022, 14 : 191 - 215