Privacy-preserving federated genome-wide association studies via dynamic sampling

Cited by: 1
Authors
Wang, Xinyue [1, 4]
Dervishi, Leonard [2]
Li, Wentao [3]
Ayday, Erman [2]
Jiang, Xiaoqian [3]
Vaidya, Jaideep [1]
Affiliations
[1] Rutgers State Univ, Management Sci & Informat Syst Dept, New Brunswick, NJ 07102 USA
[2] Dept Comp & Data Sci, Cleveland, OH 44106 USA
[3] Dept Hlth Data Sci & Artificial Intelligence, Houston, TX 77030 USA
[4] Rutgers State Univ, Management Sci & Informat Syst Dept, 1 Washington Pl, Newark, NJ 07102 USA
Funding
National Science Foundation (USA); National Institutes of Health (USA);
Keywords
LOCI; GWAS;
DOI
10.1093/bioinformatics/btad639
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Motivation: Genome-wide association studies (GWAS) benefit from the increasing availability of genomic data and cross-institution collaborations. However, sharing data across institutional boundaries jeopardizes medical data confidentiality and patient privacy. While modern cryptographic techniques provide formal secure guarantees, the substantial communication and computational overheads hinder the practical application of large-scale collaborative GWAS.

Results: This work introduces an efficient framework for conducting collaborative GWAS on distributed datasets, maintaining data privacy without compromising the accuracy of the results. We propose a novel two-step strategy aimed at reducing communication and computational overheads, and we employ iterative and sampling techniques to ensure accurate results. We instantiate our approach using logistic regression, a commonly used statistical method for identifying associations between genetic markers and the phenotype of interest. We evaluate our proposed methods using two real genomic datasets and demonstrate their robustness in the presence of between-study heterogeneity and skewed phenotype distributions using a variety of experimental settings. The empirical results show the efficiency and applicability of the proposed method and the promise for its application for large-scale collaborative GWAS.

Availability and implementation: The source code and data are available at https://github.com/amioamo/TDS.
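
Illustrative note (not part of the published abstract): the abstract names logistic regression over distributed datasets but does not spell out the procedure. The sketch below shows one generic way such a fit can be distributed, assuming each site shares only its local gradient and information matrix with a coordinator that runs Newton-Raphson updates. It is not the authors' two-step or dynamic-sampling method, it applies no cryptographic or privacy protection to the shared statistics, and every name in it (`local_stats`, `aggregate`, `federated_logreg`, the synthetic sites) is hypothetical.

```python
import numpy as np

def local_stats(X, y, beta):
    """One site's gradient and information matrix for the logistic log-likelihood."""
    p = 1.0 / (1.0 + np.exp(-X @ beta))          # predicted case probability
    grad = X.T @ (y - p)                         # score vector
    info = (X * (p * (1.0 - p))[:, None]).T @ X  # observed information
    return grad, info

def aggregate(sites, beta):
    """Coordinator sums the per-site statistics; raw rows never leave a site."""
    d = beta.shape[0]
    grad, info = np.zeros(d), np.zeros((d, d))
    for X, y in sites:
        g, h = local_stats(X, y, beta)
        grad += g
        info += h
    return grad, info

def federated_logreg(sites, n_iter=25, tol=1e-10):
    """Newton-Raphson on summed statistics; returns coefficients and standard errors."""
    beta = np.zeros(sites[0][0].shape[1])
    for _ in range(n_iter):
        grad, info = aggregate(sites, beta)
        step = np.linalg.solve(info, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    _, info = aggregate(sites, beta)             # information at the final estimate
    se = np.sqrt(np.diag(np.linalg.inv(info)))
    return beta, se

# Synthetic example (assumed data): one SNP coded 0/1/2 plus an intercept, three sites.
rng = np.random.default_rng(0)
sites = []
for n in (200, 300, 500):
    g = rng.binomial(2, 0.3, size=n).astype(float)
    X = np.column_stack([np.ones(n), g])
    prob = 1.0 / (1.0 + np.exp(-(-0.5 + 0.4 * g)))
    y = rng.binomial(1, prob).astype(float)
    sites.append((X, y))

beta_fed, se_fed = federated_logreg(sites)
wald_z = beta_fed[1] / se_fed[1]                 # Wald statistic for the SNP effect

# Reference fit on the pooled rows, for comparison only.
X_all = np.vstack([X for X, _ in sites])
y_all = np.concatenate([y for _, y in sites])
beta_pooled, se_pooled = federated_logreg([(X_all, y_all)])
```

Because the log-likelihood is a sum over individuals, the summed site statistics equal the pooled statistics, so in this sketch the distributed fit matches the pooled fit up to floating-point error. A real deployment would additionally need to protect the exchanged statistics.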
Pages: 9
Related papers
50 in total
  • [1] Privacy-Preserving Data Exploration in Genome-Wide Association Studies
    Johnson, Aaron
    Shmatikov, Vitaly
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 1079 - 1087
  • [2] Scalable privacy-preserving data sharing methodology for genome-wide association studies
    Yu, Fei
    Fienberg, Stephen E.
    Slavkovic, Aleksandra B.
    Uhler, Caroline
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 50 : 133 - 141
  • [3] Towards practical privacy-preserving genome-wide association study
    Bonte, Charlotte
    Makri, Eleftheria
    Ardeshirdavani, Amin
    Simm, Jaak
    Moreau, Yves
    Vercauteren, Frederik
    BMC BIOINFORMATICS, 2018, 19
  • [4] Towards practical privacy-preserving genome-wide association study
    Charlotte Bonte
    Eleftheria Makri
    Amin Ardeshirdavani
    Jaak Simm
    Yves Moreau
    Frederik Vercauteren
    BMC Bioinformatics, 19
  • [5] Realizing privacy preserving genome-wide association studies
    Simmons, Sean
    Berger, Bonnie
    BIOINFORMATICS, 2016, 32 (09) : 1293 - 1300
  • [6] A Hybrid Cloud Deployment Architecture for Privacy-Preserving Collaborative Genome-Wide Association Studies
    Boujdad, Fatima-Zahra
    Niyitegeka, David
    Bellafqira, Reda
    Coatrieux, Gouenou
    Genin, Emmanuelle
    Sudholt, Mario
    DIGITAL FORENSICS AND CYBER CRIME, ICDF2C 2021, 2022, 441 : 342 - 359
  • [7] Large-Scale Privacy-Preserving Statistical Computations for Distributed Genome-Wide Association Studies
    Tkachenko, Oleksandr
    Weinert, Christian
    Schneider, Thomas
    Hamacher, Kay
    PROCEEDINGS OF THE 2018 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIACCS'18), 2018, : 221 - 235
  • [8] A Privacy-Preserving Framework for Conducting Genome-Wide Association Studies Over Outsourced Patient Data
    Zhu, Xiaojie
    Ayday, Erman
    Vitenberg, Roman
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (03) : 2390 - 2405
  • [9] Privacy-preserving genome-wide association studies on cloud environment using fully homomorphic encryption
    Wen-Jie Lu
    Yoshiji Yamada
    Jun Sakuma
    BMC Medical Informatics and Decision Making, 15
  • [10] Privacy-preserving genome-wide association studies on cloud environment using fully homomorphic encryption
    Lu, Wen-Jie
    Yamada, Yoshiji
    Sakuma, Jun
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2015, 15