Privacy-preserving federated genome-wide association studies via dynamic sampling

被引:1
|
作者
Wang, Xinyue [1 ,4 ]
Dervishi, Leonard [2 ]
Li, Wentao [3 ]
Ayday, Erman [2 ]
Jiang, Xiaoqian [3 ]
Vaidya, Jaideep [1 ]
机构
[1] Rutgers State Univ, Management Sci & Informat Syst Dept, New Brunswick, NJ 07102 USA
[2] Dept Comp & Data Sci, Cleveland, OH 44106 USA
[3] Dept Hlth Data Sci & Artificial Intelligence, Houston, TX 77030 USA
[4] Rutgers State Univ, Management Sci & Informat Syst Dept, 1 Washington Pl, Newark, NJ 07102 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
LOCI; GWAS;
D O I
10.1093/bioinformatics/btad639
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Genome-wide association studies (GWAS) benefit from the increasing availability of genomic data and cross-institution collaborations. However, sharing data across institutional boundaries jeopardizes medical data confidentiality and patient privacy. While modern cryptographic techniques provide formal secure guarantees, the substantial communication and computational overheads hinder the practical application of large-scale collaborative GWAS.Results This work introduces an efficient framework for conducting collaborative GWAS on distributed datasets, maintaining data privacy without compromising the accuracy of the results. We propose a novel two-step strategy aimed at reducing communication and computational overheads, and we employ iterative and sampling techniques to ensure accurate results. We instantiate our approach using logistic regression, a commonly used statistical method for identifying associations between genetic markers and the phenotype of interest. We evaluate our proposed methods using two real genomic datasets and demonstrate their robustness in the presence of between-study heterogeneity and skewed phenotype distributions using a variety of experimental settings. The empirical results show the efficiency and applicability of the proposed method and the promise for its application for large-scale collaborative GWAS.Availability and implementation The source code and data are available at https://github.com/amioamo/TDS.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Federated learning for privacy-preserving AI
    Cheng, Yong
    Liu, Yang
    Chen, Tianjian
    Yang, Qiang
    COMMUNICATIONS OF THE ACM, 2020, 63 (12) : 33 - 36
  • [32] Privacy-Preserving and Reliable Federated Learning
    Lu, Yi
    Zhang, Lei
    Wang, Lulu
    Gao, Yuanyuan
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2021, PT III, 2022, 13157 : 346 - 361
  • [33] Efficient Genome-Wide, Privacy-Preserving Similar Patient Query based on Private Edit Distance
    Wang, Xiao Shaun
    Huang, Yan
    Zhao, Yongan
    Tang, Haixu
    Wang, XiaoFeng
    Bu, Diyue
    CCS'15: PROCEEDINGS OF THE 22ND ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2015, : 492 - 503
  • [34] An Efficient and Dynamic Privacy-Preserving Federated Learning System for Edge Computing
    Tang, Xinyu
    Guo, Cheng
    Choo, Kim-Kwang Raymond
    Liu, Yining
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 207 - 220
  • [35] FedGRU: Privacy-preserving Traffic Flow Prediction via Federated Learning
    Liu, Yi
    Zhang, Shuyu
    Zhang, Chenhan
    Yu, James J. Q.
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [36] Privacy-Preserving Anomaly Detection in Cloud Manufacturing Via Federated Transformer
    Ma, Shiyao
    Nie, Jiangtian
    Kang, Jiawen
    Lyu, Lingjuan
    Liu, Ryan Wen
    Zhao, Ruihui
    Liu, Ziyao
    Niyato, Dusit
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (12) : 8977 - 8987
  • [37] Poster: Towards Privacy-Preserving Federated Recommendation via Synthetic Interactions
    Ariyarathna, Thirasara
    Kanhere, Salil S.
    Paik, Hye-Young
    PROCEEDINGS 45TH IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS, SPW 2024, 2024, : 297 - 297
  • [38] CryptoFE: Practical and Privacy-Preserving Federated Learning via Functional Encryption
    Qian, Xinyuan
    Li, Hongwei
    Hao, Meng
    Yuan, Shuai
    Zhang, Xilin
    Guo, Song
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 2999 - 3004
  • [39] PILE: Robust Privacy-Preserving Federated Learning Via Verifiable Perturbations
    Tang, Xiangyun
    Shen, Meng
    Li, Qi
    Zhu, Liehuang
    Xue, Tengfei
    Qu, Qiang
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (06) : 5005 - 5023
  • [40] FedMDO: Privacy-preserving Federated Learning via Mixup Differential Objective
    You X.
    Liu C.
    Li J.
    Sun Y.
    Liu X.
    IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (10) : 1 - 1