Federated Principal Component Analysis for Genome-Wide Association Studies

被引:8
|
作者
Hartebrodt, Anne [1 ]
Nasirigerdeh, Reza [2 ]
Blumenthal, David B. [3 ]
Rottger, Richard [1 ]
机构
[1] Univ Southern Denmark, Odense, Denmark
[2] Tech Univ Munich, Munich, Germany
[3] Friedrich Alexander Univ Erlangen Nurnberg, Erlangen, Germany
关键词
ALGORITHMS;
D O I
10.1109/ICDM51629.2021.00127
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) has emerged as a privacy-aware alternative to centralized data analysis, especially for biomedical analyses such as genome-wide association studies (GWAS). The data remains with the owner, which enables studies previously impossible due to privacy protection regulations. Principal component analysis (PCA) is a frequent preprocessing step in GWAS, where the eigenvectors of the sample-by-sample covariance matrix are used as covariates in the statistical tests. Therefore, a federated version of PCA suitable for vertical data partitioning is required for federated GWAS. Existing federated PCA algorithms exchange the complete sample eigenvectors, a potential privacy breach. In this paper, we present a federated PCA algorithm for vertically partitioned data which does not exchange the sample eigenvectors and is hence suitable for federated GWAS. We show that it outperforms existing federated solutions in terms of convergence behavior and scalability. Additionally, we provide a user-friendly privacy-aware web tool to promote acceptance of federated PCA among GWAS researchers.
引用
收藏
页码:1090 / 1095
页数:6
相关论文
共 50 条
  • [31] Fast Principal Component Analysis of Large-Scale Genome-Wide Data
    Abraham, Gad
    Inouye, Michael
    PLOS ONE, 2014, 9 (04):
  • [32] GENOME-WIDE ASSOCIATION STUDIES Validating, augmenting and refining genome-wide association signals
    Ioannidis, John P. A.
    Thomas, Gilles
    Daly, Mark J.
    NATURE REVIEWS GENETICS, 2009, 10 (05) : 318 - 329
  • [33] Multiple-trait genome-wide association study based on principal component analysis for residual covariance matrix
    Gao, H.
    Zhang, T.
    Wu, Y.
    Wu, Y.
    Jiang, L.
    Zhan, J.
    Li, J.
    Yang, R.
    HEREDITY, 2014, 113 (06) : 526 - 532
  • [34] Multiple-trait genome-wide association study based on principal component analysis for residual covariance matrix
    H Gao
    T Zhang
    Y Wu
    Y Wu
    L Jiang
    J Zhan
    J Li
    R Yang
    Heredity, 2014, 113 : 526 - 532
  • [35] Pulmonary Function: From Genome-Wide Association Studies to Genome-Wide Interaction Studies
    Christiani, David C.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2019, 199 (05) : 557 - 559
  • [36] Privacy-preserving federated genome-wide association studies via dynamic sampling
    Wang, Xinyue
    Dervishi, Leonard
    Li, Wentao
    Ayday, Erman
    Jiang, Xiaoqian
    Vaidya, Jaideep
    BIOINFORMATICS, 2023, 39 (10)
  • [37] Genome-Wide Association Studies in Atherosclerosis
    Sivapalaratnam, S.
    Motazacker, M. M.
    Maiwald, S.
    Hovingh, G. K.
    Kastelein, J. J. P.
    Levi, M.
    Trip, M. D.
    Dallinga-Thie, G. M.
    CURRENT ATHEROSCLEROSIS REPORTS, 2011, 13 (03) : 225 - 232
  • [38] Genome-wide association studies in ADHD
    Franke, Barbara
    Neale, Benjamin M.
    Faraone, Stephen V.
    HUMAN GENETICS, 2009, 126 (01) : 13 - 50
  • [39] Genome-Wide Association Studies of Autism
    Glessner J.T.
    Connolly J.J.
    Hakonarson H.
    Current Behavioral Neuroscience Reports, 2014, 1 (4) : 234 - 241
  • [40] Genome-wide association studies with metabolomics
    Adamski, Jerzy
    GENOME MEDICINE, 2012, 4