Federated Principal Component Analysis for Genome-Wide Association Studies

被引:8
|
作者
Hartebrodt, Anne [1 ]
Nasirigerdeh, Reza [2 ]
Blumenthal, David B. [3 ]
Rottger, Richard [1 ]
机构
[1] Univ Southern Denmark, Odense, Denmark
[2] Tech Univ Munich, Munich, Germany
[3] Friedrich Alexander Univ Erlangen Nurnberg, Erlangen, Germany
关键词
ALGORITHMS;
D O I
10.1109/ICDM51629.2021.00127
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) has emerged as a privacy-aware alternative to centralized data analysis, especially for biomedical analyses such as genome-wide association studies (GWAS). The data remains with the owner, which enables studies previously impossible due to privacy protection regulations. Principal component analysis (PCA) is a frequent preprocessing step in GWAS, where the eigenvectors of the sample-by-sample covariance matrix are used as covariates in the statistical tests. Therefore, a federated version of PCA suitable for vertical data partitioning is required for federated GWAS. Existing federated PCA algorithms exchange the complete sample eigenvectors, a potential privacy breach. In this paper, we present a federated PCA algorithm for vertically partitioned data which does not exchange the sample eigenvectors and is hence suitable for federated GWAS. We show that it outperforms existing federated solutions in terms of convergence behavior and scalability. Additionally, we provide a user-friendly privacy-aware web tool to promote acceptance of federated PCA among GWAS researchers.
引用
收藏
页码:1090 / 1095
页数:6
相关论文
共 50 条
  • [21] sPLINK: a hybrid federated tool as a robust alternative to meta-analysis in genome-wide association studies
    Nasirigerdeh, Reza
    Torkzadehmahani, Reihaneh
    Matschinske, Julian
    Frisch, Tobias
    List, Markus
    Spaeth, Julian
    Weiss, Stefan
    Voelker, Uwe
    Pitkaenen, Esa
    Heider, Dominik
    Wenke, Nina Kerstin
    Kaissis, Georgios
    Rueckert, Daniel
    Kacprowski, Tim
    Baumbach, Jan
    GENOME BIOLOGY, 2022, 23 (01)
  • [22] sPLINK: a hybrid federated tool as a robust alternative to meta-analysis in genome-wide association studies
    Reza Nasirigerdeh
    Reihaneh Torkzadehmahani
    Julian Matschinske
    Tobias Frisch
    Markus List
    Julian Späth
    Stefan Weiss
    Uwe Völker
    Esa Pitkänen
    Dominik Heider
    Nina Kerstin Wenke
    Georgios Kaissis
    Daniel Rueckert
    Tim Kacprowski
    Jan Baumbach
    Genome Biology, 23
  • [23] A pathway analysis method for genome-wide association studies
    Shahbaba, Babak
    Shachaf, Catherine M.
    Yu, Zhaoxia
    STATISTICS IN MEDICINE, 2012, 31 (10) : 988 - 1000
  • [24] Analysis of Corrections Methods in Genome-Wide Association Studies
    Zheng, Ming
    Zhuo, Mugui
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, INFORMATION AND MECHANICAL ENGINEERING (EMIM 2017), 2017, 76 : 439 - 442
  • [25] GENOME-WIDE PATHWAY ANALYSIS OF GENOME-WIDE ASSOCIATION STUDIES ON PSORIASIS AND BEHCET'S DISEASE
    Lee, Y. H.
    ANNALS OF THE RHEUMATIC DISEASES, 2013, 71 : 160 - 160
  • [26] The meta-analysis of genome-wide association studies
    Thompson, John R.
    Attia, John
    Minelli, Cosetta
    BRIEFINGS IN BIOINFORMATICS, 2011, 12 (03) : 259 - 269
  • [27] Longitudinal Data Analysis in Genome-Wide Association Studies
    Beyene, Joseph
    Hamid, Jemila S.
    GENETIC EPIDEMIOLOGY, 2014, 38 : S68 - S73
  • [28] Meta-analysis in genome-wide association studies
    Zeggini, E.
    Ioannidis, J. P. A.
    PHARMACOGENOMICS, 2009, 10 (02) : 191 - 201
  • [29] Gene Set analysis in Genome-wide Association Studies
    Tintle, Nathan L.
    GENETIC EPIDEMIOLOGY, 2009, 33 (08) : 805 - 806
  • [30] Genome-Wide Pathway Analysis of Genome-Wide Association Studies On Systemic Lupus Erythematosus.
    Lee, Young Ho
    Choi, Sung Jae
    Ji, Jong Dae
    Song, Gwan Gyu
    ARTHRITIS AND RHEUMATISM, 2012, 64 (10): : S959 - S960