Federated Principal Component Analysis for Genome-Wide Association Studies

被引:8
|
作者
Hartebrodt, Anne [1 ]
Nasirigerdeh, Reza [2 ]
Blumenthal, David B. [3 ]
Rottger, Richard [1 ]
机构
[1] Univ Southern Denmark, Odense, Denmark
[2] Tech Univ Munich, Munich, Germany
[3] Friedrich Alexander Univ Erlangen Nurnberg, Erlangen, Germany
关键词
ALGORITHMS;
D O I
10.1109/ICDM51629.2021.00127
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) has emerged as a privacy-aware alternative to centralized data analysis, especially for biomedical analyses such as genome-wide association studies (GWAS). The data remains with the owner, which enables studies previously impossible due to privacy protection regulations. Principal component analysis (PCA) is a frequent preprocessing step in GWAS, where the eigenvectors of the sample-by-sample covariance matrix are used as covariates in the statistical tests. Therefore, a federated version of PCA suitable for vertical data partitioning is required for federated GWAS. Existing federated PCA algorithms exchange the complete sample eigenvectors, a potential privacy breach. In this paper, we present a federated PCA algorithm for vertically partitioned data which does not exchange the sample eigenvectors and is hence suitable for federated GWAS. We show that it outperforms existing federated solutions in terms of convergence behavior and scalability. Additionally, we provide a user-friendly privacy-aware web tool to promote acceptance of federated PCA among GWAS researchers.
引用
收藏
页码:1090 / 1095
页数:6
相关论文
共 50 条
  • [11] REPLICABILITY ANALYSIS FOR GENOME-WIDE ASSOCIATION STUDIES
    Heller, Ruth
    Yekutieli, Daniel
    ANNALS OF APPLIED STATISTICS, 2014, 8 (01): : 481 - 498
  • [12] Power analysis for genome-wide association studies
    Robert J Klein
    BMC Genetics, 8
  • [13] Power analysis for genome-wide association studies
    Klein, Robert J.
    BMC GENETICS, 2007, 8 (1)
  • [14] Genome-wide association studies
    Nature Reviews Methods Primers, 1
  • [15] Genome-wide association studies
    Willson, Joseph
    NATURE REVIEWS METHODS PRIMERS, 2021, 1 (01):
  • [16] Genome-Wide Association Studies
    Guo, Xiuqing
    Rotter, Jerome I.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2019, 322 (17): : 1705 - 1706
  • [17] Principal component analysis of canine hip dysplasia phenotypes and their statistical power for genome-wide association mapping
    Duan, Faping
    Ogden, Daniel
    Xu, Ling
    Liu, Kang
    Lust, George
    Sandler, Jody
    Dykes, Nathan L.
    Zhu, Lan
    Harris, Steven
    Jones, Paul
    Todhunter, Rory J.
    Zhang, Zhiwu
    JOURNAL OF APPLIED STATISTICS, 2013, 40 (02) : 235 - 251
  • [18] Federated generalized linear mixed models for collaborative genome-wide association studies
    Li, Wentao
    Chen, Han
    Jiang, Xiaoqian
    Harmanci, Arif
    ISCIENCE, 2023, 26 (08)
  • [19] Pathway-Based Analysis for Genome-Wide Association Studies Using Supervised Principal Components
    Chen, Xi
    Wang, Lily
    Hu, Bo
    Guo, Mingsheng
    Barnard, John
    Zhu, Xiaofeng
    GENETIC EPIDEMIOLOGY, 2010, 34 (07) : 716 - 724
  • [20] SNP Set Association Analysis for Genome-Wide Association Studies
    Cai, Min
    Dai, Hui
    Qiu, Yongyong
    Zhao, Yang
    Zhang, Ruyang
    Chu, Minjie
    Dai, Juncheng
    Hu, Zhibin
    Shen, Hongbing
    Chen, Feng
    PLOS ONE, 2013, 8 (05):