Federated singular value decomposition for high-dimensional data

Cited by: 3
Authors
Hartebrodt, Anne [1, 2]
Röttger, Richard [1]
Blumenthal, David B. [2]
Affiliations
[1] Univ Southern Denmark, Dept Math & Comp Sci, Campusvej 55, DK-5230 Odense, Denmark
[2] Friedrich Alexander Univ Erlangen Nurnberg FAU, Dept Artificial Intelligence Biomed Engn AIBE, Werner von Siemens Str 61, D-91052 Erlangen, Germany
Keywords
Singular value decomposition; Federated learning; Principal component analysis; Genome-wide association studies; PRINCIPAL-COMPONENT ANALYSIS; ALGORITHMS
DOI
10.1007/s10618-023-00983-z
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Federated learning (FL) is emerging as a privacy-aware alternative to classical cloud-based machine learning. In FL, the sensitive data remains in data silos and only aggregated parameters are exchanged. Hospitals and research institutions which are not willing to share their data can join a federated study without breaching confidentiality. In addition to the extreme sensitivity of biomedical data, the high dimensionality poses a challenge in the context of federated genome-wide association studies (GWAS). In this article, we present a federated singular value decomposition algorithm, suitable for the privacy-related and computational requirements of GWAS. Notably, the algorithm has a transmission cost independent of the number of samples and is only weakly dependent on the number of features, because the singular vectors corresponding to the samples are never exchanged and the vectors associated with the features are only transmitted to an aggregator for a fixed number of iterations. Although motivated by GWAS, the algorithm is generically applicable for both horizontally and vertically partitioned data.
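The communication pattern described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it is plain power iteration for the leading right singular vector on horizontally partitioned data, and the names `local_update` and `federated_top_singular` are hypothetical. Each site keeps its sample-side projections local and sends only a vector of length d (the number of features) to the aggregator, so the per-round message size does not depend on the number of samples.

```python
import math


def local_update(rows, v):
    """Site-side step: return A_i^T (A_i v) for the local sample rows A_i."""
    out = [0.0] * len(v)
    for row in rows:
        # Projection of one sample onto v; this sample-side value never leaves the site.
        proj = sum(x * w for x, w in zip(row, v))
        for j, x in enumerate(row):
            out[j] += proj * x
    return out


def federated_top_singular(sites, d, iterations=50):
    """Aggregator loop: estimate the leading singular value and right singular vector.

    `sites` is a list of local data matrices (lists of rows with d features each).
    Assumes the uniform start vector is not orthogonal to the leading singular vector.
    """
    v = [1.0 / math.sqrt(d)] * d
    sigma = 0.0
    for _ in range(iterations):  # fixed number of communication rounds
        total = [0.0] * d
        for rows in sites:
            contrib = local_update(rows, v)  # only a length-d vector is transmitted
            total = [t + c for t, c in zip(total, contrib)]
        norm = math.sqrt(sum(t * t for t in total))
        if norm == 0.0:
            break
        v = [t / norm for t in total]
        sigma = math.sqrt(norm)  # ||A^T A v|| approaches sigma^2 as v converges
    return sigma, v


if __name__ == "__main__":
    # Two sites, one sample each; the stacked matrix is diag(3, 1).
    site_a = [[3.0, 0.0]]
    site_b = [[0.0, 1.0]]
    sigma, v = federated_top_singular([site_a, site_b], d=2)
    print(sigma, v)
```

In this toy setup the stacked data matrix has singular values 3 and 1, so the estimate should approach 3 with a singular vector aligned with the first feature. Extending the sketch to several singular vectors would require an orthonormalization step at the aggregator, which is omitted here.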
Pages: 938-975
Page count: 38
Related papers
50 records in total
  • [1] Federated singular value decomposition for high-dimensional data
    Anne Hartebrodt
    Richard Röttger
    David B. Blumenthal
    [J]. Data Mining and Knowledge Discovery, 2024, 38 : 938 - 975
  • [2] A Sparse Singular Value Decomposition Method for High-Dimensional Data
    Yang, Dan
    Ma, Zongming
    Buja, Andreas
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2014, 23 (04) : 923 - 942
  • [3] Asymptotic Conditional Singular Value Decomposition for High-Dimensional Genomic Data
    Leek, Jeffrey T.
    [J]. BIOMETRICS, 2011, 67 (02) : 344 - 352
  • [4] Mining high-dimensional scientific data sets using singular value decomposition
    Maltseva, E
    Pizzuti, C
    Talia, D
    [J]. DATA MINING FOR SCIENTIFIC AND ENGINEERING APPLICATIONS, 2001, 2 : 425 - 438
  • [5] Optimal Sparse Singular Value Decomposition for High-Dimensional High-Order Data
    Zhang, Anru
    Han, Rungang
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (528) : 1708 - 1725
  • [6] Improving the efficiency of multidimensional scaling in the analysis of high-dimensional data using singular value decomposition
    Becavin, Christophe
    Tchitchek, Nicolas
    Mintsa-Eya, Colette
    Lesne, Annick
    Benecke, Arndt
    [J]. BIOINFORMATICS, 2011, 27 (10) : 1413 - 1421
  • [7] High-Dimensional Generalized Orthogonal Matching Pursuit With Singular Value Decomposition
    Zong, Zhaoyun
    Fu, Ting
    Yin, Xingyao
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [8] CSVD: Clustering and Singular Value Decomposition for approximate similarity search in high-dimensional spaces
    Castelli, V
    Thomasian, A
    Li, CS
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (03) : 671 - 685
  • [9] Modified Regularization for High-dimensional Data Decomposition
    Chai, Sheng
    Feng, Wenying
    Hassanein, Hossam
    [J]. 2022 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2022, : 710 - 714
  • [10] Privacy-Preserving Federated Singular Value Decomposition
    Liu, Bowen
    Pejo, Balazs
    Tang, Qiang
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (13):