Federated singular value decomposition for high-dimensional data

被引:3
|
作者
Hartebrodt, Anne [1 ,2 ]
Rottger, Richard [1 ]
Blumenthal, David B. [2 ]
机构
[1] Univ Southern Denmark, Dept Math & Comp Sci, Campusvej 55, DK-5230 Odense, Denmark
[2] Friedrich Alexander Univ Erlangen Nurnberg FAU, Dept Artificial Intelligence Biomed Engn AIBE, Werner von Siemens Str 61, D-91052 Erlangen, Germany
关键词
Singular value decomposition; Federated learning; Principal component analysis; Genome-wide association studies; PRINCIPAL-COMPONENT ANALYSIS; ALGORITHMS;
D O I
10.1007/s10618-023-00983-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Federated learning (FL) is emerging as a privacy-aware alternative to classical cloud-based machine learning. In FL, the sensitive data remains in data silos and only aggregated parameters are exchanged. Hospitals and research institutions which are not willing to share their data can join a federated study without breaching confidentiality. In addition to the extreme sensitivity of biomedical data, the high dimensionality poses a challenge in the context of federated genome-wide association studies (GWAS). In this article, we present a federated singular value decomposition algorithm, suitable for the privacy-related and computational requirements of GWAS. Notably, the algorithm has a transmission cost independent of the number of samples and is only weakly dependent on the number of features, because the singular vectors corresponding to the samples are never exchanged and the vectors associated with the features are only transmitted to an aggregator for a fixed number of iterations. Although motivated by GWAS, the algorithm is generically applicable for both horizontally and vertically partitioned data.
引用
收藏
页码:938 / 975
页数:38
相关论文
共 50 条
  • [41] Applying tensor decomposition model for high-dimensional toxicogenomics data analysis and interpretation
    Gao, Ce
    Gu, April
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 250
  • [42] Gaussian Partial Information Decomposition: Bias Correction and Application to High-dimensional Data
    Venkatesh, Praveen
    Bennett, Corbett
    Gale, Sam
    Ramirez, Tamina K.
    Heller, Greggory
    Durand, Severine
    Olsen, Shawn
    Mihalas, Stefan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [43] High-dimensional data analytics in civil engineering: A review on matrix and tensor decomposition
    Salehi, Hadi
    Gorodetsky, Alex
    Solhmirzaei, Roya
    Jiao, Pengcheng
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 125
  • [44] Estimation of the precision matrix of a singular Wishart distribution and its application in high-dimensional data
    Kubokawa, Tatsuya
    Srivastava, Muni S.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2008, 99 (09) : 1906 - 1928
  • [45] High-performance singular value decomposition
    Skillicorn, DB
    Yang, XL
    [J]. DATA MINING FOR SCIENTIFIC AND ENGINEERING APPLICATIONS, 2001, 2 : 401 - 424
  • [46] Boundary constraints for singular value decomposition of spectral data
    Gruninger, John
    Dothe, Hoang
    [J]. IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XIX, 2013, 8892
  • [47] A combined p-value test for the mean difference of high-dimensional data
    Yu, Wei
    Xu, Wangli
    Zhu, Lixing
    [J]. SCIENCE CHINA-MATHEMATICS, 2019, 62 (05) : 961 - 978
  • [48] Biclustering of microarray data based on singular value decomposition
    Yang, Wen-Hui
    Dai, Dao-Qing
    Yan, Hong
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 194 - +
  • [49] Singular value decomposition of noisy data: mode corruption
    Epps, Brenden P.
    Krivitzky, Eric M.
    [J]. EXPERIMENTS IN FLUIDS, 2019, 60 (08)
  • [50] Imputation of Mixed Data With Multilevel Singular Value Decomposition
    Husson, Francois
    Josse, Julie
    Narasimhan, Balasubramanian
    Robin, Genevieve
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (03) : 552 - 566