Federated data processing and learning for collaboration in the physical sciences

被引:2
|
作者
Huang, W. [1 ]
Barnard, A. S. [1 ]
机构
[1] Australian Natl Univ, Sch Comp, Acton 2601, Australia
来源
关键词
machine learning; federated learning; physical science; nanoparticles;
D O I
10.1088/2632-2153/aca87c
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Property analysis and prediction is a challenging topic in fields such as chemistry, nanotechnology and materials science, and often suffers from lack of data. Federated learning (FL) is a machine learning (ML) framework that encourages privacy-preserving collaborations between data owners, and potentially overcomes the need to combine data that may contain proprietary information. Combining information from different data sets within the same domain can also produce ML models with more general insight and reduce the impact of the selection bias inherent in small, individual studies. In this paper we propose using horizontal FL to mitigate these data limitation issues and explore the opportunity for data-driven collaboration under these constraints. We also propose FedRed, a new dimensionality reduction method for FL, that allows faster convergence and accounts for differences between individual data sets. The FL pipeline has been tested on a collection of eight different data sets of metallic nanoparticles, and while there are expected losses compared to a combined data set that does not preserve the privacy of the collaborators, we obtained extremely good result compared to local training on individual data sets. We conclude that FL is an effective and efficient method for the physical science domain that could hugely reduce the negative effect of insufficient data.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Data Valuation Methods for Federated Learning
    Ardic, Emre
    Genc, Yakup
    [J]. 2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [42] Federated Fuzzy Learning with Imbalanced Data
    Dust, Lukas Johannes
    Murcia, Marina Lopez
    Makila, Andreas
    Nordin, Petter
    Xiong, Ning
    Herrera, Francisco
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 1130 - 1137
  • [43] Federated Learning with Positive and Unlabeled Data
    Lin, Xinyang
    Chen, Hanting
    Xu, Yixing
    Xu, Chao
    Gui, Xiaolin
    Deng, Yiping
    Wang, Yunhe
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [44] Quantum Federated Learning With Decentralized Data
    Huang, Rui
    Tan, Xiaoqing
    Xu, Qingshan
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, 2022, 28 (04)
  • [45] Unsupervised Federated Learning for Unbalanced Data
    Servetnyk, Mykola
    Fung, Carrson C.
    Han, Zhu
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [46] A survey on federated learning in data mining
    Yu, Bin
    Mao, Wenjie
    Lv, Yihan
    Zhang, Chen
    Xie, Yu
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (01)
  • [47] Fair Federated Learning for Heterogeneous Data
    Kanaparthy, Samhita
    Padala, Manisha
    Damle, Sankarshan
    Gujar, Sujit
    [J]. PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 298 - 299
  • [48] QUANTUM FEDERATED LEARNING WITH QUANTUM DATA
    Chehimi, Mahdi
    Saad, Walid
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8617 - 8621
  • [49] A federated database approach to integrating Life Sciences data.
    Schwarz, P
    Rice, J
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2001, 222 : U393 - U393
  • [50] Federated data access and federated learning: improved data sharing, AI model development, and learning in intensive care
    van Genderen, Michel E.
    Cecconi, Maurizio
    Jung, Christian
    [J]. INTENSIVE CARE MEDICINE, 2024, 50 (06) : 974 - 977