Federated data processing and learning for collaboration in the physical sciences

Cited by: 2
Authors
Huang, W. [1]
Barnard, A. S. [1]
Affiliations
[1] Australian Natl Univ, Sch Comp, Acton 2601, Australia
Source
Keywords
machine learning; federated learning; physical science; nanoparticles
DOI
10.1088/2632-2153/aca87c
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Property analysis and prediction is a challenging topic in fields such as chemistry, nanotechnology and materials science, and often suffers from a lack of data. Federated learning (FL) is a machine learning (ML) framework that encourages privacy-preserving collaboration between data owners, and potentially overcomes the need to combine data that may contain proprietary information. Combining information from different data sets within the same domain can also produce ML models with more general insight and reduce the impact of the selection bias inherent in small, individual studies. In this paper we propose using horizontal FL to mitigate these data limitation issues and explore the opportunity for data-driven collaboration under these constraints. We also propose FedRed, a new dimensionality reduction method for FL that allows faster convergence and accounts for differences between individual data sets. The FL pipeline has been tested on a collection of eight different data sets of metallic nanoparticles. While there are expected losses compared with a combined data set that does not preserve the privacy of the collaborators, we obtained extremely good results compared with local training on individual data sets. We conclude that FL is an effective and efficient method for the physical science domain that could greatly reduce the negative effect of insufficient data.
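The abstract does not describe the pipeline in detail, so the following is only an illustrative sketch of the general idea behind horizontal FL, not the authors' implementation and not FedRed. It assumes a standard federated averaging (FedAvg) scheme on a toy one-feature linear model; the function names (`local_update`, `federated_average`, `make_clients`), the synthetic data, and all hyperparameters are hypothetical. Each client holds different samples of the same features, trains locally, and shares only model parameters with the server.

```python
import random


def local_update(w, data, lr=0.1, epochs=5):
    """Run a few gradient-descent steps on one client's private (x, y) pairs.

    The toy model is y ~ a * x + b with mean squared error loss (an assumption
    made for illustration only).
    """
    a, b = w
    n = len(data)
    for _ in range(epochs):
        grad_a = sum(2 * (a * x + b - y) * x for x, y in data) / n
        grad_b = sum(2 * (a * x + b - y) for x, y in data) / n
        a, b = a - lr * grad_a, b - lr * grad_b
    return a, b


def federated_average(clients, rounds=100):
    """Horizontal FL: clients share the feature space but hold different samples."""
    w = (0.0, 0.0)
    total = sum(len(data) for data in clients)
    for _ in range(rounds):
        # Each client trains locally; only parameters leave the client, never raw data.
        updates = [local_update(w, data) for data in clients]
        # The server aggregates with a sample-size-weighted mean (FedAvg).
        w = tuple(
            sum(len(data) / total * u[i] for u, data in zip(updates, clients))
            for i in range(2)
        )
    return w


def make_clients(sizes, slope=2.0, intercept=-1.0, seed=0):
    """Build synthetic client data sets of different sizes (hypothetical data)."""
    rng = random.Random(seed)
    clients = []
    for n in sizes:
        xs = [rng.uniform(-1, 1) for _ in range(n)]
        clients.append([(x, slope * x + intercept + rng.gauss(0, 0.01)) for x in xs])
    return clients


if __name__ == "__main__":
    a, b = federated_average(make_clients([20, 35, 50]))
    print(f"estimated slope={a:.3f}, intercept={b:.3f}")
```

In this sketch the weighting by client sample count lets larger data sets contribute more to the shared model, which is one common design choice; how the paper handles differences between data sets is not stated in the abstract beyond the mention of FedRed.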
Pages: 11