Privacy preserving distributed machine learning with federated learning

被引:48
|
作者
Chamikara, M. A. P. [1 ,2 ]
Bertok, P. [1 ]
Khalil, I. [1 ]
Liu, D. [2 ]
Camtepe, S. [2 ]
机构
[1] RMIT Univ, Melbourne, Vic, Australia
[2] CSIRO Data61, Sydney, NSW, Australia
关键词
Data privacy; Distributed data privacy; Privacy preserving machine learning; Distributed machine learning; Federated learning; DATA PERTURBATION; T-CLOSENESS; K-ANONYMITY; INFORMATION; SECURITY;
D O I
10.1016/j.comcom.2021.02.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge computing and distributed machine learning have advanced to a level that can revolutionize a particular organization. Distributed devices such as the Internet of Things (IoT) often produce a large amount of data, eventually resulting in big data that can be vital in uncovering hidden patterns, and other insights in numerous fields such as healthcare, banking, and policing. Data related to areas such as healthcare and banking can contain potentially sensitive data that can become public if they are not appropriately sanitized. Federated learning (FedML) is a recently developed distributed machine learning (DML) approach that tries to preserve privacy by bringing the learning of an ML model to data owners' devices. However, literature shows different attack methods such as membership inference that exploit the vulnerabilities of ML models as well as the coordinating servers to retrieve private data. Hence, FedML needs additional measures to guarantee data privacy. Furthermore, big data often requires more resources than available in a standard computer. This paper addresses these issues by proposing a distributed perturbation algorithm named as DISTPAB, for privacy preservation of horizontally partitioned data. DISTPAB alleviates computational bottlenecks by distributing the task of privacy preservation utilizing the asymmetry of resources of a distributed environment, which can have resource-constrained devices as well as high-performance computers. Experiments show that DISTPAB provides high accuracy, high efficiency, high scalability, and high attack resistance. Further experiments on privacy-preserving FedML show that DISTPAB is an excellent solution to stop privacy leaks in DML while preserving high data utility.
引用
收藏
页码:112 / 125
页数:14
相关论文
共 50 条
  • [11] Preserving Model Privacy for Machine Learning in Distributed Systems
    Jia, Qi
    Guo, Linke
    Jin, Zhanpeng
    Fang, Yuguang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (08) : 1808 - 1822
  • [12] From distributed machine learning to federated learning: In the view of data privacy and security
    Shen, Sheng
    Zhu, Tianqing
    Wu, Di
    Wang, Wei
    Zhou, Wanlei
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (16):
  • [13] Preserving Privacy and Security in Federated Learning
    Nguyen, Truc
    Thai, My T.
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (01) : 833 - 843
  • [14] Distributed additive encryption and quantization for privacy preserving federated deep learning
    Zhu, Hangyu
    Wang, Rui
    Jin, Yaochu
    Liang, Kaitai
    Ning, Jianting
    NEUROCOMPUTING, 2021, 463 : 309 - 327
  • [15] Privacy-Preserving Asynchronous Federated Learning Framework in Distributed IoT
    Yan, Xinru
    Miao, Yinbin
    Li, Xinghua
    Choo, Kim-Kwang Raymond
    Meng, Xiangdong
    Deng, Robert H. H.
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) : 13281 - 13291
  • [16] Landscape of machine learning evolution: privacy-preserving federated learning frameworks and tools
    Nguyen, Giang
    Sáinz-Pardo Díaz, Judith
    Calatrava, Amanda
    Berberi, Lisana
    Lytvyn, Oleksandr
    Kozlov, Valentin
    Tran, Viet
    Moltó, Germán
    López García, Álvaro
    Artificial Intelligence Review, 2025, 58 (02)
  • [17] Privacy Preserving Misbehavior Detection in IoV using Federated Machine Learning
    Uprety, Aashma
    Rawat, Danda B.
    Li, Jiang
    2021 IEEE 18TH ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2021,
  • [18] Secure, privacy-preserving and federated machine learning in medical imaging
    Georgios A. Kaissis
    Marcus R. Makowski
    Daniel Rückert
    Rickmer F. Braren
    Nature Machine Intelligence, 2020, 2 : 305 - 311
  • [19] Secure, privacy-preserving and federated machine learning in medical imaging
    Kaissis, Georgios A.
    Makowski, Marcus R.
    Ruckert, Daniel
    Braren, Rickmer F.
    NATURE MACHINE INTELLIGENCE, 2020, 2 (06) : 305 - 311
  • [20] BlockFLow: Decentralized, Privacy-Preserving, and Accountable Federated Machine Learning
    Mugunthan, Vaikkunth
    Rahman, Ravi
    Kagal, Lalana
    BLOCKCHAIN AND APPLICATIONS, 2022, 320 : 233 - 242