Privacy preserving distributed machine learning with federated learning

Cited by: 48
Authors
Chamikara, M. A. P. [1, 2]
Bertok, P. [1]
Khalil, I. [1]
Liu, D. [2]
Camtepe, S. [2]
Affiliations
[1] RMIT Univ, Melbourne, Vic, Australia
[2] CSIRO Data61, Sydney, NSW, Australia
Keywords
Data privacy; Distributed data privacy; Privacy preserving machine learning; Distributed machine learning; Federated learning; Data perturbation; t-closeness; k-anonymity; Information; Security
DOI
10.1016/j.comcom.2021.02.014
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Edge computing and distributed machine learning have matured to the point where they can transform how an organization operates. Distributed devices such as those in the Internet of Things (IoT) often produce large amounts of data, eventually resulting in big data that can be vital for uncovering hidden patterns and other insights in fields such as healthcare, banking, and policing. Data from areas such as healthcare and banking can contain sensitive information that may become public if it is not appropriately sanitized. Federated learning (FedML) is a recently developed distributed machine learning (DML) approach that tries to preserve privacy by bringing the learning of an ML model to the data owners' devices. However, the literature describes attack methods, such as membership inference, that exploit vulnerabilities of ML models and of the coordinating servers to retrieve private data. Hence, FedML needs additional measures to guarantee data privacy. Furthermore, big data often requires more resources than a standard computer provides. This paper addresses these issues by proposing a distributed perturbation algorithm named DISTPAB for privacy preservation of horizontally partitioned data. DISTPAB alleviates computational bottlenecks by distributing the task of privacy preservation, exploiting the asymmetry of resources in a distributed environment, which can include resource-constrained devices as well as high-performance computers. Experiments show that DISTPAB provides high accuracy, high efficiency, high scalability, and high attack resistance. Further experiments on privacy-preserving FedML show that DISTPAB is an excellent solution for stopping privacy leaks in DML while preserving high data utility.
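The abstract does not describe how DISTPAB works internally, so the following is only a minimal sketch of the general setting it refers to: horizontally partitioned data (each client holds different rows with the same features), perturbation applied on the client before training, and a coordinating server that averages locally trained models. The Gaussian noise in `perturb` is a generic stand-in and is not DISTPAB; all function names, the linear model, and the parameter values are assumptions made for illustration.

```python
import random

def perturb(rows, scale, rng):
    """Add zero-mean Gaussian noise to each feature.
    Generic stand-in for a perturbation step; this is NOT DISTPAB."""
    return [([x + rng.gauss(0.0, scale) for x in xs], y) for xs, y in rows]

def local_update(weights, rows, lr=0.05, epochs=20):
    """Gradient descent for linear regression on one client's own rows."""
    w = list(weights)
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for xs, y in rows:
            err = sum(wi * xi for wi, xi in zip(w, xs)) - y
            for j, xj in enumerate(xs):
                grad[j] += 2.0 * err * xj / len(rows)
        w = [wi - lr * g for wi, g in zip(w, grad)]
    return w

def federated_average(client_weights, client_sizes):
    """Server step: average client models, weighted by number of rows."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[j] * n for w, n in zip(client_weights, client_sizes)) / total
        for j in range(dim)
    ]

def run_rounds(clients, dim, rounds=10):
    """Alternate local training and server averaging; raw rows never leave a client."""
    w = [0.0] * dim
    for _ in range(rounds):
        updates = [local_update(w, rows) for rows in clients]
        w = federated_average(updates, [len(rows) for rows in clients])
    return w

def make_clients(n_clients=3, rows_per_client=40, noise_scale=0.05, seed=0):
    """Build synthetic horizontally partitioned data and perturb it per client."""
    rng = random.Random(seed)
    true_w = [2.0, -1.0]  # hypothetical ground-truth model
    clients = []
    for _ in range(n_clients):
        rows = []
        for _ in range(rows_per_client):
            xs = [rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)]
            rows.append((xs, sum(t * x for t, x in zip(true_w, xs))))
        clients.append(perturb(rows, noise_scale, rng))
    return clients, true_w

if __name__ == "__main__":
    clients, true_w = make_clients()
    print("learned:", run_rounds(clients, dim=2), "true:", true_w)
```

In this sketch the server only ever sees model weights, and each client trains on its perturbed copy of the data; a larger `noise_scale` would be expected to trade model accuracy for stronger obfuscation of the original values.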
Pages: 112-125
Number of pages: 14