Privacy preserving distributed machine learning with federated learning

被引：48

作者：

Chamikara, M. A. P. ^{[1
,2
]}

Bertok, P. ^{[1
]}

Khalil, I. ^{[1
]}

Liu, D. ^{[2
]}

Camtepe, S. ^{[2
]}

机构：

[1] RMIT Univ, Melbourne, Vic, Australia

[2] CSIRO Data61, Sydney, NSW, Australia

来源：

COMPUTER COMMUNICATIONS | 2021年 / 171卷

关键词：

Data privacy; Distributed data privacy; Privacy preserving machine learning; Distributed machine learning; Federated learning; DATA PERTURBATION; T-CLOSENESS; K-ANONYMITY; INFORMATION; SECURITY;

D O I：

10.1016/j.comcom.2021.02.014

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Edge computing and distributed machine learning have advanced to a level that can revolutionize a particular organization. Distributed devices such as the Internet of Things (IoT) often produce a large amount of data, eventually resulting in big data that can be vital in uncovering hidden patterns, and other insights in numerous fields such as healthcare, banking, and policing. Data related to areas such as healthcare and banking can contain potentially sensitive data that can become public if they are not appropriately sanitized. Federated learning (FedML) is a recently developed distributed machine learning (DML) approach that tries to preserve privacy by bringing the learning of an ML model to data owners' devices. However, literature shows different attack methods such as membership inference that exploit the vulnerabilities of ML models as well as the coordinating servers to retrieve private data. Hence, FedML needs additional measures to guarantee data privacy. Furthermore, big data often requires more resources than available in a standard computer. This paper addresses these issues by proposing a distributed perturbation algorithm named as DISTPAB, for privacy preservation of horizontally partitioned data. DISTPAB alleviates computational bottlenecks by distributing the task of privacy preservation utilizing the asymmetry of resources of a distributed environment, which can have resource-constrained devices as well as high-performance computers. Experiments show that DISTPAB provides high accuracy, high efficiency, high scalability, and high attack resistance. Further experiments on privacy-preserving FedML show that DISTPAB is an excellent solution to stop privacy leaks in DML while preserving high data utility.

引用

页码：112 / 125

页数：14

共 50 条

[1] Federated Learning: The Pioneering Distributed Machine Learning and Privacy-Preserving Data Technology
Treleaven, Philip
Smietanka, Malgorzata
Pithadia, Hirsh
COMPUTER, 2022, 55 (04) : 20 - 29
[2] Preserving User Privacy for Machine Learning: Local Differential Privacy or Federated Machine Learning?
Zheng, Huadi
Hu, Haibo
Han, Ziyang
IEEE INTELLIGENT SYSTEMS, 2020, 35 (04) : 5 - 14
[3] Privacy Preserving Machine Learning with Homomorphic Encryption and Federated Learning
Fang, Haokun
Qian, Quan
FUTURE INTERNET, 2021, 13 (04):
[4] AN EXPLORATION OF FEDERATED LEARNING FOR PRIVACY-PRESERVING MACHINE LEARNING
Kumar, K. Kiran
Rao, Thalakola Syamsundara
Vullam, Nagagopiraju
Vellela, Sai Srinivas
Jyosthna, B.
Farjana, Shaik
Javvadi, Sravanthi
2024 5TH INTERNATIONAL CONFERENCE ON INNOVATIVE TRENDS IN INFORMATION TECHNOLOGY, ICITIIT 2024, 2024,
[5] Privacy-Preserving and Reliable Distributed Federated Learning
Dong, Yipeng
Zhang, Lei
Xu, Lin
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT I, 2024, 14487 : 130 - 149
[6] Privacy-Preserving Robust Federated Learning with Distributed Differential Privacy
Wang, Fayao
He, Yuanyuan
Guo, Yunchuan
Li, Peizhi
Wei, Xinyu
2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 598 - 605
[7] Soteria: Preserving Privacy in Distributed Machine Learning
Brito, Claudia
Ferreira, Pedro
Portela, Bernardo
Oliveira, Rui
Paulo, Joao
38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 135 - 142
[8] Privacy-Preserving Machine Learning Using Federated Learning and Secure Aggregation
Lia, Dragos
Togan, Mihai
PROCEEDINGS OF THE 2020 12TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTERS AND ARTIFICIAL INTELLIGENCE (ECAI-2020), 2020,
[9] Advancements in Privacy-Preserving Techniques for Federated Learning: A Machine Learning Perspective
Rokade, Monika Dhananjay
Deshmukh, Suruchi
Gumaste, Smita
Shelake, Rekha Maruti
Inamdar, Saba Afreen Ghayasuddin
Chandre, Pankaj
JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (02) : 1075 - 1088
[10] Differential Privacy-preserving Distributed Machine Learning
Wang, Xin
Ishii, Hideaki
Du, Linkang
Cheng, Peng
Chen, Jiming
2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 7339 - 7344

← 1 2 3 4 5 →