Privacy preserving distributed machine learning with federated learning

Cited by: 48

Authors:

Chamikara, M. A. P. [1,2]

Bertok, P. [1]

Khalil, I. [1]

Liu, D. [2]

Camtepe, S. [2]

Affiliations:

[1] RMIT Univ, Melbourne, Vic, Australia

[2] CSIRO Data61, Sydney, NSW, Australia

Source:

COMPUTER COMMUNICATIONS | 2021 / Vol. 171

Keywords:

Data privacy; Distributed data privacy; Privacy preserving machine learning; Distributed machine learning; Federated learning; DATA PERTURBATION; T-CLOSENESS; K-ANONYMITY; INFORMATION; SECURITY;

DOI:

10.1016/j.comcom.2021.02.014

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Edge computing and distributed machine learning have advanced to a level that can revolutionize a particular organization. Distributed devices, such as those in the Internet of Things (IoT), often produce large amounts of data, eventually resulting in big data that can be vital in uncovering hidden patterns and other insights in numerous fields such as healthcare, banking, and policing. Data from areas such as healthcare and banking can contain sensitive information that can become public if it is not appropriately sanitized. Federated learning (FedML) is a recently developed distributed machine learning (DML) approach that tries to preserve privacy by bringing the learning of an ML model to data owners' devices. However, the literature shows that attack methods such as membership inference can exploit the vulnerabilities of ML models, as well as of the coordinating servers, to retrieve private data. Hence, FedML needs additional measures to guarantee data privacy. Furthermore, big data often requires more resources than are available in a standard computer. This paper addresses these issues by proposing a distributed perturbation algorithm named DISTPAB for privacy preservation of horizontally partitioned data. DISTPAB alleviates computational bottlenecks by distributing the task of privacy preservation, utilizing the asymmetry of resources in a distributed environment, which can include resource-constrained devices as well as high-performance computers. Experiments show that DISTPAB provides high accuracy, high efficiency, high scalability, and high attack resistance. Further experiments on privacy-preserving FedML show that DISTPAB is an excellent solution to stop privacy leaks in DML while preserving high data utility.

Pages: 112-125

Page count: 14

Total: 50 records

[11] Preserving Model Privacy for Machine Learning in Distributed Systems
Jia, Qi
Guo, Linke
Jin, Zhanpeng
Fang, Yuguang
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (08) : 1808 - 1822
[12] From distributed machine learning to federated learning: In the view of data privacy and security
Shen, Sheng
Zhu, Tianqing
Wu, Di
Wang, Wei
Zhou, Wanlei
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (16)
[13] Preserving Privacy and Security in Federated Learning
Nguyen, Truc
Thai, My T.
IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (01) : 833 - 843
[14] Distributed additive encryption and quantization for privacy preserving federated deep learning
Zhu, Hangyu
Wang, Rui
Jin, Yaochu
Liang, Kaitai
Ning, Jianting
NEUROCOMPUTING, 2021, 463 : 309 - 327
[15] Privacy-Preserving Asynchronous Federated Learning Framework in Distributed IoT
Yan, Xinru
Miao, Yinbin
Li, Xinghua
Choo, Kim-Kwang Raymond
Meng, Xiangdong
Deng, Robert H. H.
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) : 13281 - 13291
[16] Landscape of machine learning evolution: privacy-preserving federated learning frameworks and tools
Nguyen, Giang
Sáinz-Pardo Díaz, Judith
Calatrava, Amanda
Berberi, Lisana
Lytvyn, Oleksandr
Kozlov, Valentin
Tran, Viet
Moltó, Germán
López García, Álvaro
Artificial Intelligence Review, 2025, 58 (02)
[17] Privacy Preserving Misbehavior Detection in IoV using Federated Machine Learning
Uprety, Aashma
Rawat, Danda B.
Li, Jiang
2021 IEEE 18TH ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2021
[18] Secure, privacy-preserving and federated machine learning in medical imaging
Georgios A. Kaissis
Marcus R. Makowski
Daniel Rückert
Rickmer F. Braren
Nature Machine Intelligence, 2020, 2 : 305 - 311
[19] Secure, privacy-preserving and federated machine learning in medical imaging
Kaissis, Georgios A.
Makowski, Marcus R.
Ruckert, Daniel
Braren, Rickmer F.
NATURE MACHINE INTELLIGENCE, 2020, 2 (06) : 305 - 311
[20] BlockFLow: Decentralized, Privacy-Preserving, and Accountable Federated Machine Learning
Mugunthan, Vaikkunth
Rahman, Ravi
Kagal, Lalana
BLOCKCHAIN AND APPLICATIONS, 2022, 320 : 233 - 242
