The Value of Collaboration in Convex Machine Learning with Differential Privacy

被引:77
|
作者
Wu, Nan [1 ]
Farokhi, Farhad [2 ,3 ]
Smith, David [2 ,4 ]
Kaafar, Mohamed Ali [1 ,2 ]
机构
[1] Macquarie Univ, N Ryde, NSW, Australia
[2] CSIRO, Data61, Canberra, ACT, Australia
[3] Univ Melbourne, Melbourne, Vic 3010, Australia
[4] Australian Natl Univ, Canberra, ACT, Australia
关键词
Machine learning; Differential privacy; Stochastic gradient algorithm; REGRESSION;
D O I
10.1109/SP40000.2020.00025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we apply machine learning to distributed private data owned by multiple data owners, entities with access to non-overlapping training datasets. We use noisy, differentially-private gradients to minimize the fitness cost of the machine learning model using stochastic gradient descent. We quantify the quality of the trained model, using the fitness cost, as a function of privacy budget and size of the distributed datasets to capture the trade-off between privacy and utility in machine learning. This way, we can predict the outcome of collaboration among privacy-aware data owners prior to executing potentially computationally-expensive machine learning algorithms. Particularly, we show that the difference between the fitness of the trained machine learning model using differentially-private gradient queries and the fitness of the trained machine model in the absence of any privacy concerns is inversely proportional to the size of the training datasets squared and the privacy budget squared. We successfully validate the performance prediction with the actual performance of the proposed privacy-aware learning algorithms, applied to: financial datasets for determining interest rates of loans using regression; and detecting credit card frauds using support vector machines.
引用
收藏
页码:304 / 317
页数:14
相关论文
共 50 条
  • [1] Quantum machine learning with differential privacy
    William M. Watkins
    Samuel Yen-Chi Chen
    Shinjae Yoo
    Scientific Reports, 13
  • [2] Quantum machine learning with differential privacy
    Watkins, William M.
    Chen, Samuel Yen-Chi
    Yoo, Shinjae
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [3] Preserving User Privacy for Machine Learning: Local Differential Privacy or Federated Machine Learning?
    Zheng, Huadi
    Hu, Haibo
    Han, Ziyang
    IEEE INTELLIGENT SYSTEMS, 2020, 35 (04) : 5 - 14
  • [4] How Differential Privacy Reinforces Privacy of Machine Learning Models?
    Ben Hamida, Sana
    Mrabet, Hichem
    Jemai, Abderrazak
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2022, 2022, 1653 : 661 - 673
  • [5] Collaboration in Federated Learning With Differential Privacy: A Stackelberg Game Analysis
    Huang, Guangjing
    Wu, Qiong
    Sun, Peng
    Ma, Qian
    Chen, Xu
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (03) : 455 - 469
  • [6] Privacy-preserving quantum machine learning using differential privacy
    Senekane, Makhamisa
    Mafu, Mhlambululi
    Taele, Benedict Molibeli
    2017 IEEE AFRICON, 2017, : 1432 - 1435
  • [7] Differential Privacy-preserving Distributed Machine Learning
    Wang, Xin
    Ishii, Hideaki
    Du, Linkang
    Cheng, Peng
    Chen, Jiming
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 7339 - 7344
  • [8] Correlated Differential Privacy: Feature Selection in Machine Learning
    Zhang, Tao
    Zhu, Tianqing
    Xiong, Ping
    Huo, Huan
    Tari, Zahir
    Zhou, Wanlei
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (03) : 2115 - 2124
  • [9] Optimizing the Numbers of Queries and Replies in Convex Federated Learning With Differential Privacy
    Zhou, Yipeng
    Liu, Xuezheng
    Fu, Yao
    Wu, Di
    Wang, Jessie Hui
    Yu, Shui
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (06) : 4823 - 4837
  • [10] Enhancing correlated big data privacy using differential privacy and machine learning
    Biswas, Sreemoyee
    Fole, Anuja
    Khare, Nilay
    Agrawal, Pragati
    JOURNAL OF BIG DATA, 2023, 10 (01)