The Value of Collaboration in Convex Machine Learning with Differential Privacy

被引:77
|
作者
Wu, Nan [1 ]
Farokhi, Farhad [2 ,3 ]
Smith, David [2 ,4 ]
Kaafar, Mohamed Ali [1 ,2 ]
机构
[1] Macquarie Univ, N Ryde, NSW, Australia
[2] CSIRO, Data61, Canberra, ACT, Australia
[3] Univ Melbourne, Melbourne, Vic 3010, Australia
[4] Australian Natl Univ, Canberra, ACT, Australia
关键词
Machine learning; Differential privacy; Stochastic gradient algorithm; REGRESSION;
D O I
10.1109/SP40000.2020.00025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we apply machine learning to distributed private data owned by multiple data owners, entities with access to non-overlapping training datasets. We use noisy, differentially-private gradients to minimize the fitness cost of the machine learning model using stochastic gradient descent. We quantify the quality of the trained model, using the fitness cost, as a function of privacy budget and size of the distributed datasets to capture the trade-off between privacy and utility in machine learning. This way, we can predict the outcome of collaboration among privacy-aware data owners prior to executing potentially computationally-expensive machine learning algorithms. Particularly, we show that the difference between the fitness of the trained machine learning model using differentially-private gradient queries and the fitness of the trained machine model in the absence of any privacy concerns is inversely proportional to the size of the training datasets squared and the privacy budget squared. We successfully validate the performance prediction with the actual performance of the proposed privacy-aware learning algorithms, applied to: financial datasets for determining interest rates of loans using regression; and detecting credit card frauds using support vector machines.
引用
收藏
页码:304 / 317
页数:14
相关论文
共 50 条
  • [41] Efficient Data Collaboration Using Multi-Party Privacy Preserving Machine Learning Framework
    Salam, Abdu
    Abrar, Mohammad
    Ullah, Faizan
    Khan, Izaz Ahmad
    Amin, Farhan
    Choi, Gyu Sang
    IEEE ACCESS, 2023, 11 : 138151 - 138164
  • [42] Security and Privacy in Machine Learning
    Chandran, Nishanth
    INFORMATION SYSTEMS SECURITY, ICISS 2023, 2023, 14424 : 229 - 248
  • [43] Privacy: A machine learning view
    Vinterbo, SA
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (08) : 939 - 948
  • [44] Leveraging Edge Computing and Differential Privacy to Securely Enable Industrial Cloud Collaboration Along the Value Chain
    Giehl, Alexander
    Heinl, Michael P.
    Busch, Maximilian
    2021 IEEE 17TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2021, : 2023 - 2028
  • [45] Deep Learning with Differential Privacy
    Abadi, Martin
    Chu, Andy
    Goodfellow, Ian
    McMahan, H. Brendan
    Mironov, Ilya
    Talwar, Kunal
    Zhang, Li
    CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, : 308 - 318
  • [46] Advancing privacy in learning analytics using differential privacy
    Liu, Qinyi
    Shakya, Ronas
    Khalil, Mohammad
    Jovanovic, Jelena
    FIFTEENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE, LAK 2025, 2025, : 181 - 191
  • [47] How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy
    Ponomareva, Natalia
    Hazimeh, Hussein
    Kurakin, Alex
    Xu, Zheng
    Denison, Carson
    McMahan, H. Brendan
    Vassilvitskii, Sergei
    Chien, Steve
    Thakurta, Abhradeep
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2023, 77 : 1113 - 1201
  • [48] Differential Privacy Protection Against Membership Inference Attack on Machine Learning for Genomic Data
    Chen, Junjie
    Wang, Wendy Hui
    Shi, Xinghua
    PACIFIC SYMPOSIUM ON BICOMPUTING 2021, 2021, : 26 - 37
  • [49] How to DP-fy ML: A Practical Tutorial to Machine Learning with Differential Privacy
    Ponomareva, Natalia
    Vassilvitskii, Sergei
    Xu, Zheng
    McMahan, Brendan
    Kurakin, Alexey
    Zhang, Chiyuan
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 5823 - 5824
  • [50] Comparison of Machine Learning Algorithms trained under Differential Privacy for Intrusion Detection Systems
    Siachos, Ioannis
    Kaltakis, Konstantinos
    Papachristopoulou, Konstantina
    Giannoulakis, Ioannis
    Kafetzakis, Emmanouil
    2023 IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND RESILIENCE, CSR, 2023, : 654 - 658