Differentially Private Learning with Small Public Data

被引:0
|
作者
Wang, Jun [1 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Differentially private learning tackles tasks where the data are private and the learning process is subject to differential privacy requirements. In real applications, however, some public data are generally available in addition to private data, and it is interesting to consider how to exploit them. In this paper, we study a common situation where a small amount of public data can be used when solving the Empirical Risk Minimization problem over a private database. Specifically, we propose Private-Public Stochastic Gradient Descent, which utilizes such public information to adjust parameters in differentially private stochastic gradient descent and fine-tunes the final result with model reuse. Our method keeps differential privacy for the private database, and empirical study validates its superiority compared with existing approaches.
引用
收藏
页码:6219 / 6226
页数:8
相关论文
共 50 条
  • [1] Differentially Private Distance Learning in Categorical Data
    Battaglia, Elena
    Celano, Simone
    Pensa, Ruggero G.
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 35 (05) : 2050 - 2088
  • [2] Differentially private Bayesian learning on distributed data
    Heikkila, Mikko
    Lagerspetz, Eemil
    Kaski, Samuel
    Shimizu, Kana
    Tarkoma, Sasu
    Honkela, Antti
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [3] Differentially Private Federated Learning on Heterogeneous Data
    Noble, Maxence
    Bellet, Aurelien
    Dieuleveut, Aymeric
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [4] Differentially Private Distance Learning in Categorical Data
    Elena Battaglia
    Simone Celano
    Ruggero G. Pensa
    [J]. Data Mining and Knowledge Discovery, 2021, 35 : 2050 - 2088
  • [5] Differentially private distributed logistic regression using private and public data
    Zhanglong Ji
    Xiaoqian Jiang
    Shuang Wang
    Li Xiong
    Lucila Ohno-Machado
    [J]. BMC Medical Genomics, 7
  • [6] Differentially private distributed logistic regression using private and public data
    Ji, Zhanglong
    Jiang, Xiaoqian
    Wang, Shuang
    Xiong, Li
    Ohno-Machado, Lucila
    [J]. BMC MEDICAL GENOMICS, 2014, 7
  • [7] Distributionally Robust Federated Learning for Differentially Private Data
    Shi, Siping
    Hu, Chuang
    Wang, Dan
    Zhu, Yifei
    Han, Zhu
    [J]. 2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 842 - 852
  • [8] Collaborative learning from distributed data with differentially private synthetic data
    Prediger, Lukas
    Jalko, Joonas
    Honkela, Antti
    Kaski, Samuel
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [9] Differentially Private Crowdsourcing With the Public and Private Blockchain
    Wang, Minghao
    Zhu, Tianqing
    Zuo, Xuhan
    Yang, Mengmeng
    Yu, Shui
    Zhou, Wanlei
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (10) : 8918 - 8930
  • [10] Generalized genomic data sharing for differentially private federated learning
    Al Aziz, Md Momin
    Anjum, Md Monowar
    Mohammed, Noman
    Jiang, Xiaoqian
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 132