Supervised machine learning using encrypted training data

被引:1
|
作者
Francisco-Javier González-Serrano
Adrián Amor-Martín
Jorge Casamayón-Antón
机构
[1] Universidad Carlos III de Madrid,Department of Signal Theory and Communications
关键词
Classification; Security; integrity; protection; Machine learning; Privacy protection; Homomorphic encryption;
D O I
暂无
中图分类号
学科分类号
摘要
Preservation of privacy in data mining and machine learning has emerged as an absolute prerequisite in many practical scenarios, especially when the processing of sensitive data is outsourced to an external third party. Currently, privacy preservation methods are mainly based on randomization and/or perturbation, secure multiparty computations and cryptographic methods. In this paper, we take advantage of the partial homomorphic property of some cryptosystems to train simple machine learning models with encrypted data. Our basic scenario has three parties: multiple Data Owners, which provide encrypted training examples; the Algorithm Owner (or Application), which processes them to adjust the parameters of its models; and a semi-trusted third party, which provides privacy and secure computation services to the Application in some operations not supported by the homomorphic cryptosystem. In particular, we focus on two issues: the use of multiple-key cryptosystems, and the impact of the quantization of real-valued input data required before encryption. In addition, we develop primitives based on the outsourcing of a reduced set of operations that allows to implement general machine learning algorithms using efficient dedicated hardware. As applications, we consider the training of classifiers using privacy-protected data and the tracking of a moving target using encrypted distance measurements.
引用
收藏
页码:365 / 377
页数:12
相关论文
共 50 条
  • [21] NeuroCrypt: Machine Learning Over Encrypted Distributed Neuroimaging Data
    Senanayake, Nipuna
    Podschwadt, Robert
    Takabi, Daniel
    Calhoun, Vince D.
    Plis, Sergey M.
    [J]. NEUROINFORMATICS, 2022, 20 (01) : 91 - 108
  • [22] Encrypted Network Traffic Classification using Self-supervised Learning
    Towhid, Md Shamim
    Shahriar, Nashid
    [J]. PROCEEDINGS OF THE 2022 IEEE 8TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION (NETSOFT 2022): NETWORK SOFTWARIZATION COMING OF AGE: NEW CHALLENGES AND OPPORTUNITIES, 2022, : 366 - 374
  • [23] Training Data Generation for Machine Learning Using GPR Images
    Boldt, Markus
    Thiele, Antje
    Schulz, Karsten
    [J]. EARTH RESOURCES AND ENVIRONMENTAL REMOTE SENSING/GIS APPLICATIONS XIII, 2022, 12268
  • [24] Frailty Screening at Scale Using Core Clinical Data and Supervised Machine Learning
    Fountotos, Rosie
    Afilalo, Jonathan
    [J]. CIRCULATION, 2022, 146
  • [25] Automatic Labelling of Clusters with Discrete and Continuous Data Using Supervised Machine Learning
    de Sousa Junior, Joselito Mendes
    de Sales Santos, Roney Lira
    Lopes, Lucas Araujo
    Machado, Vinicius Ponte
    Silva, Ivan Saraiva
    [J]. PROCEEDINGS OF THE 2016 35TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016,
  • [26] Design of a Data Glove for Assessment of Hand Performance Using Supervised Machine Learning
    Sarwat, Hussein
    Sarwat, Hassan
    Maged, Shady A.
    Emara, Tamer H.
    Elbokl, Ahmed M.
    Awad, Mohammed Ibrahim
    [J]. SENSORS, 2021, 21 (21)
  • [27] Predicting Diabetes Diseases Using Mixed Data and Supervised Machine Learning Algorithms
    Daanouni, Othmane
    Cherradi, Bouchaib
    Tmiri, Amal
    [J]. 4TH INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA' 19), 2019,
  • [28] Automatic oxidation threshold recognition of XAFS data using supervised machine learning
    Miyazato, Itsuki
    Takahashi, Lauren
    Takahashi, Keisuke
    [J]. MOLECULAR SYSTEMS DESIGN & ENGINEERING, 2019, 4 (05): : 1014 - 1018
  • [29] Address Standardization using Supervised Machine Learning
    Kaleem, Abdul
    Ghori, Khawaja Moyeezullah
    Khanzada, Zahra
    Malik, M. Noman
    [J]. COMPUTER COMMUNICATION AND MANAGEMENT, 2011, 5 : 441 - 445
  • [30] Efficiency of Supervised Machine Learning Algorithms in Regular and Encrypted VoIP Classification within NFV Environment
    Ilievski, Gjorgji
    Latkoski, Pero
    [J]. RADIOENGINEERING, 2020, 29 (01) : 243 - 250