IDPriU: A two-party ID-private data union protocol for privacy-preserving machine learning

被引:0
|
作者
Yan, Jianping [1 ]
Wei, Lifei [1 ]
Qian, Xiansong [2 ]
Zhang, Lei [2 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[2] Shanghai Ocean Univ, Coll Informat Technol, Shanghai 201306, Peoples R China
基金
上海市自然科学基金;
关键词
Private data union; Privacy-preserving machine learning; Data security; Data preprocessing; Private set union;
D O I
10.1016/j.jisa.2024.103913
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to significant data security concerns in machine learning, such as the data silo problem, there has been a growing trend towards the development of privacy-preserving machine learning applications. The initial step in training data across silos involves establishing secure data joins, specifically private data joins, to ensure the consistency and accuracy of the dataset. While the majority of current research focuses on the inner join of private data, this paper specifically addresses the privacy-preserving full join of private data and develops two-party unbalanced private data full join protocols utilizing secure multi-party computation tools. Notably, our paper introduces the novel component of Private Match-and-Connect (PMC), which performs a union operation on the ID and feature values, and ensure the secret sharing of the resulting union set. Each participant receives only a portion of the secret share, thereby guaranteeing data security during the pre-processing phase. Furthermore, we propose the two-party ID-private data union protocol (IDPriU), which facilitates secure and accurate matching of feature value shares and ID shares and also enables the data alignment. Our protocol represents a significant advancement in the field of privacy-preserving data preprocessing in machine learning and privacy-preserving federated queries. It extends the concept that private data joins are limited to inner connections, offering a novel approach by Private Set Union (PSU). We have experimentally implemented our protocol and obtained favorable results in terms of both runtime and communication overhead.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Privacy-preserving Machine Learning Algorithms for Big Data Systems
    Xu, Kaihe
    Yue, Hao
    Guo, Linke
    Guo, Yuanxiong
    Fang, Yuguang
    2015 IEEE 35TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 2015, : 318 - 327
  • [42] A two-party privacy preserving set intersection protocol against malicious users in cloud computing
    Cao, Xuefei
    Li, Hui.
    Dang, Lanjun
    Lin, Yin
    COMPUTER STANDARDS & INTERFACES, 2017, 54 : 41 - 45
  • [43] Federated Learning: The Pioneering Distributed Machine Learning and Privacy-Preserving Data Technology
    Treleaven, Philip
    Smietanka, Malgorzata
    Pithadia, Hirsh
    COMPUTER, 2022, 55 (04) : 20 - 29
  • [44] A Privacy-Preserving Distributed Machine Learning Protocol Based on Homomorphic Hash Authentication
    Hong, Yang
    Wang, Lisong
    Meng, Weizhi
    Cao, Jian
    Ge, Chunpeng
    Zhang, Qin
    Zhang, Rui
    NETWORK AND SYSTEM SECURITY, NSS 2022, 2022, 13787 : 374 - 386
  • [45] A Solution to Privacy-preserving Two-party Sign Test on Vertically Partitioned Data (P22NSTv) Using Data Disguising Techniques
    Liu, Meng-chang
    Zhang, Ning
    2010 INTERNATIONAL CONFERENCE ON NETWORKING AND INFORMATION TECHNOLOGY (ICNIT 2010), 2010, : 526 - 534
  • [46] Privacy-Preserving Machine Learning Based Data Analytics on Edge Devices
    Zhao, Jianxin
    Mortier, Richard
    Crowcroft, Jon
    Wang, Liang
    PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY (AIES'18), 2018, : 341 - 346
  • [47] Force: Highly Efficient Four-Party Privacy-Preserving Machine Learning on GPU
    Dai, Tianxiang
    Duan, Li
    Jiang, Yufan
    Li, Yong
    Mei, Fei
    Sun, Yulian
    SECURE IT SYSTEMS, NORDSEC 2023, 2024, 14324 : 330 - 349
  • [48] Privacy-Preserving Student Learning with Differentially Private Data-Free Distillation
    Liu, Bochao
    Lu, Jianghu
    Wang, Pengju
    Zhang, Junjie
    Zeng, Dan
    Qian, Zhenxing
    Ge, Shiming
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [49] Outsourcing Two-party Privacy Preserving K-means Clustering Protocol In Wireless Sensor Networks
    Liu, Xiaoyan
    Jiang, Zoe L.
    Yiu, S. M.
    Wang, Xuan
    Tan, Chuting
    Li, Ye
    Liu, Zechao
    Jin, Yabin
    Fang, Junbin
    2015 11TH INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SENSOR NETWORKS (MSN), 2015, : 124 - 133