Privacy-preserving of SVM over vertically partitioned with imputing missing data

Cited by: 9
Authors
Omer, Mohammed Z. [1, 2]
Gao, Hui [1, 2]
Mustafa, Nadir [1 ]
Affiliations
[1] UESTC, Sch Comp Sci & Engn, Chengdu 611731, Sichuan, Peoples R China
[2] UESTC, Big Data Res Ctr, Chengdu 611731, Sichuan, Peoples R China
Keywords
Data imputation; Distributed privacy-preserving; Gram matrix; Paillier cryptosystem; MULTIPLE IMPUTATION; CHAINED EQUATIONS; CLASSIFICATION; SYSTEM; WORK;
DOI
10.1007/s10619-017-7203-3
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Most distributed data mining algorithms can efficiently manage and mine complete data from distributed resources. Incomplete data, however, require some modifications so that distributed data mining techniques can be applied while maintaining the privacy of sensitive information and still producing good mining results. Classification is an important data mining task aimed at discovering knowledge and classifying new instances, and SVM is regarded as one of the most important algorithms for classification problems in many fields. In this paper, we propose a new distributed privacy-preserving protocol with multiple imputation of missing or incomplete data. Multiple imputation based on multivariate imputation by chained equations is used for the missing data, and the Paillier cryptosystem is used to maintain the privacy of the participants. Finally, we construct a global SVM model over vertically partitioned data, based on the Gram matrix, by introducing a semi-honest third party; the model is built without revealing the private data and is then used to classify new instances. The performance of the proposed protocol was evaluated using the accuracy metric on distributed and centralized data. Our experiments show that the accuracy is the same as on centralized data, and that imputed data give better results than when the incomplete data are omitted. With our protocol, distributed data also achieve better processing time than centralized data.
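The abstract's combination of a Gram matrix with Paillier encryption can be illustrated with a minimal sketch. It is not the authors' protocol: it assumes a linear kernel, two parties holding non-negative integer features for the same three records, and a toy Paillier key built from tiny primes that offers no real security. All names (local_gram, keygen, party_a, party_b, and so on) are hypothetical. The point it shows is that, for vertically partitioned data, the global linear-kernel Gram matrix is the entry-wise sum of the parties' local Gram matrices, and that Paillier's additive homomorphism lets that sum be formed on ciphertexts.

```python
import math
import random

def local_gram(rows):
    """Linear-kernel Gram matrix computed from one party's own columns."""
    n = len(rows)
    return [[sum(a * b for a, b in zip(rows[i], rows[j])) for j in range(n)]
            for i in range(n)]

def keygen(p=1009, q=1013):
    """Toy Paillier key with generator g = n + 1 (tiny primes, NOT secure)."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    mu = pow(lam, -1, n)  # with g = n + 1, mu is the inverse of lam mod n
    return n, (lam, mu)

def encrypt(n, m):
    """Encrypt an integer 0 <= m < n under public key n."""
    n2 = n * n
    while True:
        r = random.randrange(1, n)
        if math.gcd(r, n) == 1:
            break
    return (pow(n + 1, m, n2) * pow(r, n, n2)) % n2

def decrypt(n, priv, c):
    lam, mu = priv
    n2 = n * n
    return ((pow(c, lam, n2) - 1) // n * mu) % n

# Hypothetical vertical partition: same 3 records, different feature columns.
party_a = [[1, 2], [3, 0], [0, 1]]
party_b = [[2], [1], [4]]

n, priv = keygen()

# Each party encrypts its local Gram matrix entry by entry.
enc_a = [[encrypt(n, v) for v in row] for row in local_gram(party_a)]
enc_b = [[encrypt(n, v) for v in row] for row in local_gram(party_b)]

# Multiplying ciphertexts adds the underlying plaintexts.
enc_sum = [[(ca * cb) % (n * n) for ca, cb in zip(ra, rb)]
           for ra, rb in zip(enc_a, enc_b)]

# The key holder decrypts only the aggregated matrix.
global_gram = [[decrypt(n, priv, c) for c in row] for row in enc_sum]

# Reference: Gram matrix of the joined (centralized) records.
full_gram = local_gram([a + b for a, b in zip(party_a, party_b)])
print(global_gram == full_gram)  # True
```

Who holds the private key and who aggregates the ciphertexts is left open here; the abstract states only that a semi-honest third party is involved. A real deployment would also need large primes and an encoding for negative or fractional feature values.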
Pages: 363-382
Page count: 20
Related papers
50 in total
  • [1] Privacy-preserving of SVM over vertically partitioned with imputing missing data
    Mohammed Z. Omer
    Hui Gao
    Nadir Mustafa
    [J]. Distributed and Parallel Databases, 2017, 35 : 363 - 382
  • [2] Privacy-preserving SVM classification on vertically partitioned data
    Yu, Hwanjo
    Vaidya, Jaideep
    Jiang, Xiaoqian
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 647 - 656
  • [3] Privacy-Preserving Outsourcing Scheme for SVM on Vertically Partitioned Data
    Qiu, Guowei
    Huo, Hua
    Gui, Xiaolin
    Dai, Huijun
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [4] Privacy-preserving DBSCAN clustering over vertically partitioned data
    Xu Wei-jiang
    Huang Liu-sheng
    Luo Yong-long
    Yao Yi-fei
    Jing Wei-wei
    [J]. MUE: 2007 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND UBIQUITOUS ENGINEERING, PROCEEDINGS, 2007, : 850 - 856
  • [5] Privacy-preserving decision trees over vertically partitioned data
    Vaidya, J
    Clifton, C
    [J]. DATA AND APPLICATIONS SECURITY XIX, PROCEEDINGS, 2005, 3654 : 139 - 152
  • [6] Privacy-Preserving Kth Element Score over Vertically Partitioned Data
    Vaidya, Jaideep
    Clifton, Christopher W.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (02) : 253 - 258
  • [7] Privacy-preserving collaborative filtering on vertically partitioned data
    Polat, H
    Du, WL
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 651 - 658
  • [8] Privacy-Preserving Logistic Regression on Vertically Partitioned Data
    Song, Lei
    Ma, Chunguang
    Duan, Guanghan
    Yuan, Qi
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (10) : 2243 - 2249
  • [9] Approximate Privacy-Preserving Data Mining on Vertically Partitioned Data
    Nix, Robert
    Kantarcioglu, Murat
    Han, Keesook J.
    [J]. DATA AND APPLICATIONS SECURITY AND PRIVACY XXVI, 2012, 7371 : 129 - 144
  • [10] Efficient and Privacy-Preserving Logistic Regression Prediction over Vertically Partitioned Data
    Zhao, Jiaqi
    Zhu, Hui
    Wang, Fengwei
    Lu, Rongxing
    Li, Hui
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 4253 - 4258