Privacy-preserving multi-party PCA computation on horizontally and vertically partitioned data based on outsourced QR decomposition

被引:1
|
作者
Jaberi, Mehrad [1 ]
Mala, Hamid [1 ]
机构
[1] Univ Isfahan, Fac Comp Engn, Dept Informat Technol Engn, Esfahan, Iran
来源
JOURNAL OF SUPERCOMPUTING | 2023年 / 79卷 / 13期
关键词
Principal component analysis; Secure computation; Somewhat homomorphic encryption; Secure outsourcing; PRINCIPAL COMPONENT ANALYSIS; MATRIX;
D O I
10.1007/s11227-023-05206-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining has received many applications in diverse areas such as banking, marketing, healthcare and fraud detection. One of the valuable tools in data mining is principal component analysis (PCA). Computing PCA over data belonging to several data owners with respect to their privacy is a need in many industries such as healthcare. Here, we propose a privacy-preserving multi-party protocol to compute PCA over horizontally and vertically distributed data using QR matrix decomposition and homomorphic encryption. Our protocol is the first privacy-preserving PCA computation scheme which is applicable for both horizontally and vertically partitioned data and finds all of the principal components. Our protocol is secure against collusion of the data owners in the semi-honest security model. In the performance analysis, we show that in the horizontal settings increasing the number of data owners will decrease the computation overhead of each of data owners, but it will increase the communication and the computation overhead of the server. We also show that the time consumption of using our proposed scheme on Australian data set of size 690 x 14, distributed horizontally among 50 data owners, is 4.38 s. On the Ionosphere data set of size 351 x 34, distributed horizontally among 10 data owners, it takes 31.8 s. In the vertical distribution, the time consumption of using our scheme on Gait data set of size 48 x 321 distributed among 7 data owners and on Gastrointestinal Lesions data set of size 76 x 698 distributed among 10 data owners is 4.4 h and 15.7 h, respectively.
引用
收藏
页码:14358 / 14387
页数:30
相关论文
共 50 条
  • [1] Privacy-preserving multi-party PCA computation on horizontally and vertically partitioned data based on outsourced QR decomposition
    Mehrad Jaberi
    Hamid Mala
    [J]. The Journal of Supercomputing, 2023, 79 : 14358 - 14387
  • [2] Privacy-preserving SVM classification on horizontally partitioned data with secure multi-party computation
    Hu, Yunhong
    Fang, Liang
    He, Guoping
    [J]. Journal of Information and Computational Science, 2009, 6 (06): : 2341 - 2348
  • [3] Privacy-preserving SVM on Outsourced Genomic Data via Secure Multi-party Computation
    Chen, Huajie
    Uenal, Ali Burak
    Akguen, Mete
    Pfeifer, Nico
    [J]. PROCEEDINGS OF THE SIXTH INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS (IWSPA'20), 2020, : 61 - 69
  • [4] Scalable Multi-Party Privacy-Preserving Gradient Tree Boosting over Vertically Partitioned Dataset with Outsourced Computations
    Edemacu, Kennedy
    Kim, Jong Wook
    [J]. MATHEMATICS, 2022, 10 (13)
  • [5] Privacy-Preserving PCA on Horizontally-Partitioned Data
    Al-Rubaie, Mohammad
    Wu, Pei-yuan
    Chang, J. Morris
    Kung, Sun-Yuan
    [J]. 2017 IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING, 2017, : 280 - 287
  • [6] Efficient multi-party privacy preserving data mining for vertically partitioned data
    Sharma, Surbhi
    Shukla, Deepak
    [J]. 2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 2, 2016, : 189 - 195
  • [7] Outsourced Privacy-Preserving Data Alignment on Vertically Partitioned Database
    Wang, Zhuzhu
    Hu, Cui
    Xiao, Bin
    Liu, Yang
    Li, Teng
    Ma, Zhuo
    Ma, Jianfeng
    [J]. IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (05) : 1408 - 1419
  • [8] Privacy-Preserving Distributed k-Nearest Neighbor Mining on Horizontally Partitioned Multi-Party Data
    Zhang, Feng
    Zhao, Gansen
    Xing, Tingyan
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 755 - +
  • [9] Privacy-preserving Multi-party Analytics over Arbitrarily Partitioned Data
    Mehnaz, Shagufta
    Bertino, Elisa
    [J]. 2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, : 342 - 349
  • [10] Privacy-Preserving Query Processing by Multi-Party Computation
    Sepehri, Maryam
    Cimato, Stelvio
    Damiani, Ernesto
    [J]. COMPUTER JOURNAL, 2015, 58 (10): : 2195 - 2212