Privacy-preserving Data Classification and Similarity Evaluation for Distributed Systems

Cited by: 14

Authors:

Jia, Qi [1]

Guo, Linke [1]

Jin, Zhanpeng [1]

Fang, Yuguang [2]

Affiliations:

[1] Binghamton Univ, Dept Elect & Comp Engn, Binghamton, NY 13902 USA

[2] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA

Source:

PROCEEDINGS 2016 IEEE 36TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2016 | 2016

Keywords:

Privacy Preservation; Data Classification; Similarity Evaluation; Machine Learning;

DOI:

10.1109/ICDCS.2016.94

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Data classification is a widely used data mining technique for big data analysis. By training on massive data collected from the real world, data classification helps learners discover hidden data patterns. Beyond training, a user holding a model trained on collected data can determine whether a newly arriving data item belongs to an existing class, and multiple distributed entities can collaborate to test the similarity of their trained results. However, because of data locality and privacy concerns, it is infeasible for entities in large-scale distributed systems to share their individual datasets with one another for similarity checking. On the one hand, the trained model is an entity's private asset and may leak private information, so it should be protected from all non-collaborating entities. On the other hand, newly arriving data may contain sensitive information that cannot be disclosed directly for classification. To address these privacy issues, we propose a privacy-preserving data classification and similarity evaluation scheme for distributed systems. With our scheme, neither newly arriving data nor trained models are directly revealed during the classification and similarity evaluation procedures. The proposed scheme can be applied to many fields that use data classification and evaluation. Based on extensive real-world experiments, we have also evaluated the privacy preservation, feasibility, and efficiency of the proposed scheme.


Pages: 690-699

Page count: 10
