Privacy-preserving Data Classification and Similarity Evaluation for Distributed Systems

被引:14
|
作者
Jia, Qi [1 ]
Guo, Linke [1 ]
Jin, Zhanpeng [1 ]
Fang, Yuguang [2 ]
机构
[1] Binghamton Univ, Dept Elect & Comp Engn, Binghamton, NY 13902 USA
[2] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
关键词
Privacy Preservation; Data Classification; Similarity Evaluation; Machine Learning;
D O I
10.1109/ICDCS.2016.94
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data classification is a widely used data mining technique for big data analysis. By training massive data collected from the real world, data classification helps learners discover hidden data patterns. In addition to data training, given a trained model from collected data, a user can classify whether a new incoming data belongs to an existing class; or, multiple distributed entities may collaborate to test the similarity of their trained results. However, due to data locality and privacy concerns, it is infeasible for large-scale distributed systems to share each individual's datasets with each other for data similarity check. On the one hand, the trained model is an entity's private asset and may leak private information, which should be well protected from all other non-collaborative entities. On the other hand, the new incoming data may contain sensitive information which cannot be disclosed directly for classification. To address the above privacy issues, we propose a privacy-preserving data classification and similarity evaluation scheme for distributed systems. With our scheme, neither new arriving data nor trained models are directly revealed during the classification and similarity evaluation procedures. The proposed scheme can be applied to many fields using data classification and evaluation. Based on extensive real-world experiments, we have also evaluated the privacy preservation, feasibility, and efficiency of the proposed scheme.
引用
收藏
页码:690 / 699
页数:10
相关论文
共 50 条
  • [21] Distributed Privacy-preserving Data Mining Method Research
    Chen, Qi
    [J]. 2011 AASRI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INDUSTRY APPLICATION (AASRI-AIIA 2011), VOL 2, 2011, : 88 - 90
  • [22] An effective distributed privacy-preserving data mining algorithm
    Fukasawa, T
    Wang, JH
    Takata, T
    Miyazaki, M
    [J]. INTELLIGENT DAA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 320 - 325
  • [23] Privacy-preserving Naive Bayes classification in semi-fully distributed data model
    Duy-Hien Vu
    [J]. COMPUTERS & SECURITY, 2022, 115
  • [24] Privacy-preserving distributed clustering
    Erkin, Zekeriya
    Veugen, Thijs
    Toft, Tomas
    Lagendijk, Reginald L.
    [J]. EURASIP JOURNAL ON INFORMATION SECURITY, 2013, (01):
  • [25] A Distributed Anonymization Scheme for Privacy-preserving Recommendation Systems
    Luo, Zhifeng
    Chen, Shuhong
    Li, Yutian
    [J]. PROCEEDINGS OF 2013 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2012, : 491 - 494
  • [26] Privacy-preserving similarity evaluation and application to remote biometrics authentication
    Kikuchi, Hiroaki
    Nagai, Kei
    Ogata, Wakaha
    Nishigaki, Masakatsu
    [J]. SOFT COMPUTING, 2010, 14 (05) : 529 - 536
  • [27] EsPRESSO: Efficient privacy-preserving evaluation of sample set similarity
    Blundo, Carlo
    De Cristofaro, Emiliano
    Gasti, Paolo
    [J]. JOURNAL OF COMPUTER SECURITY, 2014, 22 (03) : 355 - 381
  • [28] Privacy-Preserving Similarity Evaluation and Application to Remote Biometrics Authentication
    Kikuchi, Hiroaki
    Nagai, Kei
    Ogata, Wakaha
    Nishigaki, Masakatsu
    [J]. MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5285 : 3 - +
  • [29] Privacy-preserving similarity evaluation and application to remote biometrics authentication
    Hiroaki Kikuchi
    Kei Nagai
    Wakaha Ogata
    Masakatsu Nishigaki
    [J]. Soft Computing, 2010, 14 : 529 - 536
  • [30] Toward Transparent and Accountable Privacy-Preserving Data Classification
    Zhao, Yanqi
    Yu, Yong
    Chen, Ruonan
    Li, Yannan
    Tian, Aikui
    [J]. IEEE NETWORK, 2021, 35 (04): : 184 - 189