Anonymizing k-NN Classification on MapReduce

被引:5
|
作者
Bazai, Sibghat Ullah [1 ]
Jang-Jaccard, Julian [1 ]
Wang, Ruili [1 ]
机构
[1] Massey Univ, Inst Nat & Math Sci, Auckland, New Zealand
来源
MOBILE NETWORKS AND MANAGEMENT (MONAMI 2017) | 2018年 / 235卷
关键词
MapReduce; Data anonymization; K-anonymity; k-NN classification; PRIVACY;
D O I
10.1007/978-3-319-90775-8_29
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data analytics scenario such as a classification algorithm plays an important role in data mining to identify a category of a new observation and is often used to drive new knowledge. However, classification algorithm on a big data analytics platform such as MapReduce and Spark, often runs on plain text without an appropriate privacy protection mechanism. This leaves user's data to be vulnerable from unauthorized access and puts the data at a great privacy risk. To address such concern, we propose a new novel k-NN classifier which can run on an anonymized dataset on MapReduce platform. We describe new Map and Reduce algorithms to produce different anonymized datasets for k-NN classifier. We also illustrate the details of experiments we performed on the multiple anonymized data sets to understand the effects between the level of privacy protection (data privacy) and the high-value insights (data utility) trade-off before and after data anonymization.
引用
收藏
页码:364 / 377
页数:14
相关论文
共 50 条
  • [31] A New Method For Selection Optimum k Value In k-NN Classification Algorithm
    Maleki, Masoud
    Eroglu, Kubra
    Aydemir, Onder
    Manshoori, Negin
    Kayikcioglu, Temel
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [32] On k-NN method with preprocessing
    University of Information Technology and Management, H. Sucharskiego 2, 35-225 Rzeszow, Poland
    不详
    Fundam Inf, 2006, 3 (343-358):
  • [33] On the Merge of k-NN Graph
    Zhao, Wan-Lei
    Wang, Hui
    Lin, Peng-Cheng
    Ngo, Chong-Wah
    IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (06) : 1496 - 1510
  • [34] Fuzzy k-NN SVM
    Cheng, Hui-Chuan
    Yang, Chan-Yun
    Jan, Gene Eu
    Chen, Angela Shin-yih
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 1227 - 1232
  • [35] Trajectory Clustering and k-NN for Robust Privacy Preserving k-NN Query Processing in GeoSpark
    Dritsas, Elias
    Kanavos, Andreas
    Trigka, Maria
    Vonitsanos, Gerasimos
    Sioutas, Spyros
    Tsakalidis, Athanasios
    ALGORITHMS, 2020, 13 (08)
  • [36] Moderating k-NN classifiers
    Alkoot, FM
    Kittler, J
    PATTERN ANALYSIS AND APPLICATIONS, 2002, 5 (03) : 326 - 332
  • [37] On k-NN method with preprocessing
    Suraj, Z
    Delinnata, P
    FUNDAMENTA INFORMATICAE, 2006, 69 (03) : 343 - 358
  • [38] Moderating k-NN Classifiers
    Fuad M. Alkoot
    Josef Kittler
    Pattern Analysis & Applications, 2002, 5 : 326 - 332
  • [39] GENERALIZATION OF K-NN RULE
    TOMEK, I
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1976, 6 (02): : 121 - 126
  • [40] Detection of Cancer in Lung With K-NN Classification Using Genetic Algorithm
    Bhuvaneswari, P.
    Therese, A. Brintha
    2ND INTERNATIONAL CONFERENCE ON NANOMATERIALS AND TECHNOLOGIES (CNT 2014), 2015, 10 : 433 - 440