Anonymizing k-NN Classification on MapReduce

Cited by: 5
Authors
Bazai, Sibghat Ullah [1]
Jang-Jaccard, Julian [1]
Wang, Ruili [1]
Affiliation
[1] Massey Univ, Inst Nat & Math Sci, Auckland, New Zealand
Source
MOBILE NETWORKS AND MANAGEMENT (MONAMI 2017) | 2018, Vol. 235
Keywords
MapReduce; Data anonymization; K-anonymity; k-NN classification; Privacy
DOI
10.1007/978-3-319-90775-8_29
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Data analytics tasks such as classification play an important role in data mining: a classifier identifies the category of a new observation and is often used to derive new knowledge. However, classification algorithms on big data analytics platforms such as MapReduce and Spark often run on plain text without an appropriate privacy protection mechanism. This leaves users' data vulnerable to unauthorized access and puts it at great privacy risk. To address this concern, we propose a novel k-NN classifier that can run on an anonymized dataset on the MapReduce platform. We describe new Map and Reduce algorithms that produce different anonymized datasets for the k-NN classifier. We also detail the experiments we performed on multiple anonymized datasets to understand the trade-off between the level of privacy protection (data privacy) and the high-value insights (data utility) before and after data anonymization.
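As a rough illustration of the idea summarized in the abstract, the following Python sketch simulates a map step and a reduce step for k-NN over generalized (coarsened) numeric attributes. It is not the authors' algorithm: the function names (`generalize`, `map_phase`, `reduce_phase`, `knn_classify`), the fixed-width interval generalization, and the Euclidean distance on interval midpoints are all assumptions made for this example, and fixed-width generalization alone does not guarantee k-anonymity.

```python
from collections import Counter
import heapq


def generalize(value, width):
    """Hypothetical anonymization step: replace a numeric value with the
    midpoint of the fixed-width interval that contains it."""
    low = (value // width) * width
    return low + width / 2


def map_phase(train_split, query, width):
    """Map: for each training record in one split, emit (distance, label),
    with distances computed on generalized values only."""
    q = [generalize(v, width) for v in query]
    for features, label in train_split:
        g = [generalize(v, width) for v in features]
        dist = sum((a - b) ** 2 for a, b in zip(g, q)) ** 0.5
        yield dist, label


def reduce_phase(pairs, k):
    """Reduce: keep the k nearest neighbours and take a majority vote."""
    nearest = heapq.nsmallest(k, pairs, key=lambda p: p[0])
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def knn_classify(splits, query, k, width):
    """Run the map step over every split, then a single reduce step."""
    pairs = [p for split in splits for p in map_phase(split, query, width)]
    return reduce_phase(pairs, k)


# Toy data: two input splits, as a MapReduce job might see them.
splits = [
    [((25, 50), "A"), ((27, 52), "A")],
    [((60, 90), "B"), ((62, 95), "B"), ((58, 88), "B")],
]
print(knn_classify(splits, query=(26, 51), k=3, width=10))
```

A larger `width` hides more detail about each record but makes distinct records collapse onto the same generalized point, which is one simple way to see the privacy versus utility trade-off the abstract refers to.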
Pages: 364-377
Number of pages: 14