Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation

被引:74
|
作者
Gallego, Antonio-Javier [1 ]
Calvo-Zaragoza, Jorge [1 ]
Valero-Mas, Jose J. [1 ]
Rico-Juan, Juan R. [1 ]
机构
[1] Univ Alicante, Dept Lenguajes & Sistemas Informat, Carretera San Vicente Raspeig S-N, Alicante 03690, Spain
关键词
Efficient kNN classification; Clustering; Deep neural networks; ALGORITHMS; SELECTION;
D O I
10.1016/j.patcog.2017.09.038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While standing as one of the most widely considered and successful supervised classification algorithms, the k-nearest Neighbor (kNN) classifier generally depicts a poor efficiency due to being an instance-based method. In this sense, Approximated Similarity Search (ASS) stands as a possible alternative to improve those efficiency issues at the expense of typically lowering the performance of the classifier. In this paper we take as initial point an ASS strategy based on clustering. We then improve its performance by solving issues related to instances located close to the cluster boundaries by enlarging their size and considering the use of Deep Neural Networks for learning a suitable representation for the classification task at issue. Results using a collection of eight different datasets show that the combined use of these two strategies entails a significant improvement in the accuracy performance, with a considerable reduction in the number of distances needed to classify a sample in comparison to the basic kNN rule. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:531 / 543
页数:13
相关论文
共 50 条
  • [21] Scalable Evidential K-Nearest Neighbor Classification on Big Data
    Gong, Chaoyu
    Demmel, Jim
    You, Yang
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (03) : 226 - 237
  • [22] Effective k-nearest neighbor models for data classification enhancement
    Ali A. Amer
    Sri Devi Ravana
    Riyaz Ahamed Ariyaluran Habeeb
    Journal of Big Data, 12 (1)
  • [23] Microarray Data Classification using Fuzzy K-Nearest Neighbor
    Kumar, Mukesh
    Rath, Santanu Ku
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 1032 - 1038
  • [24] k-Nearest Neighbor based Clustering with Shape Alternation Adaptivity
    Lu, Yifeng
    Zhang, Yao
    Richter, Florian
    Seidl, Thomas
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [25] A MapReduce-based k-Nearest Neighbor Approach for Big Data Classification
    Maillo, Jesus
    Triguero, Isaac
    Herrera, Francisco
    2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 2, 2015, : 167 - 172
  • [26] Rates of Convergence for Large-scale Nearest Neighbor Classification
    Qiao, Xingye
    Duan, Jiexin
    Cheng, Guang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [27] wSparse Coefficient-Based k-Nearest Neighbor Classification
    Ma, Hongxing
    Gou, Jianping
    Wang, Xili
    Ke, Jia
    Zeng, Shaoning
    IEEE ACCESS, 2017, 5 : 16618 - 16634
  • [28] A Grid-based k-Nearest Neighbor Join for Large Scale Datasets on MapReduce
    Jang, Miyoung
    Shin, Young-Sung
    Chang, Jae-Woo
    2015 IEEE 17TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2015 IEEE 7TH INTERNATIONAL SYMPOSIUM ON CYBERSPACE SAFETY AND SECURITY, AND 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (ICESS), 2015, : 888 - 891
  • [29] Rehub: Extending Hub labels for reverse k-nearest neighbor queries on large-scale networks
    Efentakis A.
    Pfoser D.
    ACM Journal of Experimental Algorithmics, 2016, 21 (01):
  • [30] Network Transmission Flags Data Affinity-based Classification by K-Nearest Neighbor
    Aljojo, Nahla
    ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY, 2022, 10 (01): : 35 - 43